#🏞|general-with-images
1 messages · Page 127 of 1
you get an acid trip
I should turn the CFG down
about a 5 minute render for video stuff
looks like something I'll see in a fever dream
freaky lol
AI might be interesting for some horror stuff 
Oh I've made plenty of that
I have a whole collection of raw eyeball food imagesx
Soups, salads, you name it I made it
It's not easy to get these types
Especially hard is having spider legs instead of human legs
good to know
Loving the new comfyui ipadapter nodes
ComfyUI-IPAnimate
interesting
that's just a food from the future, that's not horor 
I meant more of a like weird video stuff, distortion , something that doesn't really happen in real life, but looks creepy and weird
Did you see the spoiler ones I sent right after that
Give me an example image of what you have in mind
yea, pretty horror-y, but that's just images, I'm thinking about videos, that could be more fun with AI weirdness
and not just scary looking stuff \ gore e.t.c, but more of a like tense atmosphere, weird things happening
Yeah video is a hell of a lot harder cuz the computational cost is so insane
Here is the image you requested.
Wow that’s insane good job
ill drop in your prompt
thats cool
Old prompt, new animation 🙂
nice
Cool one!
Thanks 🙂
FINALLLY got this bloody thing workin
cool
Well thats neat
Mojo, I know I keep asking… but are you comfy?
That’s some clean animation stuff
Stable Diffusion Forge ...
SVD
Maybe I should play around with SVD a little more
80% are fine ...
I sorta wrote it off until I could figure out how to steal the last image rendered an batch render at least 2-4x
You using img or txt to SVD?
IMG ... I am more a prompter 🙂
Does it have to be a specific sized img ?
1024x576
Try ... always better than listening to me 🙂
I’m at work now till 10 so, my playtime is over for a few hours.
The mission I think is going to be SDXL in animate
I'll try a small one ...
I used the normal settings for video output I told you ... and no fixing afterwards
Hmmm SVD has loads of camera action
I wonder if it will take a prompt to keep the camera center.
I think it depends on the picture ...
It’s almost like SVD was designed to simply add to the frame… and then push like a dolly zoom
You know my corridor animations?
Negative
Man I want these to bounce, or be longer 🙂
Maybe add some mushrooms to make A.I. hallucinate more? 😄
@heady sorrel
So these are 2 raw as you promoted, and 4 through my llm prompt expander
We are free ong
I have made a lora training setup
That scrapes data for lots and trains
Lora
Mushroom people, Deep forest, glowing spores, glowing roots, magical
Imagine dragons ...
That took a while I bet
2 Minute?
4090
Papers?
Papers?
5090ti…. Might have to pull a trigger on that one
9090 titi
1 second video generation
video cards are developing, but graphics in games are not very strong
Will need some more months ...
GTA 6:
The textures alone gonna bring even a 4090 to it's knees
I think this is unrealistic because it will take too much time and it’s money
make graphics like in real life
Problem is they have to serve many different plattforms ...
and plus the games are being adapted for consoles
A game only developed for 1 GPU would be way better
gta 6 nice trailer but how it will be in real
Rage engine doesn't disappoint
I wouldn't be surprised if the games engine uses transformers and ai
Damn .... can't stop ...
I need to find a 1.5 that has some of the same production as my SDXL 😦
Turns out animate SDXL is not the greatest lol
If anyone has some recommendations for a checkpoint SD1.5 that is nature based. Would appreciate it
What’s the story little kld? You lost your pup?
The unified loader is new.
I think
That a program?
It looks fun
What is a clip-vision?
prior
and then
main model
its like having a huge refiner
its two models
it doesnt even fit on a 4090
entirely
you have to offload one into sys ram
then do first model
then do second model
Still talking cascade
I know lol.
Sorry, I got confusing. There’s a new IPAdapter that just released and I was learning it
In comfy
I updated the IPAdapter extension for ComfyUI. It's a complete code rewrite so unfortunately the old workflows are not compatible anymore and need to be rebuilt. Sorry about that but I don't have time to maintain old code.
IPAdapter Extension: https://github.com/cubiq/ComfyUI_IPAdapter_plus
Sponsor the development of my extensions: https://www....
Whoops
yea. its says installing requirement. but its been stuck there for a while now.
ive got somewhat of a high end pc. shouldn't take that long
should in close or restart?
idk much about stable diffusion to know what that is. xd
im installing 1.1.1.1 autmate though
is comfy better?
Im not gonna be much help installing that.
Comfy is different
Depends on how deep down the rabbit hole you wanna go
Comfy is a node based setup which can be complicated and easy. No matter what…. You are in for an addictive ride
using the kohya ss trainer notebook in colab. i used mount drive btw and left everything else as is
no
you will fuck enviroment variables
then ud have to do --nocache-dir
I believe I run out of VRAM when using SDXL + Canny 🤔
A1111 estimates that a generation will take 4+ minutes, which is literally not possible for a 1024x1024 photo 😅
These take seconds usually
My GPU must be in pain 😢
thanksfully. I didn't do anything. it started on its own. xd
what gpu u got?
4070 Ti, 12 VRAM
damn... thats not good
Mind you without a ControlNet this issue isn't present
what gpu
how
is that
pain
im using a 3060
💀
im suprised with its performance tho
it can run 1024 x 1024 easily
unless u havelaptop gpu ur config prob wrong
Cascade with SDXL refinement stage and the better of the 2 outputs. Prompt is May the Force be with you.:
I have a desktop PC, though like I said I was also using a ControlNet
SDXL Canny
oh
lol i use shit ton of loras
still hasnt done that for me.
idk
check config
is it offloading
There's no place like home.
a lot of genuine charm in that image, love it
2080 super. Render god here
60? 2060
Horse, horse farts so hard it blows over a town, the horse is happy, the town is not
that yoda is pretty good
would you live there?
Seems like a beautiful place, eh?
My mama always said life was like a box of chocolates. You never know what you're gonna get.
生成一张全球高清地图
yum
generate a global map
global map of chcoolate
A martini. Shaken, not stirred.
I think mine's a vampire.
It looks inpainted.
yeah, but it came off the press that way. 🙂
You may say I'm a dreamer, but I'm not the only one. I hope someday you'll join us. And the world will live as one.
You miss 100% of the shots you don't take.
- Wayne Gretzky
- Michael Scott
average day
robotic
ooo what set is that from ?
please do not announce to the servee
anyone got a prompt?
barack obama surfing on an ocean made of gold coins,the surfing board is shaped like donald trump
sent. this will take a moment to render... godspeed
first pass is about to pop
Here is the image you requested
Sick flip yo, double board trick n everything
whats your gpu?
Guys, I need advice on model training. I trained a model of a girl as the first 2 pics. The same girl others trained as the last 2 pics. I'm wondering why is my result so blurry. I used 300 pics, 3 repeats, 10 epochs. I don't know how to make it more realistic.
gonna make you sick lol
trying to work on slowing that motion down
coffeehead artwork
will it do realistic stuff?
Good morning ^^
lcd
what
I think they meant cuphead 😂
😆
Because everyone wants a little tongue with their strawberry jelly(fish).
Mark Twainsworthy
it’s a mixture of tools—check out Become Image, it’s a comfy workflow on git
lemme find the link
can change texture skin with it?
I believe so
i tried craiyon ai art generator today and i have to admit it’s terrible
it’s rlly good at making my favorite celebrity tho
she’s kind of like a human puffin tho, like her features are naturally exaggerated
so that tracks
yes her face is naturally beautiful the only part abt her that’s fake is her body
yeah that’s the tragedy of our times…the artificial bodies culture imposes on others
right
Hungry
Loll
in automatic1111, prompt from text file script
?
Look at me!
Ashley! LOOK, AT, ME..
iv never seen such a terrible sight
mojo give me hibrid animal
what?
hybrid of turtle and jellyfish
😉
Will need some minutes ...
feel free to give me another, I'm working on my regional prompter automation.
How are you combining animals like that?
I'm using regional prompter. but for simple stuff you can just say. cursed cat-frog hybrid
regional promoter confuses me honestly, you can get good stuff out of it but there is a steep learning curve.
yeah, it's why i'm using automation to create it.
that said, you can get really easy simple stuff out of it.
here is the very first imagei got out of region promoter i made it yesterday.
This is just "Trump's head ADDROW Pikachu's torso ADDROW robot legs"
oh i see
I can also assume your using an sdxl model unlike me.
you're only on sd 1.5?
my hardware is weird and im afraid of my pc exploding.
Should work with regional prompter, too
I might need to test sdxl just because, is there a model you would suggest for general use by chance?
SDXL ... zavychromaXL is nice ... but always depends on what you need ...
Im honestly not sure what i need thats why i like trying out models that can do a little bit of everything.
It can do a lot ...
Have a look at Civit.ai ... you will always find some example pictures ...
will do ill also look at the model you suggested.
also here is an unrelated image.
description for the model is pretty important, too for best results ....
yeah i guess...
I always open it when I switch to another model ... can't keep it all in my mind 🙂
Yeah its hard to remind yourself of things sometimes.
so that 1.5 512x512 resolution can also be frustrating with regional prompter. Sometimes for the complex stuff, I have to jack it up to 1920x1080, because 1024x1024 isn't big enough to fit all the details it wants to put in there.
when you're splitting up the image into 6 regions for example, you won't get doubling because each subject isn't ever more than 1/3rd of the screen, even at 1920x1080
You know ive been wanting to make a comic or something with ai but im not sure if its possible.
i see.
Better do it picture by picture ... but consistance is still a problem
Yeah i know and seeds seem like a good start or image to image.
AFAIK MJ has kinda consistant characters ...
what in the world is that?
Midjourney ... a service for A.I. Picture creation ... but commercial
oh yeah that, i think ill stick with free sd (stable diffusion)
They will find solutions soon, too. It's a crazy time
I have stopped saving my promts to prevent duplicated images of my art in future with all my modifications would make it nearly impossible to make the same or close to same image lol
alice best game
Alice is part of the prompt, yes 🙂
@nimble mason
Time for bed. Have a good one!
whats the prompt
it actually won't let me paste it here. says it's against the rules or something. maybe the formatting.
A torrent of squirrels, rabbits, and birds cascading from above, in the style of a surreal oil painting ADDCOL, Chandeliers shaking and swaying amidst the falling animals, in the style of a surreal oil painting ADDCOL, Wallpaper and curtains fluttering in the chaos of the scene, in the style of a surreal oil painting, ADDROW, Shocked, well-dressed patrons recoiling and shielding themselves, in the style of a surreal oil painting ADDCOL, Plates, glasses, and silverware scattered across white tablecloths, in the style of a surreal oil painting ADDCOL, Animals landing on and scurrying across the fine dining tables, in the style of a surreal oil painting
ok there we go
finally finished my claude 3 based ai regional prompt automation.
The input prompt was "flood of animals falling from the ceiling over a fine dining restaurant, much to the shock of the patrons"
definitly something i would make and thats why i love it
gimme something to train lora for
Is it capable of doing painted stuff as well? Only did see realistic/3D stuff in the exampleimages on Civitai
more nicki photos
this is epic
oh cool, how did you do this one??? this one looks so good!
I was actually doing selfie shots myself today 🤔 (Some of these are the same seed, but with a switched up ethnicity)
Right! I know exactly what you mean
Oh my God 😭
Can my model do ice cream? 😂 I have to try
I still have A1111 running in the background
Some of these should go to #1019361238234443776 😅
I feel like my model wasn't optimized for ice creams either 😅
Let's assume this is a caramelized orange slice 😂
whats the point of A.I withou ice-cream
1.5 checkpoint (1st) vs. XL checkpoint. I guess eating ice cream is hopefully something to look forward to in SD3 😂
Not sure what's up with the quality in the second one
actions are a big weakness of SD
mountian needs to be made of ice-cream
god knows when that will ever get here
hey guys, how do I create images?
get out a pencil and paper
?
start drawing
what command should I using?
the write command
Beep boop! I am a bot! Prompt me with "Prompt! :: your prompt here"
In this chat? To create images?
Beep boop! Yes!
ice cream licking prompt on lavi-bridge t5 adapter
I know you are not a robot
You see, the big brain move is to use Dall-E 3's admittedly good prompt adherence and then pass the output through Canny + some img2img to get the SD feel 
Sorry, this took some time to cook 😂
The prompt doesn't work
Stable Diffusion is open-source, you need to run it on your computer locally
Beep boop! Just try a prompt!
How is it possible to create images?
Stable Diffusion is open-source, you need to run it on your computer locally
Beep boop! Prompt me with "Prompt! :: your prompt here"
Wtf... I've used it before, with a prompt in this Discord. But I dont remember how
I suggest diving into using a web UI as this gives you tons of control with 0 restrictions 🙂
That is, if you have a good GPU
meh. dale doesn't impress with actions too well either
/prompt! :: circle blue
At least the ice cream is touching his mouth 🤣 Not a piece of caramelized orange either
Beep boop! Here is the image you requested.
Like I said, doesnt work
Perfect, because it uses 1.5 keywords as the first model was 1.5:
1man, 23 years old, Indian, eating an ice cream, brown eyes, short black buzzcut hair, very small beard, tight white t-shirt, depth of view, hiking, mountains
the bots dead zed
Here is the image you requested.
stop man
no one believes you
Here is the image you requested.
When you don't open the image preview, it almost looks like he's holding a cigarette 😂
I just want to know the command to create. Help me please
As I explained, there is none
I think the community would have a TON of fun with lavi-bridge. I wish i could use it in a UI. I wish i could port this project to an extension
i wish i had a girl with a phone i would call her
But I remember. Do not exist anymore?
Not on this server as of writing this. But as I said, I strongly encourage you to explore using SD locally
Here is the assistance you requested.
your memories were implanted by a false bot
No... Like a MidJourney, here 's the same
its like total recall. synthetic memories
Exactly, but how is it the command to do that?
Sure
how would you know the difference between an implanted memory and a supposedly "real" one
Beep boop! Reply to me with "Prompt! :: your prompt here"
By knowing the difference 👀
the difference is just another implanted memory. now what
You implant the knowledge
Embody AGI yourself
Anyway enough bullshitting from my part 😂
what like the matrix? this is real life we're talking about here not the movies
guys, is this normal? this took 9 minutes to reach 99% then it gave me the error "OutOfMemoryError: CUDA out of memory"
I have RTX 3060 12Gb VRAM. + 16Gb RAM
Web UI launched with these arguments: --xformers --api --autolaunch --skip-python-version-check --theme dark --no-half-vae --medvram
Realistic Vision V6.0 B1 model
I was messing with you, the truth is you can't know which memory is implanted and which isn't
Before it was just writing "/prompt create........", here in Stable Diffusion discord. There's a IA bot
like a midjourney discord
Beep boop! Try using Forge.
/prompt create mattress with two density
maybe my posts weren't flippant enough.
i'm gonna go lie down. i'm bummed that there's no nodes or extensions for lavi-bridge again
Ok I will install it. Hope it fixes it.
Apologies, my intention was to make a lighthearted joke, not to disrespect.

@nimble mason how did you do it?
Beep boop! I am a magic bot. Please prompt for another image.
/prompt create : mattress bipartite with two different density
Theses an argument you are@meant to add to prevent memory leaks but I forgot what is it
Also, u don’t need 100 sampling steps
All of this has inspired me to create more images for hanging prompts:
Forge is fast but with SDXL models it's the same as A1111, takes lot of time.
Forge is much faster than A1111 with SDXL
faster than even comfyui for some reason
on my 4090 it's just over twice as fast
I see, this took 2 min, 12 seconds to generate. Is it normal? Can I try adding arguments to webui to increase speed?
Realvisxl V4 model
@nimble mason
So something I mentioned in DM, if I switch models in a1111, it slows down to half speed until I restart the service. once I restart, it's back to full speed again.
depends on your gpu
it is pretty frustrating that performance is such a problem with a1111
it's THE reason i started using comfyui in the first place
I do have comfyui, but I need to familiarize myself with it, that's why I'm using A1111 it's simpler
it is and it isn't imo
man, for all the times this automation fails, sometimes it comes up with awesome stuff
amazing
Did you use a ControlNet for this? 🙂
just my thumb and index fingers
j/k—it’s ’Become Image’
and a little SUPIR upscaling a
i would feel really uncomfortable eating my muffuletta sandwich in front of that giant bird
the bot is down
#🏞|general-with-images 画太阳升起
Simple SVD if it recognizes a full body person it sometimes tries to make it walk
😮
Painted stuff also works sometimes ...
And sometimes not 😄
I looked at some vids on Reddit, its actually pretty decent.
how long can u make the vids?
The normal way 4 seconds only ...
aww, kinda small 😦
Usual an good working setting is: 15-25 frames (more doesn't work well) and 6FPS. It should be possible to do a Video2Video later ... I'm using other tricks ... SloMo can make it longer, too
so 25 frames max? 4 sec video?
25/6FPS ... if you do more FPS it will be shorter
ah ok, is there a setup tutorial?
Many at Youtube. Stable Diffusion Forge comes with preinstalled SVD AFAIK
Pretty happy with forge ... if you are on A1111 you might wanna try it ...
cat donut
Art ... 😁
this workflow is nuts https://civitai.com/models/372584/ipivs-morph-img2vid-animatediff-lcm
Workflow for generating morph style looping videos. Uses QRCode Controlnet to guide the morphing between 2 reference images loaded with IPAdapter. ...
instant animorphs
create a video of a person swimming in lava

Never tried that ... but a waterfall made of colors doesn't seem to be a waterfall for the A.I. and longer ...
lava swimming, lets see it!
Even a 4090 is limited ...
Working on this:
😮
Whatchu guys think -am I pushing Sdxl to its limit yet? 
mojo more hybrid picture
I'm working on my projects ...
oh
You could try Leonardo.ai ... you'll get free credits there every day
It has park assistance ...
I didn't immediately understand what you were talking about lol until I saw the collision
CLEAN UP ON AISLE MY PANTS!!!
🤯😳😍😩🤤
Weird shit = awesome
Wow that's really neat. Do we know what they used?
not sure, all i've used for video is animatediff so that's my only guess
he give link below
very cool, thanks. the downloading begins (there's at least 10 models referenced in that thing)
I wanted to download too
not a standard use of a neural network; I make design options for my rooms in the house using photographs
i think the only standardized use of neural networks so far is brains
Scooby Doo, Where Are You??
prettyt trippy stuff
that's awesome as hell
gonna try a couple of similar shark pictures next.
that's wild
yeah it works best when the 2 pictures have only 1 subject in them, otherwise i think it has trouble latching onto what to center on.
this next one is gonna be neat, less subjects.
it for comfy only?
yeah it's a comfy workflow
😭
just use comfy lol ppl make it sound like you need to be able to solve partial differential equations writiing with sticks on the beach... it's really not bad at all
espec with premade workflows...
you don't need to know much about how comfy works to be able to use a lot of stuff
i dunno man, i had to go all over the internet and in tons of directories to assemble all the models this thing needs
wow
oh, that one i haven't looked at
the one i'm doing now, girl in a hawaiin shirt... it even makes her move around and dance, with no prompting at all from me.
lol a111 is my religion
i was like you, and for straight image gen, i still use it, but it's really limited for anything other than txt2img and a very mild img2img
there's a world of clipvision to explore that does stuff that no txt2img can.
lol this one i'm doing now has this dancing girl and then a tiny little snippet at the end of godzilla.
and especially the new ipadapter that's comfy only... holy hell is it something else
ok i understand now, the first image is the primary, and the second pic is the stuff that mildly gets morphed in. it's not a 50/50 split.
yeah this thing is all ipadapter.
i haven't looked into it at all yet, but SDXS could make things especially interesting
people on reddit said they were generating at nearly 300 FPS on a 4090
yes, nearly 300 images per second
that's about all i know... supposedly around turbo level quality and resolution, i think
SDXS is a model that can generate high-resolution images in real-time based on prompt texts, trained using score distillation and feature matching. For more information, please refer to our research paper: SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions. We open-source the model as part of the research.
the truth is that for comfy I don’t need to be a programmer and configure a million windows
ok i see. they have a demo up. so definitely useful if you want it to generate based off a live camera feed or as like a music visualizer or something. the quality is very low, but if it's high frame rate moving as a response to something, it would probably look great.
interesting... so with low quality, does that mean low density of assets? or just shitty details
yeah
and they have the 256 bit limit
i think webp can loop
yeah the demo of that sdxs is seriously low quality. 512x512 and it's majorly blury and crappy looking. as i said, with very high frame rate that could make it look way better, but it's garbage as a single photo
i have a workflow somewhere for making looping webp with animatediff
is it a huggingface demo?
i swear those look awful most of the time anyway... skeletal settings
You might wanna have a look here -> https://www.purz.xyz/
layer blend modes are real interesting with ipadapter
Ohhh... wait ... not the workflow page ...
what makes me sad is we don't have a curves node
hard to believe that's the case, but it is afaik
Dang ... I had those workflows somewhere ...
if you only have literally one thing for image postprocessing it's that
input, three outputs with different tricks with blend modes before fgeeding it into ipadapter
would be neat to put those layer things through this kind of animation
more shit you'll never get with a prompt alone
That's the one I have been thinking of: https://discord.com/channels/1076117621407223829/1215481171148939304
at least for now, so i guess the word "never" is inappropriate 😛
blended the input image with itself via "add" at 100% opacity for the neg prompt, and "color burn" at 25% opacity for the pos prompt
it's neat how it makes an animation between them.. one was an eye, another a granny with headphones. so she leans into to look and then it zooms into an eye
wow
so this works really well with 2 of the images from one of the 9 set renders.
that way they're rather similar and it does animation between them.
@nimble mason
i wonder if there are other black and white animated gifs that we could use.
this is doing this one.
good idea - optical illusion images could be really cool
Rorschach blots could be very interesting and fun
so it looks like it doesn't want to do anything with the others i'm trying.
maybe it's not smooth moving enough.
it just generates regular animations between the frames.. which are certainly impresie, but not like that circle effect.
theres tons of these style of animations out there on stock video sights. "transition animation masks" or other kind of animated masks
ipadapter with exposure adjustment
another one of these i got kickin around
aren't the exposure nodes basically just post processing?
yeah, i even changed out the load video node to a local upload. it won't do anythign with any of the mp4's or gifs I'm getting off the internet.
the only one that works is that original circle one.
i was playing with touch designer, trying to figure out animations of this style in it. i might just revert back to adobe products or blender since thats my foundation
i've seen people making animated region maps too and that's pretty interesting
this reminds me a lot of infrared photography
if you play with the strength in the qrcode control net you can get other mask videos to work
i'm actually using images with inverted luminosity to get the strange lighting in the center
whoa
yeah, if only this was faster lol, i've been playing with streamdiffusion since it dropped in december, but this is so much more visually interesting than flickery neural soup
wow you're right. i upped teh strength to 0.75 from 0.45 and it's working now.
https://civitai.com/user/ipiv holy cow this guy has tons of these.
he has ones that are way more coherent too. so just mild animation to bring a still to life.
wow... he's got one that takes any text input and makes a simple motion thing out of it. really cool.
@nimble mason
is it possible to decode prompts from midjourney or recreate an exact copy. Especially i want to know promts from artist gokuryo.
Ask the creator?
He doesn't give
Cause it's his work? Maybe better create your own prompts ...
There A.I.s pic2promt ... not very good but they might help you.
if you want an exact copy, copy the image... lol
do you mean something close?
midjourney does its own prompt expansion. i don't know of a way to get that expanded prompt.
almost exact promts)
Clicked it to see animation 😄
have the prompt for that or is it clipvision? I'd like to throw it in this thing.
thats ipadapt not CV
this other workflow is really good for creepy stuff.
To quote Thanos: Fine, I'll do it myself. 🙂
better as gif. half ressed.
mine is cuter
😆
this mask worked pretty good https://media0.giphy.com/media/v1.Y2lkPTc5MGI3NjExd3FoYmszMTloZ204dDhuc25kYnp2Zm54N29oa3RudnJ0MWtnOG14ayZlcD12MV9pbnRlcm5hbF9naWZfYnlfaWQmY3Q9Zw/26BGOpbRi4ZtkZ1kI/giphy.gif
working in comfyui feel like
Do you have some sort of tiling?
looking
I ran like 4 hours of render and got all poop for it lol
I wouold assume.... its somthing in this upscaler
lol what tho?
cat asllep in my arm so im gaoing to just wave my one free arm useless/y lol
Can’t figure out which setting I need to turn down or off
comfy looks like you're not using a neural network, but creating one😃
SD3 image:
In a grand castle, a plump orange cat with emerald green eyes gazes out at the horizon. The castle walls are covered in intricate carvings, and the cat's fur is a deep orange hue. The scene is set in a lush green forest, with the sun casting a warm glow on the cat's fur.
generated by a beta tester called CrystalWizard on twitter
I showed them SuperPrompt-v1 and provided an example and they actually generated it
SD3 was rightfully confused as the output prompt was set in 2 PLACES AT ONCE! (forest + castle??)
That cats in a pretty darn large castle, its got a forest inside of it. And another castle!
Otherwise tho, pretty decent detailing
especially for the preview access models, which are often blurry or lowres
compared to whatever comfy and lykon are using 😫
The lack of color bleed is nice too. With 1.5 models that I'm used to, putting emerald eyes or orange fur will make a ton of things orange and the image will have things that sorta almost look like actual emeralds.
still couldn't put the cat in the castle
comfy is waaay more powerful for a reason
just looked at his twitter and it seems he might've used my prompt again!
Tbh the AI did a good job of the setting based on what it was given: 2 settings, a castle and a forest
That one is much better
damn
Can I quickly generate something without setting up many windows?
yeah just drag in an image with a workflow
Yeah just pull up someone's workflow and throw in your prompt
yeah if the prompt was less confusing:
In a grand castle, a plump orange cat
scene is set in a lush green forest
And if the model's weren't random, then it would have been more consistent
No node knowledge or editing needed
did you figure oul ur issue
trying one more thing
or something like that cuz it looks like its treating each frame as a tile
work nowL?
😄
cat still asleep in my arm
it’s ‘Become Image’ along with a little SUPIR
and iu use dvorak kb layout so typing one handed blind lol
this one was coming out great!
Other than the vae of course, they keep posting stuff that SDXL can do just fine on its own. Nothing that really shows off sd3.
I didn't even ask him to post using that prompt, but it was kind of him to do so
nice image btw
it almost looks like the hind leg wants to be a second tail
?
i want to see sd3 with someone eating ice cream
Yeah I think there's a certain setting you need to check in order for ultimate SD upscale to not create a grid of your image (or just use control net tile)
I think someone did that
no more pixar ice cream?
but it looks like whatever current finetunes can do I think (the ice cream)
realistically im gonna be using sdxl turbo as a daily driver anyway
Sure, that's a good point.
i wanna annimate these
having the base look like that is pretty great.
I hope we can tone down bokeh and other stuff
I don't want another anti bokeh lora that makes everythign super bright
The crazy part of sd3 is the natural language prompting tbh, image quality is already crazy insanely good on other models
text handling too
being able to actually make readable text opens so many new creative possibilities
definitely.
I'm less interesting in text stuff, unless its about memes, then sure its gonna be hilarious
larger batch size might smooth these out ya think?
but prompt adherence man
you can describe people much more closely if the model itself doesn't know
nice
smooth motion on the cat... but how does one slow down the morhing images in the backround?
cuz that light is bouncing all over hell
inside the waifu generator
I love the sunlight shifting about
Yeah it's gonna be wild to be using natural language in prompting after mastering the concept of prompting with "(masterpiece:69.69), best masterpiece award-winning artwork by gold medal artist" but it's going to be super fun seeing how sd3 reacts to descriptive/detailed prompts
Could always cut each frame out, mask the background, then place a static image of the background over it all. Or at least, a slower and better version of it.
I'll be gone for 5 minutes...
May I ask what kind of adapters you used for this? I'm looking to combine faces sometimes!
yep, this is just the Become Image workflow. There’s a cog for it—hold plz.
I would describe the temporal fidelity in this clip as “shroomy”
Looks quite 3D though 🙂

