#▶|stable-video-diffusion

1 messages · Page 5 of 1

noble wolf
#

or do you just mean it's nice looking?

weak bison
#

Nice looking + the parameters it's using are quiet sufficient for a decent animation and fast generation.

noble wolf
#

ah yeah fair enough

iron topaz
#

i have a font error "ERROR:root:!!! Exception during processing !!!
ERROR:root:Traceback (most recent call last):
File "/home/boy21/Downloads/AI/ImmageGenerator/ComfyUI/execution.py", line 153, in recursive_execute
output_data, output_ui = get_output_data(obj, input_data_all)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/boy21/Downloads/AI/ImmageGenerator/ComfyUI/execution.py", line 83, in get_output_data
return_values = map_node_over_list(obj, input_data_all, obj.FUNCTION, allow_interrupt=True)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/boy21/Downloads/AI/ImmageGenerator/ComfyUI/execution.py", line 76, in map_node_over_list
results.append(getattr(obj, func)(**slice_dict(input_data_all, i)))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/boy21/Downloads/AI/ImmageGenerator/ComfyUI/custom_nodes/ComfyUI-KJNodes/nodes.py", line 585, in createtextmask
font = ImageFont.truetype(font_path, font_size)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/boy21/anaconda3/envs/comfyui/lib/python3.11/site-packages/PIL/ImageFont.py", line 996, in truetype
return freetype(font)
^^^^^^^^^^^^^^
File "/home/boy21/anaconda3/envs/comfyui/lib/python3.11/site-packages/PIL/ImageFont.py", line 993, in freetype
return FreeTypeFont(font, size, index, encoding, layout_engine)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/boy21/anaconda3/envs/comfyui/lib/python3.11/site-packages/PIL/ImageFont.py", line 248, in init
self.font = core.getfont(
^^^^^^^^^^^^^
OSError: cannot open resource
"

fossil atlas
#

svd is surprisingly fast... it only takes about twice as long as generating a single image

noble wolf
#

remember to use svdxt if you're doing 25 frame gens

iron topaz
noble wolf
#

turn it on and off again 🙂

#

ez pz

fossil atlas
#

try changing it to "times.tff" that worked for me. may not work for you, depending on your OS

#

PIL font loading is very bad

fossil atlas
#

on these nodes. there are four of them

iron topaz
#

why is ther no strg z 😭

obtuse lance
#

any way to run stable animation in auto 1111?

noble wolf
#

strg z?

fossil atlas
#

ctrl-z/undo i guess

#

they've just accidentally duplicated a bunch of nodes and now have to delete them all manually

noble wolf
#

ahhh there's a plugin for that, but don't get it because it fucks comfy ui up on the latest update

#

might've fixed it by now but idk

fossil atlas
#

in english and basically every other keyboard layout it's labelled ctrl (control)

noble wolf
#

Is there a customnode for just the queue prompt button?

#

I just want a big red button in the middle of my nodes

fossil atlas
#

there are a few nodes that can queue the workflow like that. typically they queue it n times, for animation

#

mtg animation nodes is one

#

or is it mtb? i forget

#

they are kind of hacky though, like the undo thing they sometimes break

unkempt mica
clever pendant
iron topaz
fossil atlas
#

try higher fps on the img2vid conditioner

#

this will make the result shorter but possibly more coherent

#

(shorter in time, it will be the same number of frames)

noble wolf
#

damn bro kicked that croc and himself out of reality

clever pendant
weak bison
weak bison
fossil atlas
#

the workflow has four copies of the gen workflow in it, then at the end it generates a 2x2 with all four results in it. so you can just set whatever params you want on each one

#

it's not really xy because it doesn't loop over the product of two lists

weak bison
fossil atlas
#

just change it on the nodes directly

#

you can't change the svd params because it only runs that bit once

#

but if you make four svd nodes, one in each duplicate, then you can

weak bison
#

K, I'll try !

fossil atlas
#

but the whole point of this workflow is to test text prompting

#

making real loops in comfyui is currently extremely hard because of the internal architecture. there's some work towards refactoring to make it easier but it's quite complicated. even figuring out how a loop should be structured out of nodes is not trivial because a node graph is necessarily acyclic. so you have to handle it behind the scenes somehow.

#

oh also i recommend disabling that free_u v2 node. it does speed up generation but it seems to hurt quality

severe moon
clever pendant
severe moon
severe moon
noble wolf
#

That one is actualyl so fucking sick. I guess it's super easy to keep consistent because it's just a bunch of cubes huh

open heron
unkempt mica
weak bison
silent hinge
unkempt mica
narrow canyon
#

@grim tangle Thanks for the concat workflow! Playing around with it now with the plan to try and force the model to zoom all the time but interesting enough, the zoom prompt seems less coherent than just the default

cloud knot
severe moon
unkempt mica
bold wave
#

did that three layers guy end up sharing his workflow? I havent been keeping up

unkempt mica
#

perfect

unkempt mica
bold wave
#

I was hoping you would do that.

sand plinth
open heron
severe moon
unkempt mica
bold wave
#

I did myself as homelander, and it did not disappoint. 11/10

open heron
noble wolf
#

Fucking nice!

#

I'm just so hyped people can make basic videos for any purpose now with like 95% ai that's pretty decent

long stream
severe moon
unkempt mica
#

this one is almost really good lol

severe moon
#

takes about twice as long for me

#

can shave off about 15 sec just actually setting dimensions to 720p.... "720p is the standard high-definition (HD) display resolution of 1280x720 pixels, with progressive scanning, at a 16x9 aspect ratio." - the googley

noble wolf
#

god damn those on twitter are pushing it and it looks beautiful, pushing 24gb of vram xD

severe moon
mental magnet
grim tangle
wicked wren
#

Can’t wait for this to run on apple silicon, performance on m3 max should be pretty noice no??

long stream
noble wolf
#

what do you guys use to compress an image? just set an upscaler at like 0.8 or something?

#

just need more details before I shrink it to fit the video size

buoyant aspen
noble wolf
mental magnet
hushed coral
lean gate
#

Hey, feel free to post this at #1092446741984444416 , I will delete it from here since this channel is for Stable Video related content.

tepid stream
hushed coral
tepid stream
narrow canyon
tepid stream
hushed coral
tepid stream
sand plinth
wicked wren
#

I’m getting this error on my Apple silicon: ”There appear to be 1 leaked semaphore objects to clean up at shutdown” when trying to render SVD with comfyUI

#

Segmentation fault

severe bear
#

most of my outputs just end up as camera pans sadcat

#

could've done the same in 1/100th of the time

silent hinge
#

yes i get a lot of those too

bold wave
bold wave
severe bear
severe moon
silent hinge
#

not sure if you referring to the tutorial we have now on layers

#

from Fancy

bold wave
#

oh no, what I mean is to have a profile character (or car, or frog, or whatever) and then a detailed background.

when there are no background details it tends to look... bad.

tidal pendant
severe moon
# bold wave oh no, what I mean is to have a profile character (or car, or frog, or whatever)...

In my experience you want something with lots of 'depth' to it, that can be from tonal shift (orangey subjects against blueish background), heavy bokeh, obvious perpective lines.... stuff that divides the image into foreground or background. You can jsut choose images that already have this effect but also use an image editor like Photoshop to tweak colors or focus yourself. And even then it's still a crapshoot how the seed will respond. Sometimes it's better results to just try a different seed or motion bucket value instead of trying to tweak the image.

bold wave
severe moon
bold wave
half hound
severe moon
#

nothing official

bold wave
#

versus a B&W image, plus a boring background.

severe moon
#

yeah it needs 'stuff' to grab onto

tepid stream
bold wave
tepid stream
#

Topaz Video AI dogsmile

open heron
tidal shadow
tepid stream
tidal shadow
#

I just hope that you aren't@tepid stream paying for topaz.

bold wave
#

Hey I say if you have access to professional grade software, use it.

tidal shadow
#

People recommend it like they recommend Maya. It's the industry standard of upscaling. It's not good but also not the worst.

It cost so much because it's private code that has an any number value they want to attach to it. Topaz only has their image upscaler going for them.

bold wave
#

Replace that with anything. Photoshop. Microsoft Word. Etc etc.

For all you know, he is a graphic designer and his biz pays for it.

tepid stream
#

I didn't pay a penny, my company bought it

tidal shadow
molten eagle
#

Let's agree: Stable Video Diffusion is already mind-blowing in its first basic version.

Midjourney generation as image input.

#svd #stablevideodiffusion #ai #midjourney #aivideo #aianimation #stablediffusion #stabilityai @ai.art.community @stability.ai @midjourney.gallery @midjourneyartwork @midjourney.artwork.ai

#

so cool to play around with 🤩

sterile mesa
#

I've collected so many workflows for svd, idk which to use.

molten eagle
#

haha

#

is there a workflow to basically take the svd output as a new input to generate better details on top or maybe interpolate frames?

tepid stream
tidal shadow
tepid stream
#

Why start it then

tidal shadow
tepid stream
#

Makes even less sense now dogsmile

sterile mesa
#

So.... how about them videos

severe bear
cloud knot
#

Damn, just went back a few, these are soo good!

hazy gorge
#

Hi guys does anyone know if they have already done fine-tuning on the Stable video diffusion model? Or at least if there is a method to train the model? 🧐

sterile mesa
#

Someone released pruned safetenors of the models a few days ago. Half the size of the original.

hazy gorge
sterile mesa
#

Research paper is in the pinned link in this discord channel

keen shuttle
#

I'm loving SVD .I'd like to control what it choose to render the zoom out ,or pan and movement any options for that yet?

fossil atlas
#

not yet

tired egret
fossil atlas
#

the paper mentions camera motion loras but they haven't released them yet

open heron
#

Trying lemon for the first time

keen shuttle
#

are you using style loras with this?

fossil atlas
#

you can try to mix a text prompt into the conditioning but it very rarely does anything

keen shuttle
#

there was a movement lora for animate diff wonder if that will work

#

I also wanna plug in text prompt,does it work well?

fossil atlas
#

no, it is almost entirely useless except in a few cases

tired egret
#

you can only generate the initial image, or you can change the image with unClip

fossil atlas
#

you can mix image and text conditioning but 99% of the time it does nothing or destroys the image

keen shuttle
#

with my 1024 x 1024 renders ,I been cropping with node it to 1024 x 576 seens more stable

fossil atlas
#

1024x576 is the native res of svd model

keen shuttle
#

so does it also like that to start?

fossil atlas
#

the node auto crops and resizes the input i think

#

idk, it doesn't seem to make any difference if you pre-crop it anyway

keen shuttle
#

I think only to control where it crops

fossil atlas
#

yeah, that's true

trail finch
#

any idea why i get this ?

robust scarab
fossil atlas
trail finch
fossil atlas
#

you have to use this one

robust scarab
fossil atlas
#

svd doesn't have a clip model in it, so CLIP output is going to return None

trail finch
#

i feel like

fossil atlas
#

if you're trying to just do normal txt2image then... you should load an sdxl model, not the svd model

trail finch
fossil atlas
#

no, it is different

#

this is where the problem is

#

you can tell, because that node has a purple outline

#

and the problem is the node before it wants an SDXL checkpoint, but you have selected the SVD checkpoint

#

the SVD checkpoint doesn't have a CLIP to output, so it outputs None

trail finch
#

so this one doesn't works ?

fossil atlas
#

not for txt2img

#

it can only do img2vid

trail finch
#

i disconnected the text part and connected an image like in a yt vid

fossil atlas
#

yes, but you still have a image preview node at the end of the txt2img chain, which means it will still try to run

trail finch
#

do i delete it then ?

fossil atlas
#

you can mute it with ctrl-m or delete it

trail finch
#

ok i got no preview but it's calculating now

#

thanks btw

open heron
silent hinge
#

(wave seem more fitting for the subject haha)

solar willow
solar willow
noble wolf
#

god damn that looks tasty

#

I wanna make me some of that

open heron
native imp
#

im getting the same, how did you fix this error?

severe moon
native imp
severe moon
severe moon
tough crypt
pastel matrix
native imp
#

hi again 🙂 how do you make those normal videos? i only get that weird .webp thing

clever pendant
green valve
#

Hello, does anyone know how exactly the kohya deep shrink node works?, I have tested it with SVD and the results are much more coherent when working at a higher resolution than 1024x576, it seems to be 75% functional at double the resolution, it may be better to move the undocumented node parameters, or maybe kohya make one for SVD, the results are really encouraging when using it like this by default

severe bear
severe bear
#

haven't tested it with svd but that's a pretty good idea

green valve
#

Of course, as I said, I get this by doing double the resolution.

severe bear
#

that is trippy

calm temple
#

hey just wanted to pop in and say thanks for posting your notes, it helped me figure out my own wacky issues on linux with getting xformers and cuda to see each other properly because it was seeing a cached ver of my venv and wasn't linking up the reqs quite right

severe bear
#

dall-e images work pretty well as input (much better than sd sadcat)

green valve
#

I put together a video with SVD videos, all with SDXL generations

clever pendant
unkempt mica
#

@severe moon I have gone down a Miku rabbit hole and I blame you.

astral beacon
unkempt mica
silent hinge
#

what program would you experts suggest for someone to make video of his clips from stable video. you know if i want to paste a bunch together maybe with some music. i have never done this before. sorry if ot

severe moon
tidal pendant
tidal pendant
severe moon
#

Those are the most basic and easy tools that are probably already on their system for most people.

silent hinge
#

thanks. it always intimidate me to try before but i want to string some images together haha

tidal pendant
severe moon
#

I mean these vids we're outputting are so short and lower res, it's not like they need editing other than putting them one after the other on a timeline, so CLipchamp or equivalent should be plenty.

severe moon
tidal pendant
#

Ooo

#

Thanks

severe moon
silent hinge
#

cool image even without the video

unkempt mica
icy valley
#

This is from txt2img2vid in one workflow

half hound
#

anyone have any good upscaling workflows?

severe moon
green tree
#

Any chances for Stable Video Diffusion at Dreamstudio?

zinc garden
#

So, is the motion it ends up making in any way tied to the seed that is fed into the sampler? Is there a way to 'encourage' motion behavior?

#

Because right now it's a crap shoot if I get something 'animated' or just panning

severe moon
severe moon
# zinc garden So, is the motion it ends up making in any way tied to the seed that is fed into...

It all seems random at this stage. This is a research model. The biggest effects on motion from my playing around with this is: image, motion_bucket_id, seed, and video_frames+fps . Some images are just more likely to encourage certain types of movement. The motion bucket tends to add more motion the higher it gets, but whether or not increasing it leads to more motion depends on what's in the image. The seed can totally change what's happening, but sometimes changing it just means variations of the same 'bad' motion. And finally the video_frames+fps determines how many frames get created and how close or far they are from each other temporally. You can get really jittery videos if you set fps high and choose long amount of video frames, but this usually lokos bad and takes a long time to render so it's not worth it for what I like to do.

zinc garden
#

Gotcha, thanks. What motion_bucket_id do you usally go with?

#

And do you mess with the augmentation_level?

severe moon
# zinc garden Gotcha, thanks. What `motion_bucket_id` do you usally go with?

I've just picked random values, I like 33, 66 ,111, 222, 333.... There's not enough info on what exactly they are, but I heard some discussion that they are like some sort of list of stored types of motion where the smallest amount and least chaotic are the lowest values. But it also doesn't sound as simple as it just increases some chaos factor that adds motion.... it sounded more like it was an actual index of stored types of motion from slow to fast but we don't have a list of what the motions actually are.....

hushed coral
#

Having some success with bucket 255

fallen wren
zinc garden
#

Any tips on augmentation_level and the sampler/scheduler (and CFG)? I'm often losing coherence FAST and wondering if that too is just a matter of generation a bunch to cherry pick good ones

severe moon
tidal pendant
bold wave
# tidal pendant

lol this looks like the sort of thing my 6 year old watches on youtube

tidal pendant
#

XD

open heron
#

It may not seem like it but this is a complex 2-stage SVD work that I will try to perfect tomorrow

bold wave
#

still working on the image, but I think I found a good flow, I took the IP adapters, merged the profile with the background, and got some pretty decent results! A lot less cherry picking when you're that specific!

icy valley
#

as far as I've gotten, anyways

hushed coral
severe moon
icy valley
severe moon
wicked wren
#

Hi! Trying to get Stable Video Diffusion to run in ComfyUI on my M1 Ultra (Mac Studio, OsX Ventura) but getting this error when queuing the prompt. Any ideas?

hard ice
#

Water splashed out of the basin

copper berry
copper berry
#

oh ok then i said nothing

fallen wren
icy valley
#

step 1: purchase the $3,999 rendering function from apple.fckU

hushed coral
alpine warren
#

Hi, I got permission denied when access to the github repo, do I need to apply for a stable video wait list for this?

alpine warren
fallen wren
alpine warren
#

I didn't edit, I just tried to clone. And I got message like this: Please type 'yes', 'no' or the fingerprint: yes
Warning: Permanently added 'github.com' (ED25519) to the list of known hosts.
git@github.com: Permission denied (publickey).
fatal: Could not read from remote repository.

fallen wren
#

github requires elevated perms to clone with git@ for some reason

alpine warren
fallen wren
#

Oh, yeah, that's a weird line, i'll poke tim about it

finite breach
fossil atlas
#

because git@ is a ssh url. it works as long as you have a pubkey on your gh account, even if you have no specific permissions on the repo

next acorn
restive onyx
#

can you run this with a rtx 2060 6gb vram ?>

icy valley
tepid stream
restive onyx
#

thanks and if you generate do you have an option to say what you want ? like : lion in a cage dancing

tepid stream
restive onyx
#

can you define how much time ?

tepid stream
#

Yes you can define Frames and Frames per second as well as a few other knobs and twists

restive onyx
#

ty

icy valley
#

I feed it a prompt and the workflow makes the image and sends it to SVD shrug

#

Same thing really

severe bear
#

does this actually work for anybody here in any capacity or just snake oil?

#

all my results are just as random as they were

ruby pine
#

I'm amazed at results but it feels entirely like very first SD model. Like 0 control over what we can get from it

ruby pine
#

can't wait for smth like LCM to drop

#

feels wierd to generate frames so slow

tepid stream
ruby pine
glad dew
tidal pendant
glad dew
tidal pendant
severe moon
tidal pendant
#

XD

severe moon
tidal pendant
severe moon
tough crypt
tidal pendant
manic stag
#

help

#

cat fly

tidal pendant
open heron
tidal pendant
severe moon
severe bear
#

would be sick if the parts moved or something but guess that's asking too much from svd

fossil atlas
#

it can do it... but it's very rare

severe bear
#

I guess it would be easier to experiment if I didn't have to wait 6 minutes for 25 frames

#

(lcm any time soon?)

fossil atlas
#

well you can use the 14 frame model... it's faster, about 2 minutes for me, and i can't even run the 25 frame one

buoyant aspen
#

Can't wait for they next model if we can little force a movement it will be great tool.

open heron
severe bear
#

meh, it's like 3-4 minutes with decoding and stuff

bold wave
#

until youre done experimenting

#

Cat (+ original images)

severe moon
tidal pendant
#

im going to make a video were i take the last frame from the last video and make a new video and repeat that till the video reaches a minute

bold wave
severe moon
tidal pendant
#

i did it and its cursed XD

fossil atlas
#

new Tool video dropped?

severe moon
severe moon
tidal pendant
#

yea

severe moon
#

but anyway if anyone's thinking of doing high motion buckets at 1080p just know the model really freaks out

tidal pendant
#

are you running the model at actuall 1080

severe moon
#

yup, for science

tidal pendant
#

i think my gpu would die if i tried that

severe moon
#

lol

tidal pendant
#

ima try it but i only have 12gb of vram

severe moon
#

oh no,

tidal pendant
#

what gpu are you using?

#

yup instant video memory issue

severe moon
#

I have a 4090, it can do 1080p in about 90 seconds for this 10 video_frames setup. uses around 18GB for the sampling then jumps up to maybe 22 or 23Gb during the VAE so I don't recommend trying it for anyone with less than 24GB of VRAM

tidal pendant
#

me with my 3060 XD

#

welp im goin back to 1024 x 576

severe moon
#

i was just seeing if it could do 1080 without hitting shared memory

#

but the model can't handle that much res, everything just tends to wanna either explode or just do nothing

#

maybe some simple static shots with water in them would do better at this res

tidal pendant
#

im going to create a video with a really low res to see what happens

#

that was the fastest generationg i have ever seen

#

wonder if there ever going to release a model that has text input for guiding it

severe moon
#

oh yeah, they said this is just the research model. txt2vid is part of the plan eventually

tidal pendant
#

oo nice

#

since i usually only use local ai models the last video ai one was modelscope and that one was getting old

severe moon
tidal pendant
#

i love how this model moves in 3d space

severe moon
#

yeah it likes that the most for me. i know it can do walking, moving people, talking stuff but it rarely seems to happen

tidal pendant
#

i wonder what other random stuff we can push it to do for research

ocean flare
#

And on for the golden ticket:

tidal pendant
#

boat train

#

XD

severe moon
#

high speed sail

#

bullet boat?

#

starring brad pitt

tidal pendant
#

bur

#

discord embed falier

#

ima resend

#

for this one i forced it to render 40 frames instead of 25

#

and it seems like it got unstable

severe moon
#

yeah increasing video_frames beyond the model training limit tends to do that

tidal pendant
#

im surprised i never got a vram error

severe moon
tidal pendant
#

hes calling so much magic energy

severe moon
#

it's definitely a cool effect if you want it

tidal pendant
#

whats the weirdest generation result you got from this

severe moon
tidal pendant
#

oOOoo spooky

#

ilike the ghost XD

severe moon
#

man it's gonna be so cool once we can actually ask for stuff like that instead of just changing seeds and hoping for the best

tidal pendant
#

ikr

#

then you could give it context to what its ment to generate

#

;|

tidal pendant
#

it vanished

#

no drugs allowed
drugs despawn

#

maybe they can help

#

👍

severe moon
bronze pollen
#

Any idea of the model architecture of pika lab?

tidal pendant
#

what

severe moon
#

they're probably training their own models

open heron
#

Even in a bad drawing it knows how it should move. Amazing

severe moon
#

man I can't wait until we get txt/imgTo3d

open heron
tidal pendant
severe moon
tidal pendant
#

never tried 2d stuff with it

severe moon
tidal pendant
#

:$

#

also y does your video loop

#

is it something in your workflow

soft stratus
#

ping pong

severe moon
tidal pendant
#

XDDD

soft stratus
#

loop would be playing again

severe moon
tidal pendant
#

i figured when i saw dragon's screenshot but when you randomly said it i thought you were just randomly adding to conversation XD

soft stratus
#

Gotta think of what the term would mean... ping pong = back and forth. 🙂

tidal pendant
#

yea

severe moon
#

Video Combine also has a loop count if you need set number of loops for a gif or video

tidal pendant
#

o ok

#

i was just wondering what it was im prob going to keep it off because the ending frames are funny most the time

severe moon
#

it's just an easy flag to make the videos twice as long for free and look good if osmeone turns loop on their player

tidal pendant
ruby pine
#

Note: 40GB of VRAM required

Why they claim this?

tidal pendant
#

its running with my 12gb vram card so i have no idea

ruby pine
#

10,6/24, so much free vram that it's possible to leverage upscale and anything you like. What an absurd statement about 40gb. Could be that we do smth wrong or what

#

damn ffmpeg consumes more vram than this whole generation

severe moon
ruby pine
#

also what samplers do you use? smh i don't like dpmpp_2m

#

but euler is too blurry

tidal pendant
open heron
ruby pine
#

wish we had some semblance of "control" over this pandora box

tidal pendant
#

im waiting for them to make a text input that we can use

ruby pine
#

it's fun but

#

i doubt they can just add it if it was trained without pairing to text tokens

tidal pendant
#

in the research paper its showing them using text to video

#

so it exist somewhere its just not out yet i think

zinc karma
# ruby pine Note: 40GB of VRAM required Why they claim this?

on Windows if it need more VRAM (more than 24GB) for me it works with VRAM swap into normal RAM where it took like 56 GB and it is just 50% slower in last/2nd VRAM hungry phase. It doesn't crash at all. I think this is windows feautre and they run it on Linux where it crash.. so maybe that's why they think it needs 40GB.
I haven't tested it on linux but I guess it doesn't have VRAM swapping by default.

ruby pine
#

Currently allocated : 15.81 GiB
Requested : 83.08 GiB
Device limit : 24.00 GiB

#

occasionally 4x upscale got into film interpolation

#

and it didn't end well

tidal pendant
#

i added a photo of a clock hopping it would animating the hands but it just moved the camera

zinc karma
# ruby pine i managed to get BSOD

I have sometimes with stable diffusion start after another AI program exit.. that is probably drivers fault it doesn't clean the RAM properly. Maybe try Studio drivers if you have gaming.

ruby pine
#

imma studio, well it's fist bsod ever

#

just some glitch

#

imma messed it real bad so, it's fine

#

but yeah swap works for me too. but i can't say that it's 1/2 speed, for me it's like super slow

fossil atlas
#

so what's the worst possible test we can throw at svd? i think something like a picture of a wine glass falling. in the video we should see it smash, like this: https://www.youtube.com/watch?v=mWRuzFsmQHg

A small test animation of a wineglass breaking. I used the free plugin Thrausi http://nitro4d.x10hosting.com/blog/?page_id=79 to break the glass apart. Thrausi is a very useful destruction tool. You´ll find more information and also more free plugins on the Nitro 4D http://nitro4d.com/blog/ site.

Again made with Cinema 4D rendered in Vray.
...

▶ Play video
#

so far i've seen absolutely no indication that svd is capable of anything even close to this

#

maybe if you start with a picture where the glass is already beginning to break

#

but that's cheating

tidal pendant
#

maybe when the text input comes out you could specify glass breaking or something

tidal pendant
#

did another video chain

fossil atlas
#

try lower cfg. it might survive a bit longer

barren hound
#

Is there a text to video through comfy yet? Or only img2vid so far?

fossil atlas
#

only img2vid with svd model

#

the paper talks about a txt2vid version, but they haven't released it afaik

#

it also mentions camera motion loras, also not released yet

tidal pendant
fossil atlas
#

what if instead of taking the last frame, you take the second frame?

#

or the middle frame etc

tidal pendant
#

i like watching it slowly get worse

fossil atlas
#

it will still do that, but if you take the second frame, it will be continuously smooth, instead of big (downward) jumps in quality every 25 frames

tidal pendant
#

true

fossil atlas
#

it might still jump just as far though, meaning you get like a 4 frame video. worth a try though i think

tidal pendant
#

yea

fossil atlas
#

if you set video_frames to 2, are the two generated frames identical to the first two when it is set to 25?

#

or do you get something completely different?

tidal pendant
#

no idea

severe moon
fossil atlas
#

how about quality?

#

i mean with 2 frames, will the second frame be as fried as 2/25, or 25/25?

severe moon
#

Good question, probably depends on your CFG scaling

fossil atlas
#

i don't actually use that cfg scaler node

severe moon
#

You just run straight into the SVD node from the model?

fossil atlas
#

yeah. the scaler just seems to make every video explode at the end

severe moon
#

Interesting

fossil atlas
#

i don't think it actually is a temporal thing anyway, despite what the docs say

severe moon
#

I leave it connected but almost always just the same setting as on kSampler

tough crypt
fossil atlas
#

i don't see how it can be if it is tweaking the model

#

hmm yeah, setting it equal or higher than what is on ksampler results in it behaving like it is not there at all. i did notice that, and then removed it 🙂

severe moon
#

I think the intention was to ramp from lower initial CFG to final samples CFG to control a tendency to get fuzzy towards the end

fossil atlas
#

i think it actually modulates over steps rather than frames. don't know. it seems to affect the first frames just as much as the later ones, it's just that with lower cfg the video is more likely to tend towards more motion

#

but if you set motion bucket real low and fps real high so there is no motion, then compare with and without it, with it just straight up looks worse

#

and every frame should be identical in both cases

noble wolf
#

if I use upscaler with controlnet, do I need to upscale the reference image first so my apply controlnet is 1024x1024, sorry i know its wrong channel but others are dead

fossil atlas
severe moon
tidal pendant
#

these are big old guns

open heron
#

Snap fingers. Very surprising animation

unkempt mica
#

That's a good one.

tidal pendant
severe moon
#

burned through too many seeds trying to get one where not just the background moved so I decided to just throw them all into one long clip....

ruby pine
#

smh struggle to just slap them into ffmpeg

#

wonder if there are any convenient nodes

ornate apex
#

Any good getting started guides for local Stable Video Diffusion out there? Also, any idea of a good GPU? I have an NVidia with 12Gb -- is that enough?

tidal pendant
tidal pendant
severe moon
severe moon
severe moon
tidal pendant
#

i know when it comes out im going to be making more cursed stuff

severe moon
tidal pendant
#

are you using ping pong to go back to the original frame to add on

severe moon
#

no, just changing seed

tidal pendant
#

e

severe moon
#

that a cool thing about pingpong, anytime you change something and get a new video they all start from the same image so that's always the first and last frame and you can pretty easily string them together seemlessly to showcase

tidal pendant
severe moon
#

🙂

tidal pendant
severe moon
#

no worries, I just wanted to be clear in case anyone got confused reading this later and thought the way to make new stuff was make a video and pull the last frame out for Load Image to use to start the next one

#

which is totally a way to string stuff together but not really necessary if you're using ping-pong

tidal pendant
#

time to create a video with motion bucket at 1023

severe moon
#

oh yeah I also used ping-pong initially to make the videos longer so i'd hit 3 seconds for anything i wanted to put on instagram

tidal pendant
#

I just render the videos at 6 - 15 fps and interpolate thrm

severe moon
#

yeah i usually render at 8 and interpolate x3 so a 12 or 24 frame long clip hits a full one or two seconds

cloud knot
# ornate apex Any good getting started guides for local Stable Video Diffusion out there? Also...

Just found this video describing the first aspects of SVD like frames_per_second motion_bucket_id etc https://youtu.be/82l0DsbLHhY?t=376 , also talks about to either install pinokio.computer and run locally or run it in huggingspace etc

Stable Diffusion has released an exciting new AI video model - Stable Diffusion Video. This revolutionary model generates short high quality videos from images.

In this video, I test out Stable Diffusion Video using different tools like Pinocchio, Hugging Face, and Replicate. I showcase the video quality, motion controls, and 3D understanding o...

▶ Play video
noble wolf
#

how do I get my comfy ui to use more than 8gb of vram when doing svd ?

#

when I load in it detects my vram and says it loads normally, so it shouldn't be automatically using lowvram or anything

#

nvm got it

severe moon
noble wolf
#

... don't eat the mushrooms

icy valley
#

This one rotates the subject all the way to a profile view

#

shrug go figure

shut narwhal
#

Anyone got a workflow with the different Motion Settings tests to generate all at once by any chance ? habby

keen ember
#

help:
i generate 4 secs using an input image
feed back in the last image so i can get 4 more sec (8 sec video now)
but it gets blurrier and now has purple ghost stuff appearing, omg it is deteriorating, how to avoid this?

hushed coral
#

that's a good idea. how do you grab the final image?

copper berry
#

or create your node in comfy to do that

severe bear
#

you can just grab the last latent from the batch that svd outputs

#

I got the whole thing down to 4 minutes, as in txt2img2vid2upscale2render so not that much of a wait anymore, but still just camera pans sadcat

silver dirge
left gulch
ruby pine
woven fractal
#

I keep on getting a cuda out of memory error

jagged oriole
woven fractal
#

my GPU has 6gb of VRAM. I think the problem here is with pytorch

torpid eagle
# woven fractal my GPU has 6gb of VRAM. I think the problem here is with pytorch

Have seen folks running svd with as low as 8gb VRAM, but not sure if anyone has gotten it working with 6gb yet.

https://github.com/comfyanonymous/ComfyUI has a --lowvram option that is supposed to be flipped on automatically if it detects low VRAM so might be worth seeing if you have any luck with that if you haven't already.

GitHub

The most powerful and modular stable diffusion GUI with a graph/nodes interface. - GitHub - comfyanonymous/ComfyUI: The most powerful and modular stable diffusion GUI with a graph/nodes interface.

woven fractal
#

I do have comfy UI

#

how do I get SVD on it?

torpid eagle
# woven fractal how do I get SVD on it?

If you have https://github.com/ltdrdata/ComfyUI-Manager installed you might be able to use the UI to install the SVD model (I haven't gone through this workflow myself).

If not, you'll want to have the SVD checkpoints file in the models/checkpoints directory within the ComfyUI directory.

wget -O /models/checkpoints/svd.safetensors 'https://huggingface.co/stabilityai/stable-video-diffusion-img2vid/resolve/main/svd.safetensors?download=true'

Once you have the SVD model installed, you can checkout https://comfyanonymous.github.io/ComfyUI_examples/video/#some-explanations-for-the-parameters which has links to image to video workflows. You can copy one of the workflows from the links and load them in ComfyUI.

GitHub

Contribute to ltdrdata/ComfyUI-Manager development by creating an account on GitHub.

woven fractal
#

Alright, thanks!

severe moon
dusty lance
#

who are you crazy people, and how in the hell did you get 40gb vram

#

did ya'll download vram or something?

heady coral
#

Heheh

#

Always download more ram

#

Go into your public library and download more ram

Be a man coolcow

glad dew
tawny storm
#

? Any idea why?

severe moon
tawny storm
severe moon
tawny storm
severe moon
tidal pendant
severe moon
# tawny storm still the same issue 🫠

i dunno man, maybe go to the comfy github and raise an issue there? it looks like some sort of connection error because [remote computer refused connection]. Github doesn't appear to be down so either you have an internet issue or the build of comfy you're using has some sort of issue itself.

fervent belfry
tidal pendant
#

i dont know about 8gb im using ComfyUI rn

white ember
#

Does anyone know how to control the motion? Thanks!

bold wave
#

got a decent streamline going! mix 2 IP adapters for front and background, interrogate those clips, and then have a bit of guidance prompts. so far, it's been working quite well! I've tested this with flipping out cats with dogs and humans, and it seems to be super nice!!

sterile mesa
bold wave
sterile mesa
icy valley
tidal pendant
solemn turtle
#

Hey does stable video diffusion work now with mac using comfy ui?

sterile mesa
bold wave
# sterile mesa Oh thank you! Yoink!

enjoy! youll have to remove the custom lora but there's not much else to it. comfy is good at daisychaining different ideas and putting it into one cohesive thing.

sterile mesa
severe bear
#

just wondering but are there any sadtalker nodes for comfy? feel like it would go well with svd thinking
looked at the code but it doesn't seem immediately obvious how to implement them

pallid wolf
#

whats the best workflow atm for svd?

severe moon
severe moon
tidal pendant
ocean flare
#

Did a small video animation for a SUNO music generation, pretty amazing how fast you can set something up now. Even if its pretty rough

tidal pendant
fluid quest
#

anyone know why it's reloading the model in comfyui every single time I generate a video?

fierce schooner
bold wave
#

yeah the fry is always due to high CFG

tidal pendant
warm acorn
fluid quest
pastel storm
#

4 questions for SVD in comfyUI: 1) how do I change the model and output directory 2) how do I add the SDXL refiner in my queue workflow 3) How do I make the motion in SVD better and actually relate to the image (i typically get results of a still image with the canvas panning away from it) 4) can someone explain cfg and augmentation level for SVD Thanks

open heron
severe moon
open heron
severe moon
# pastel storm 4 questions for SVD in comfyUI: 1) how do I change the model and output director...

3 - you change the seed and hope for the best. motion_bucket is like a dial to go from slow to fast but it's not really possible to dial in anything specific. a simple trick is to change the ksampler steps down to around 4, this will look like crap but give you much faster output and a general indication of what type of movement will happen. You should be able to see if it pans the right way, or zooms, or looks like subject is a moving cloud, those are the main types of movements and then you can decide if you want to change the step# back to a proper render value.

severe moon
# pastel storm 4 questions for SVD in comfyUI: 1) how do I change the model and output director...

4 - Augmentation level is basically adding extra noise back into each step, it can help fast moving stuff from getting all smooth looking. I still ahven't seen a good guide or explanation of what it actually is doing though so take that with a grain of salt. CFG sort of fries the image, if it's too low everything will look all soft,fluffy, and pastel but too high and everything gets too contrasty with weird edge artificats and looks like melted plastic, so adjust to taste I guess....

severe moon
severe moon
# pastel storm 4 questions for SVD in comfyUI: 1) how do I change the model and output director...

1 - I wouldn't change the model directories unless you already know about symlinks and config files. If you know what those are then the github for ComfyUI has a section under Installation explaining how to setup your own config file and point it to wherever you're keeping your models. It will probably cause problems though to not just put things where Comfy expects them, especially if you start loading Custom Nodes.
You can save files wherever you want by looking for filename_prefix in whatever nodes are saving stuff in your workflow and setting the path there.

pastel storm
fierce schooner
#

I bought an M.2 SSD to store all of the checkpoints/models. It improves performance when switching models. Then just use symlinks in each apps folders to point to the SSD.

severe moon
fierce schooner
#

Only problem with symlinks is comfyui kills the links when it updates. So I just wrote a batch script to redo them after every update.

severe moon
#

Yuck

#

I finally decided that ComfyUI will be my daily driver for most AI stuff so I can just symlink anything else to where it stores things. Life has been much simpler since then.

warm acorn
#

so what's the lowdown on the whole Stability rumor they might sell to Jasper? should we download and archive everything so we can still use it if/when everything is no longer open source? 🙂

#

time to get a few massive hard drives hah

severe moon
clever pendant
severe moon
severe moon
severe moon
icy valley
tepid stream
#

Whats a warthong? Susge

winged iris
#

anyone knows how to generate videos exceeds 4-5 seconds?like 8-10 seconds or more longer

copper berry
bold ocean
copper berry
#

if you find this node you should be able to loop (maybe you also need a loop node)

winged glacier
#

a lonely man walking on the beach

severe bear
tawny storm
#

Can somebody please send me the folders of the following Missing Nodes:
...

Incase if you are wondering why I am not using Manger Extgension, then just beleive me, I have tried too many attempts, but in the end, it states Faled to Establsih Remote Connection..

Apparenly, I mannualy installed RIFE Video into Custom Ndoes folder, and it worked..

So, incase any of you hguys have the above mentioned Custom Ndoes installed, please send me the link, or just put it in a dropbox

left zenith
#

Hi, new here. I've done two videos with SVD generated with online solutions, however now I'm trying to run it locally (6 GB VRAM) via my Pallaidium: https://github.com/tin2tin/Pallaidium add-on for Blender, and the inference time is insane: 20 minutes for 14 frames(with Zeroscope it was max 2-3 minutes).
My SVD videos:
https://www.youtube.com/watch?v=3dco2igWyvk
https://www.youtube.com/watch?v=aEAy24d8F6E
I would love to hear if you could share your sweet-spot settings, VRAM, and inference time with me?

GitHub

Generative AI for the Blender VSE: Text, video or image to video, image and audio in Blender Video Sequence Editor using Zeroscope (SD, XL, upscale to XL), Animov, Potat1, Stable Diffusion(1.5, 2.0...

Done in the Blender add-on Pallaidium and SVD

▶ Play video
bold ocean
left zenith
# bold ocean You videos are great ....I've heard a little bit about Palllaidium, how good is ...

As mentioned, I developed it on 6 GB of RAM. So , everything implemented works with this little RAM. I never tried ComfyUI and can't compare it. The main advantage of Pallaidium is that you're generating images, video, sound, speech, music, and text directly into the timeline of a video editing program, and you can use strips as inputs for batch operations. Ex. select all text strips and batch convert them into images, or speech, or images into videos or images into text into images... or whatever...

However, I mainly posted here to ask for people's sweet-spot SVD settings, VRAM and inference times?

fossil atlas
bold ocean
fossil atlas
#

i think it uses comfy api though so you still have to run the web server

left zenith
#

How long does ComfyUI take to render 14 frames of SVD on your amount of VRAM?

bold ocean
#

Using SVD (not SVD_XT) I'm getting 1-2minutes on my 4080 16GB

copper berry
#

with a A40 it took 110s 🙂 (XT version and 25 steps and with model loading)

#

with a CPU tho (yes i tested with a R7 5700) it took me like 45m to 1h

fossil atlas
tepid stream
left zenith
#

Uhhh... I'll need to upgrade my hardware. Out of curiosity, I'll have to see how Comfy works on my 6 GB then. Any hints on how get SVD & ComfyUI up and running quickly?

tepid stream
#

But depends on the image size you feed it obviously

left zenith
#

26 minutes!!!

tepid stream
#

I dont envy people with lower hardware man

#

Cant imagine waiting half an hour just for a camera slide

severe bear
#

honestly just take the depth map and animate the pan yourself at this point 😆

#

you're going to have 25 minutes left over too

tepid stream
left zenith
shut narwhal
tepid stream
#

Its SDV with topaz for frame gen

shut narwhal
#

Stable Video Diffusion did that animation ? Then you upscaled the result on Topaz ?

tepid stream
#

Yup

#

As well as interpolation

tawny storm
tepid stream
tawny storm
tepid stream
#

what do you mean?

tawny storm
tepid stream
#

Its not the workflow youre trying to load tho

tawny storm
tepid stream
#

Workflow is embedded

open heron
tepid stream
tawny storm
# tepid stream

Gotch.a.... many many thanks brother for the link.. its now all set!

tepid stream
tawny storm
#

Let me try my 1st Image to video

tepid stream
#

You should try to set up comfy manager tho, try to resolve the error

tawny storm
tawny storm
tepid stream
bold wave
tawny storm
#

Sorry for the noob question though.. but why there is no Preview in the Video Combine Tab?

tepid stream
#

You have to run it once for a preview to appear

#

If there is nothing to preview it wont show anything

tawny storm
tepid stream
#

Run your workflow

#

If it finishes you get a preview

tawny storm
tepid stream
#

Press queue prompt on the right

tawny storm
tepid stream
#

All good AMpepethumbsup

open heron
unkempt mica
quartz vortex
sour tulip
sour tulip
fringe condor
#

Could anyone tell me what happened? I tried to install the missing nodes I needed for SDV inside of ComfyUI and once I restarted and it tried to install everything, I am getting this fat error. 😦

#

Here's the full log

fossil atlas
#

you installed the svd custom nodepack, which is unnecessary because svd nodes are in core now. anyway that node pack tries to download svd models automatically for you, and it does so using a program called wget, which you do not have installed

fringe condor
#

So I simply should install wget?

fossil atlas
#

and it doesn't check for that possibility, so it just crashes instead

#

that may not be the actual error though

fringe condor
#

Do you think I could delete any of the recently installed custom nodes to revert it back to how it was before?

fossil atlas
#

it appears that the actual crash is something torch related

fringe condor
#

Probably a version not truly compatible with something else :c

fossil atlas
#

well you should delete ComfyUI-Stable-Video-Diffusion because it's in the core now

fringe condor
#

That got rid of the wget errors

#

Last resort momento

open heron
fringe condor
zenith depot
#

made an experiment, got the last pic of the first sequence and made a second sequence out of it and merged it

#

i wish we would have more control over it

dusk dagger
#

has anyone here prompted SVD for motion?

zenith depot
#

is currently not possible

dusk dagger
#

I have seen tweets from someone claiming it is cat_thonk are they lyin

#

will try to find them

fossil atlas
#

yeah people here have done it

#

it doesn't work very well

#

like it rarely works at all

severe bear
#

it just acts like some random noise added to the conditioning, the results are nothing like what the prompt describes

fossil atlas
#

almost always, yes

#

anyway the workflow is at that link so try it if you want

dusk dagger
#

okay, has anyone tried using a diffusion image prior, like say form kandinsky-2, to jump from text space to image space? would make sense that it's mostly just noise if it's only trained from image embeddings

fossil atlas
#

yeah, i think people tried that too, but i haven't seen any workflows

dusk dagger
#

hmmm

#

does look like the prompt is fighting the natural motion

fossil atlas
#

note that the latent output from svd conditioner is completely empty. it's just a convenience. you can try putting something not empty in to the sampler

open heron
zenith depot
#

@open heron i think it depends also on what exactly happens in the picture

balmy geyser
#

Hi,
I can't seem to find video memory requirements for stable Diffusion Video? Do you have any reference on that?

open heron
sour tulip
#

@grim tangle your parameter grid images are great. thanks!
Can you tell me some more about how prompt affects the motion? I don't understand. The prompt isn't fed into the SVD_img2vid_Conditioning node, so does the prompt actually affect the motion?

open heron
grim tangle
# sour tulip <@228118453062467585> your parameter grid images are great. thanks! Can you tell...

It's not a proper method really, it's just concatenating clip text encoding to the SVD conditioning. In my tests, in some cases very strong prompt affected the motion, mostly the word "panning" was very consistent, but other camera related words worked, if they were possible with the init image in the first place. It's not a reliable method and has adverse effects overall, just experimenting, proper official controls are coming later anyway.

sour tulip
night meadow
icy valley
quick badger
sharp herald
#

Flamingo

severe moon
glass helm
#

How long is a second of animation taking y’all to generate on home gpus? (e.g., a 3080)

severe moon
severe moon
glass helm
#

Thanks and damn! A minute well spent by the looks of it!

severe moon
#

these settings are pretty low on CFG, sampler steps and I use interpolation to smooth things out. it's pretty easy to crank up just a few values and make stuff that takes 10+ minutes to render, but obviously, you don't need to to get fun results....

glass helm
#

Makes sense - I’m trying to figure if I can just use an extra gpu to run a nightly batch job for a generative game, so lower settings are fine. sounds like a maybe

severe moon
#

hmmm, maybe. the problem with batching it is the motion you get is so random that you'd probably end up with a lot of vids that are just panning or zooming shots

severe moon
echo blade
nocturne magnetBOT
#
FAQ: I’m missing credits/having issues processing payments. What do I do?

If your account balance is incorrectly showing 0, please wait 2-3 minutes to see if the system auto-corrects. If not, please confirm with your bank to ensure the transaction completed successfully & then fill out a request using the following form on DreamStudio so the team can investigate it further: https://beta.dreamstudio.ai/support

severe moon
severe moon
latent pecan
# severe moon

reminded me of that japanese wave drawing from its structure

severe moon
grizzled spruce
#

Can I try stable video now on my personal computer?

tawny storm
grizzled spruce
tawny storm
#

or Pinokio

grizzled spruce
#

Thanks I'll try to install it.

tawny storm
# grizzled spruce Thanks I'll try to install it.

#StableVideoDiffusion Stable Video Diffusion has been Released! This easy Guide shows you how to set everything up. Run Stable Video Diffusion on your Computer today! This Image to Video Render Worflow is super easy to use and gives you stunning results.

Links from the Video

Download Workflow: https://drive.google.com/file/d/17UQXmD...

▶ Play video
grizzled spruce
#

Noice!

small heron
#

a long blonde hair girl working before the computer with McDonald hamburger on her hand

vital spindle
tepid stream
tepid stream
surreal maple
#

Hey guys, an absolute beginner here.
I want to get my hands dirty sort of speak
what is an easy way to play with these new image-to-video capabilities?

tepid stream
surreal maple
tepid stream
#

I think it runs with 8GB nowadays, I dont know about any online solutions

left zenith
severe moon
shut narwhal
#

Wooow, so dooope!

severe moon
severe moon
real flax
#

Did someone tried to add the SDXL refiner in the loop for each image?

#

it could help to stabilize the video, if there is some human faces

granite hawk
# severe moon

Hi! Could you explain to me how do you succeed creating this kind of stuff. I think that it could fit the need of some website animation. Which tool do you use?

severe moon
granite hawk
severe bear
#

somehow didn't connect that lower fps means more motion till now bobagirl

severe moon
severe moon
copper berry
#

i wonder if it's possible to use SVD to have 360° of an object and then put it into a scanner for 3d models

severe moon
copper berry
severe moon
copper berry
copper berry
severe moon
#

I think though that if the input images are very similar, like same sized subject in same location and same background, then the seed would give similar motion for both images. The more divergent the images are then the greater chance seed will just choose to do motion differently.

severe moon
copper berry
tepid stream
severe moon
copper berry
severe moon
copper berry
#

ya it's funny when it happen

bold wave
#

fantastic!! except the sword is a tiny bit wavy haha

glass helm
#

Are there any good ways to control the animation? Bucket seems to control intensity, but how control what kind of animation?

sonic path
copper berry
#

noice

keen shuttle
copper berry
icy valley
ember rune
#

any updates on metal support for mac? 😅

#

seems to be stemming from lack of Conv3D support for MPS... maybe a smart monkeh could figure it out. 🙂

open heron
#

4 different animations, 100 frames and interpolation

open heron
tidal pendant
half scaffold
soft stratus
silent hinge
#

is there a way to prevent stable video diffusion from getting so grainy

fringe condor
#

It feels like it gets stuck

sullen harbor
#

you are not using the right node

fringe condor
#

Oh that's so weird! I thought I dropped in the second example and was working with it, but I guess ComfyUI being tricky with me

nocturne magnetBOT
#
FAQ: What is Stability AI?

Our vibrant communities consist of experts, leaders and partners across the globe. They are developing cutting-edge open AI models for Image, Language, Audio, Video, 3D and Biology.. AI by the people, for the people. Learn more here stability

jolly cedar
severe moon
#

Above is with Face Restorer applied directly to video, below is applied to image first.

#

I think applying it to the video yields a slightly better quality but it has these weird glitches where it maps to the face slightly differently in a few frames.

restive berry
#

/

leaden hamlet
#

anyone know how i get the turbo scheduler to show up in comfyui? 😦

#

wow, those vids are fire

severe moon
severe moon
arctic reef
#

tried some of my figures, wasn't disappointed

severe moon
sterile mesa
# severe moon

You happen to share your workflow prior? Would love to test

severe moon
#

I usually test motions around 5 steps, and render at 12-16 but this middle guy's scales kept wriggling around so I upped to 30 for the render I posted.

sterile mesa
severe moon
fringe condor
#

I woke up and hopped on ComfyUI to try out SVD but man I can't get anything at all to work

#

Both ComfyUI example workflows doesn't work for me.

ashen tusk
fringe condor
ashen tusk
fringe condor
#

I have both checkpoints for SVD located in this directory

#

I'll test with svd

ashen tusk
#

Put them in checkpoints, that's where I have mine, and it works just fine

fringe condor
#

hmm okay

#

I still get the issue no matter what I am using it seems, or where the checkpoints are located

ashen tusk
fringe condor
#

Yeah I have an AMD GPU, but Intel CPU

ashen tusk
#

Ah, probably cuda related then.

fringe condor
prime plover
#

galaxie

open heron
onyx thicket
# open heron Raining

Looks like this is 3 vids stitched together using last frames of the previous vid. Do you have a workflow that automates this?

tidal pendant
tidal pendant
#

im trying to put in a low quality photo but its moving it like a slide show XD

tidal pendant
#

wanted to see what it would look like if i combine ebsynth with this

sterile vigil
#

Is it possible to get stable video diffusion on Automatic1111?

copper lynx
#

I tried SVD install on an older machine last week but got busy. checking back in now. anyone have the latest workflow for low VRAM GPU?

severe moon
#

This is the best video I've seen on getting specific motions so far: https://youtu.be/m-ZoxcYNWFg?si=B3IWdGjro1WJS69N

This is a comprehensive workflow tutorial on using Stable Video Diffusion in Comfy UI. Stable Video Weighted Models have officially been released by Stabality AI and support up to 25 frames per second of video generation. While it might seem that the motion generation is random, it is not. In the tutorial, I showcase 6 unique workflow examples e...

▶ Play video
copper lynx
#

what's the lowest tier GPU that people have had success in generating animations with SVD?

#

I got it to say

Sampler: EulerEDMSampler
Discretization: EDMDiscretization
Guider: LinearPredictionGuider
Sampling with EulerEDMSampler for 26 steps: 0%| | 0/26 [00:00<?, ?it/s]

3050ti in my laptop with 32gb RAM. I wonder if it will even generate anything

severe moon
#

The biggest limiting factor seems to be VRAM, I don't recall seeing anyone with less than 8GB of VRAM reporting success. Those with lower VRAM should probably be starting out with the SVD and not SVD-XT model to test their rig out.

copper lynx
#

a little movement!

Sampling with EulerEDMSampler for 26 steps: 8%|██████▋ | 2/26 [31:41<6:15:38, 939.12s/it]

severe moon
#

yikes

copper lynx
#

LOL is that just to produce one frame?

open heron
severe moon
#

maybe someone else can let you know what settings are best on lower cards but I find lowering CFG and sampler steps really decreases the generation time

copper lynx
silent hinge
#

I forgot where i installed comfy ui. Its over

open heron
# copper lynx much appreciated

Press Q and hit cancel. The 14 frames model is another model file that you need to download. You can also do smaller than 1024x576 images, that will makes things faster

severe moon
#

ouch.

silent hinge
#

Is video diffusion any better than animate diffusion?

severe moon
#

if Comfy is in the middle of a doing a node that takes forever it can be a real pain to wait for Cancel to work. in the cmd prompt terminal window you can try hitting CTRL-C to stop the ComfyUI process and then open Task Manager and find the python Process, right click and End Process

#

then you'll need to restart Comfy again

copper lynx
copper lynx
severe moon
#

SVD is the 14 frame model and XT is the 25 one

copper lynx
severe moon
open heron
severe moon
#

of course you can put whatever you want for video_frames

copper lynx