#▶|stable-video-diffusion

1 messages · Page 3 of 1

spice yarrow
fallen plinth
silent hinge
#

so when can joe average use this video stuffs?

open heron
silent hinge
#

me haha

open heron
silent hinge
#

is this just like something to dl on automatics?

open heron
silent hinge
#

interesting. thanks. ill have to look into comfy. quite frankly i thought it was horrible design

#

first time i tried it and got rid of it

#

im on a 3080 too so i guess i can run it

open heron
silent hinge
#

interesting

#

how long can the videos be? 5 second or so?

open heron
#

It's looks like that

open heron
silent hinge
#

how long until we can prompt a movie haha

#

5 years? 🤔

open heron
tender nova
silent hinge
#

what format are the files?

open heron
#

New version, look those lights/reflex moving. This thing is really cool

trim juniper
#

Anyone getting this silly error "IndexError: list index out of range"

open heron
trim juniper
glass marten
dull thunder
boreal valve
#

👀

boreal valve
#

im guessing its because svd is strictly for research and not commercial apps rn?

dull thunder
#

yes

#

his website is charging for generation using the model

quick finch
#

I'm still having issues with eyes.

trim juniper
boreal valve
#

read what he typed and also what his bio states lol

dull thunder
open heron
robust scarab
#

what gpu?

#

how long do these renders take?

teal harbor
teal harbor
robust scarab
#

im a bit outta the loop whats the min req on svd?

#

vram wise

teal harbor
#

I think I've seen people generate stuff with 10 GB VRAM

robust scarab
#

thanks ill look into it

teal harbor
glass sierra
#

I'm going OOM with 16GB 3080 and decoding_t=1, what am I doing wrong?

open heron
dull thunder
open heron
glass sierra
#

Went OOM on an L40 w/ 48Gb of ram doing the same thing... I'm definitely doing something wrong.

#

clone, make a venv, pip install, python -m scripts.sampling.simple_video_sample

open heron
glass sierra
#

and download the models, before all that

#

I never got into comfy, UIs are cancer

#

I'll look at it though, since it works!

quick finch
open heron
grim tangle
glass sierra
#

I actually just got it working on a 48Gb L40, decoding_t=4 seems to take up ~20Gb so I don't get why it's going OOM locally w/ 16Gb... 😦

teal harbor
#

activate the lowvram_mode

glass sierra
#

i'm trying not to use streamlit

trim juniper
grim tangle
grim tangle
glass sierra
#

ah, I'll just fix it then... I figured it was something like that...
I also figured they'd have tried to make the code runnable on 16Gb in their sample

#

I guess there's enough momentum they can just release whatever and everyone else will fix it 🙂

grim tangle
#

bit hasty release in that sense sure, but it's just research release

#

however the implementation in comfyUI is already excellent now

glass sierra
#

cool, I'll look at their code then, thanks!

icy valley
#

just tried it for the first time, really cool!

sand plinth
#

@grim tangle You have already generated many grids, what do you think are the best default parameters for SVD?

grim tangle
#

can always increase steps if you get something you like

#

motion bucket/aug seems to depend on the input/seed, motion amount is consistently controllable with those in most cases

#

so many things to take into account though, much to learn still!

bold wave
#

I've been playing with this all day today.

  • rife with a 4x multiplier helps, otherwise, it looks pretty jittery
  • on a 12 GB card, I can do seemingly as many frames as I want.
  • CFG 6.0 seems to yield more 'active' results. I can't help but feel that results vary though haha.
  • bucket set to 50 has the best results across the board that I have seen so far
  • I moved denoise to .75 instead of 1.0 and it seems to be helping. could be voodoo magic though.
icy valley
#

This is all I seem to be able to do so far

icy valley
#

I have yet to try img2vid tho

hazy gorge
#

Which graphics card would be better to buy for SDV a gtx 1080ti 11gb or Rtx 3070 8gb?

bold wave
# trim juniper will try this now

here's my setup so far. I have had a harder time managing the SVT_XT for some reason, so I just do SVT. I use facial restore as well, sometimes the facial changes look a little janky, so that normalizes it.

severe moon
trim juniper
#

I will add the face restorer to it didnt even think of it

#

ty

harsh storm
severe moon
#

Same input as before but I upped frames to 48 and changed denoise in KSampler to .66 - behold wiggly jello planet

severe moon
sterile mesa
bold wave
#

running in cli may or may not have better results though catlook

sterile mesa
severe moon
#

I got it running in CLI originally but have had best results with ComfyUI so far on the latest portable build. It has a good set of Torch/CUDA built in that you may not already have setup if you run from CLI. So I recommend anyone trying this to start with ComfyUI....

bold wave
#

its certainly easier and more customizable in comfy as well. and - you can throw in interpolation, which looks ten times better.

severe moon
#

yeah it's pretty janky with low fps if you want to render in reasonable time

sullen harbor
#

1000 series nvidia is slow so you should avoid it even if it has more vram

#

3000 series is going to be much faster

trim juniper
#

so far generations are taking 45 seconds to 1 minute

#

with 25 frames

bold wave
#

thats amazing. how many samples are you using? mine take a couple minutes at 25 samples

trim juniper
#

samples as in steps?

#

30

icy valley
#

Pretty insane how easily it can do this with txt2vid tbh

severe moon
icy valley
#

Eh I've already seen 25 people doing that

bold wave
#

trying photographs now. cant share the results though haha.

next im going to try scenery shots. should have good results.

icy valley
#

the default values in this workflow made a scenery shot

#

no idea why this pans right but when I changed the content it starts to zoom in

trim juniper
icy valley
#

Okay here we go

#

Starting to get more interesting

#

It's like it masks and everything for you somehow

severe moon
#

i think it depends on the colors of the input image, stuff with colors that imply depth tend to get more of that layered look and zooming motion from my experiments

silent hinge
#

what are prompts like for this stuffs?

#

and so you have to use it like image 2 image??

icy valley
#

I'm making these txt2vid

trim juniper
icy valley
#

Yup

quick finch
icy valley
#

I'm not even using control nets or anything

#

It's able to do this pure txt2img generation with just my LoRA, it's actually pretty sweet

quick finch
icy valley
# quick finch great, I don't have a Lora for that, so if I need a logo, or Text on an Image I ...

Yeh qrmonster is great, but that's for more placing text on the canvas and then generating over it. With my model, you generate the entire image from scratch without anything else, no control nets, img2img, text from fonts, etc https://civitai.com/models/176555/harrlogos-xl-finally-custom-text-generation-in-sd

Introducing Harrlogos v2.0! Solving Stable Diffusion text generation one LoRA at a time! THIS DOES NOT REQUIRE CONTROL NET TO WORK Harrlogos Create...

quick finch
icy valley
#

Awesome! I'd love to see what you create.

#

OlivioTutorials actually just did a video on it tonight, with a pretty thorough walkthrough for using it

quick finch
icy valley
#

If you take a peek at the gallery on the bottom of the model page, you will see that people have come up with some insane stuff already

severe moon
latent pecan
#

oh wow, could script an entire story with this

icy valley
icy valley
severe moon
icy valley
#

Rainbow + pixel art = HarFlames

#

Hmm I wonder if the graffiti style would have it parallax from the wall 🤔

latent pecan
sand plinth
icy valley
latent pecan
icy valley
severe moon
icy valley
#

I dunno what causes that waviness tbh but when I got rid of it, all I get now is zooming straight in and out

severe moon
#

it's just sort of random, the training vids only had a few types of movement so changing the seed for the sampler after the SVD node has the most effect on type of camera movement

icy valley
#

Are you guys doing txt2vid?

latent pecan
#

testing out 404 now

icy valley
#

Although I guess it's the same, you're just making the image to send it in the same workflow

latent pecan
#

well txt to image and image to SV

severe moon
icy valley
#

that looks awesome 😂

#

YEP AWESOME

latent pecan
icy valley
#

Welcome to the zoomgang

severe moon
#

the seed before this for the one above was just a straight zoom, same img input....

icy valley
#

although I guess I've defected to GlitchGang

open heron
latent pecan
icy valley
latent pecan
#

lora_panrightv2 mixed with your text lora might do it, but probably a bit more technical than that 😄

icy valley
#

Just Harrlogos and the 404ra so far. But Harrlogos is unbelieavably versatile.

#

Oh are you guys using AD motion loras with it?

latent pecan
icy valley
#

Cuz like if you see the first animation I did

#

the motion is completely different

#

And ever since adding in my LoRA I havent been able to achieve anything like this again

latent pecan
#

tried to make the text 555

icy valley
#

hahahahaha yeah the 404ra is trained to do only that, although a buddy of my made it to 1337 before

icy valley
noble wolf
#

can this run with 8gb?

icy valley
#

Supposedly

latent pecan
icy valley
#

then yes

noble wolf
#

Ayoooo lets go

trim juniper
noble wolf
#

I need to learn comfy, yall know a good guide?

icy valley
#

Yes, I'm using my Harrlogos LoRA to do the text

severe moon
#

i have made so many that don't actually say CATGIRLS that I gave up, have a CATRIS instead.... lol

latent pecan
#

maybe do individual letters 😄

noble wolf
#

Is there something like a controlnet for video yet?

silent hinge
#

lol it just came out dude

sand plinth
icy valley
noble wolf
#

I'm not expecting there to be anything like contrlnet, I'm just saying it doesn't hurt to check with how fast this shit is going

icy valley
#

I can tell in the first 2 seconds if the spelling missed so I can try again

severe moon
icy valley
#

Hm I dunno, rarely takes me more than a couple tries, maybe it's the specific text, or your resolution

latent pecan
icy valley
#

LOL

#

Science Fiton

#

My favorite genre

latent pecan
#

yea, futuristic pants you fit on 😄

severe moon
sand plinth
latent pecan
icy valley
#

Looks awesome

latent pecan
icy valley
latent pecan
#

seed 425324399642679

#

try that seed out 😄

icy valley
#

Ahah that little ear wiggle

noble wolf
#

yo is there a way to increase font size in comfy ui?

severe moon
noble wolf
#

sadge

#

I dont wanna zoom in when I'm messing with my prompt, I mean i don't need to but it's just nice to look at

severe moon
noble wolf
#

Yooooo that's dope

icy valley
severe moon
#

that magic seed man: 425324399642679

icy valley
#

Thats probably the coolest shit I've seen yet

severe moon
#

i like how it just randomly flipped the G for no reason

icy valley
severe moon
icy valley
severe moon
#

it jsut does it

severe moon
#

they're really nothing special tho, just seems random how the motion will turn out

icy valley
#

It looks awesome! Thank you for sharing I'm gonna try it when I'm back at my pc

severe moon
icy valley
#

That's actually sick because chrome is an activation word for my model I trained to do just that

noble wolf
#

magic man

severe moon
#

just like it always seems to like animating anything cloud or smoke like in an image:

icy valley
#

I bet space stuff will work pretty well

severe moon
icy valley
#

That's awesome, what model are you using?

severe moon
#

Midjourney, that's just straight img2vid

silent hinge
#

Is there a way to download stable video diffusion and run it locally?

robust scarab
#

yes that what people are doing here you need to use comfyui though

silent hinge
#

Thanks. Will it work on my 6gb vram

severe moon
robust scarab
#

theres a auto version?

robust scarab
silent hinge
#

Alr thx

severe moon
#

hmmm. i've seen people getting it to work with 8GB VRAM, no one has mentioned 6.... try it for science and report back to us all!

silent hinge
#

Ok. Btw are there any tutorials for stable video diffusion?

severe moon
severe moon
silent hinge
uncut axle
#

A classic banger brought back

severe moon
# silent hinge Wdym

like how to install ComfyUI or the streamlit version and set up a workflow that has SVD in it.... there's not a lot about how to get good results yet

silent hinge
#

I got ComfyUI today I just want to be able to get SVD running lol

severe moon
#

there might be some for running a Google Collab version but i haven;t checked any of those out

latent pecan
#

comfyui standalone _=-> comfyui manager -> grab someones image workflow with SV -> install missing custom nodes -> restart comfyui 😄

noble wolf
#

wait where do I download img2vid?

sullen harbor
severe moon
#

yeah it's all built into the latest ComfyUI, thanks @sullen harbor !

robust scarab
#

is it possible to add text conditioning?

silent hinge
#

Can you use custom 1.5 models for stable video diffusion?

severe moon
#

so it's img2vid workflow..... how you get the img is up to you

sullen harbor
#

it's possible to pass these models text conditioning but it might not work very well in practice

severe moon
#

you can easily attach your favorite 1.5 workflow to make the initial image....

silent hinge
#

Alr thx.

sullen harbor
#

you can use the conditioning concat node with "conditioning_from" as the output of a clip text encode connected to the SD2.x CLIP model

latent pecan
#

@next acorn this is their workflow they shared on deforum discord

robust scarab
#

what does augmentation level do?

icy valley
river mauve
#

SDXL > SVD > TOPAZ

icy valley
#

Damn that quality is wild

severe moon
icy valley
#

That's goddang double cheating

trim juniper
severe moon
trim juniper
#

lol

noble wolf
#

anyone get this error for img2vid?

robust scarab
#

did you download the svd model?

river mauve
robust scarab
icy valley
sand plinth
tender nova
#

So is there a way in comfyui to make a node chain that goes from txt2img straight into SVD img2vid in one generation?

trim juniper
#

Nvm found it ty

#

Ayway to get the js

sharp kestrel
noble wolf
#

What is freeu?

severe moon
noble wolf
#

and where do i disable it

devout flame
#

hi, regaring svd and full-body videos, i wonder if theres a way to fix the awful faces in them?

#

is it possible atm?

devout flame
trim juniper
#

ive been using it

devout flame
trim juniper
icy valley
#

I have a simple txt2img workflow if anyone needs

devout flame
#

saying i dont have a module called "segment anything

icy valley
#

Did you disable FreeU

devout flame
#

how do i do that?

noble wolf
#

yes please xD

#

how do you do that?

#

I was asking like 5 mins ago

devout flame
noble wolf
#

I found like one reddit post but don't think its relevant to this

#

but maybe idk

devout flame
#

yeah im so confused

noble wolf
#

are you getting the same error I posted earlier?

devout flame
noble wolf
#

hmm

fallen plinth
trim juniper
#

im getting cursed images when doing t2img to img2vid

severe moon
severe moon
devout flame
devout flame
trim juniper
#

maybe the lora is the thing messing things up

devout flame
#

it happens when i add the custom nodes to the folder, i start it up and it gives me an error saying i dont have segmentanything

noble wolf
#

I just loaded up the default workflow

#

and its not working

severe moon
trim juniper
#

not much

#

might have to add an upscaler

#

im just using common sense idk what im doing

#

lol

severe moon
trim juniper
#

1.5

severe moon
#

yeah maybe a double ksampler upscale type workflow to add details

trim juniper
#

let me check

latent pecan
#

this is iskariots workflow , rename this to .json, you may want to change the default resolution to 1024x576 to run on 8gb card

#

the lora node thing required me to install custom nodes, thou you can disable it, its not needed

#

put this in your SD models folder

severe moon
latent pecan
#

damn this didnt animate at all :\

noble wolf
#

master epic E

sharp kestrel
arctic reef
#

Could you please share your comfyui workflow? These results are fantastic!

icy valley
sharp kestrel
latent pecan
latent pecan
sharp kestrel
sharp kestrel
devout flame
#

ok my next issue, anyone got a workflow json for facedetailer and svd?

latent pecan
#

if i had a clean workflow id export but i have a whole bunch of purple nodes to ignore for now

robust scarab
#

what does the fps setting do? does it increase the context?

latent pecan
#

installing facedetailer breaks my opencv, i made a custom .bat file for any nodes that break opencv (a lot of nodes seem to! ) .\python_embeded\python -s -m pip uninstall -y opencv-python opencv-contrib-python opencv-python-headless
.\python_embeded\python -s -m pip install opencv-python==4.7.0.72

trim juniper
#

idk txt2img to img2vid not worth that much

severe moon
latent pecan
warm acorn
sharp kestrel
latent pecan
sharp kestrel
#

just like you download a lora, there's a small post button on them... it helps creator stay creative 🙂

#

the add post button

latent pecan
#

ok done, thou it posted as image not gif

latent pecan
#

(with face restore)

#

is it worse :D?

#

face restore probably better for mid shots / sd1.5 models

severe moon
latent pecan
noble wolf
#

Yooo I finally got it working lets gooo

#

Now how to download them as mp4...

latent pecan
#

it puts them all by default n comfyui/output

#

you just change that last node from gif/mp4

noble wolf
#

all I see are those

autumn spindle
noble wolf
#

Ty brother ❤️

latent pecan
sterile mesa
#

fuck i've been sleeping on XL and comfy. Just installed that shit and I'm jawdropping on the ease of customization

noble wolf
#

yeah it's awesome, especially with comfy, just downloaded it a few hours ago

#

so poppin

#

I'm trying to figure out where to place those video format custom nodes xD

autumn spindle
#

Do you have ComfyUI-Manager installed?

noble wolf
#

uh nah?

#

idk

autumn spindle
noble wolf
#

I just installed it and a few other things

autumn spindle
#

It'll help you get the custom nodes

noble wolf
#

ty ty

autumn spindle
#

Especially the Install Missing Custom Nodes command

sharp kestrel
noble wolf
#

gawdam it's a holy grail

#

I'm evolving

sharp kestrel
noble wolf
#

and it's so efficient I can do a bunch of shit while generating videos on an 8gb card

#

lets feckin go

latent pecan
sharp kestrel
#

i asked for 100 frames

latent pecan
#

oh 😄

#

wonder if that works on my 8gb card

sharp kestrel
latent pecan
sharp kestrel
#

not everything turns out good right away

noble wolf
#

nightmares are my fav yo

#

deep frying with stable diffusion, truly revolutionary

sharp kestrel
sterile mesa
#

it is 1am and im just installing nodes and workflows. need to sleep

noble wolf
#

literally same

#

got my grubby hands on the manager and am downloading everything

#

Although I wish there was something like tabs you could save as a system instead of workflows, but I guess I'll get used to it

latent pecan
#

i find it hard to get a good naming convention 😄 so i just put every main node in the file description 😄

#

its pretty awful for qol

silent hinge
#

yo how are u guys generating with this sd video

#

do u guys have 40 gigs of vram bro

noble wolf
#

comfy ui is super efficient for me at least, using 8gb vram

silent hinge
#

damn so if I have comfy ui I can get svd

noble wolf
#

Yesss

#

come join us brother

#

its so ez

#

Today we cover the basics on how to use ComfyUI to create AI Art using stable diffusion models. This node based editor is an ideal workflow tool to leave how AI art is generated, but also how you can really mess with the internal elements much more than you can with any other AI Art interface out there today. #comfyUI #stablediffusion

Install ...

▶ Play video
silent hinge
#

nvm it isnt

#

but is there a tutorial to how to get svd

noble wolf
#

I think someone said it was the unofficial official guide

#

oh for svd its ez

#

get this and use the manager to auto install svd on comfyui

latent pecan
grim tangle
#

SVD comes built in comfy now, the custom nodes should not be used

silent hinge
#

anyways I only have 6gb of vram but soon im gonna buy new pc

latent pecan
grim tangle
latent pecan
sharp kestrel
#

so is anyone keeping track on the seeds and directions or is that a myth?

arctic reef
#

saving raw video generation, and a second one after postprocessing (face swap, upscale)

#

mostly gets the job done

latent pecan
latent pecan
narrow canyon
#

Is there a node that will save the generated frames out as individual images?

#

I just found ComfyUI-VideoHelperSuite, been hunting down that VHS_VideoCombine node I've seen in some workflows

sterile mesa
narrow canyon
#

Just trying to figure out how to install it

sterile mesa
#

Thanks a ton. I've got a long day of comfy ui setup tmw. Was testing it tonight and very impressed

dull thunder
robust scarab
#

what does the fps setting do if the final frame count is always the set amount?

latent pecan
clever pendant
robust scarab
#

does fps increase vram usage?

grim tangle
narrow canyon
#

It's my first day using Comfy

grim tangle
narrow canyon
sharp kestrel
#

which one of these 2 guys use?

robust scarab
#

model

grim tangle
#

who named those, what

narrow canyon
#

Is the decoder model needed for the example workflow?

grim tangle
#

it's not needed for anything, it's just named awkwardly

narrow canyon
#

Oh good, I can save 9gb then

sterile mesa
sand plinth
grim tangle
#

I'm not an expert on that though

arctic reef
river mauve
woven fractal
#

Howdo I get stable video diffusion?

grim tangle
woven fractal
#

it doen't seem to have any interface like A1111

woven fractal
#

thanks

grim tangle
woven fractal
#

So I updated it

#

and I put th ejosn in

#

I keep on getting an error

#

Error occurred when executing SVD_img2vid_Conditioning:

'NoneType' object has no attribute 'encode_image'

#

that's the error

grim tangle
#

did you change the image?

sand plinth
sharp kestrel
# grim tangle

cfg scaled 2-3 means that you start with 2 and up to 3?

grim tangle
sharp kestrel
#

so you start at 2 here?

#

ending in 3 here

#

?

#

is 1 to 2 not enough?

grim tangle
#

Depends on the video, sometimes it benefits from more, but it can also burn it

#

Still testing different values, these aren't meant to be optimal or anything

icy valley
#

they seem to be on a video case basis as well

grim tangle
#

Slow video burns if the end cfg is high while faster can benefit from increased detail against the blurriness that occurs

wicked wren
grim tangle
wicked wren
grim tangle
#

not too user friendly, just made it for myself to test stuff

wicked wren
#

So cfg is something different in video (comfy UI) from cfg when doing stills in auto1111? Correct?

grim tangle
wicked wren
grim tangle
arctic reef
hazy gorge
#

So far, which one do you think is better, Stable Video diffusion or Pikalabs?

sonic stirrup
#

is it p[ossible to run svd on an amd gpu?????

wicked wren
somber oar
#

The subtle movement of hair and clothing

clever pendant
frigid grail
#

Ah, so ComfyUI SVD not working for Max silicon - right? (I get the conv3d is not supported on MPS error)

arctic reef
solemn turtle
# sullen harbor For the examples on here you only need comfyui itself, no need for any custom no...

Hi, im getting this error while running on the Mac. 'NoneType' object has no attribute 'tokenize'

File "/Users/../ComfyUI/execution.py", line 153, in recursive_execute
output_data, output_ui = get_output_data(obj, input_data_all)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/
../ComfyUI/execution.py", line 83, in get_output_data
return_values = map_node_over_list(obj, input_data_all, obj.FUNCTION, allow_interrupt=True)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/ComfyUI/execution.py", line 76, in map_node_over_list
results.append(getattr(obj, func)(**slice_dict(input_data_all, i)))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/*../ComfyUI/nodes.py", line 55, in encode
tokens = clip.tokenize(text)
^^^^^^^^^^^^^

granite tartan
#

what's the best way to upscale this and improve the quality of the faces?

arctic reef
haughty garden
arctic reef
granite tartan
flint orchid
arctic reef
#

before and after postprocess (my workflow outputs both versions)

solemn turtle
#

Yo did anyone run stable video with comfy on a mac?

half hound
half hound
#

@grim tangle Do you know what nodes you can use to export each frame of the video?

grim tangle
#

it will save every frame in the batch as .png

half hound
#

thanks!

bold prawn
silent hinge
#

so what do you need for this? comfy and a certain model?

silent hinge
#

how do you prompt? like do you prompt the camera movement?

arctic reef
silent hinge
#

so just prompt like an image?

arctic reef
#

no, you put image and it is a starting point

silent hinge
#

interesting. start image and end image?

arctic reef
#

just the start image (i'm also using face restore image for face correction)

silent hinge
#

looks like algebra

#

more like confusingai haha

#

really, that couldnt be any less 'comfy'

severe moon
frigid grail
#

What are you all using for frame interpolation?

arctic reef
# silent hinge more like confusingai haha

Not really. It is so flexible, that I can have face reconstructed, upscaled, added interpolation frames and written as 2 separate videos. All with a single click.
This is far from the most simple workflow.

severe moon
silent hinge
#

nice. im dl'd it now so i'll be asking some questions im sure haha

#

what model do i need?

silent hinge
#

noice. cheers for the link

half hound
#

Is there a node that can key out green from the image?

silent hinge
#

do i need all the files?

half hound
#

can you layer videos on top of each other in comfyUI?

severe moon
# silent hinge noice. cheers for the link

If you install ComfyUI then also highly recommend to install comfyui manager, then you can just search for models directly inside comfy, there should be install videos earlier in this chat that cover getting SVD running in comfy....

flat crystal
silent hinge
#

any ideas?

#

was trying to run_nvidia_gpu from comfy to extract

#

guess not haha

severe moon
severe moon
silent hinge
#

id id then i got that error i linked

severe moon
#

did you extract the entire .7z file into a new location before trying to run it?

silent hinge
#

maybe i screwed that up

half hound
#

Maybe that can be used

severe moon
silent hinge
#

oh there it finally worked, i did change the directory first time to windows idk

half hound
silent hinge
#

ok so now i have comfyui up what is my next step

#

should i run the update thing first?

half hound
#

is it a fresh install? Than no. Load in the json workflow

silent hinge
#

how do i do that

#

yes fresh install

half hound
silent hinge
#

and img2video model just goes in checkpoints right

half hound
#

download this jsons by right clicking and save link as

#

then open in comfyui

#

one of them

#

you can also take a look at my tut and follow along https://www.youtube.com/watch?v=hoIobzZmNiM

An easier way to generate videos using stable video diffusion models.

Stable Video Diffusion ComfyUI install:

Requirements:

ComfyUI: https://github.com/comfyanonymous/ComfyUI#installing

ComfyUI-Manager: https://github.com/ltdrdata/ComfyUI-Manager

SDXL: https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/main
you can use any...

▶ Play video
#

just skip to the end

silent hinge
#

where do i put it when i save it

half hound
#

wherever you want. I put mine in downloads

silent hinge
#

and put them all in checkpoint?

half hound
#

just this one svd.safetensors

silent hinge
#

nice

arctic reef
silent hinge
#

they go in checkpoints like reg models right?

half hound
#

yes

severe moon
#

yeah XT was trained on longer videos, once you get it all running and know your rig can handle SVD you'll probably use SVDXT the most....

silent hinge
#

ive got a 3080 with 12 gig hopefully i can do something and not melt it haha

arctic reef
teal harbor
half hound
#

yeah that would be great

severe moon
silent hinge
#

using the example, so you just load an image in and hit queue prompt? nothing more to it, not actual prompting or other guidance?

half hound
#

why do you need to convert to gif? Can't you just use the video combine node and then select gif in comfy

silent hinge
#

so roll of the dice it looks good or get something good haha

teal harbor
silent hinge
#

and i take it this json i dl'd is optimized no need to fiddle with steps, cfg etc?

half hound
severe moon
silent hinge
teal harbor
severe moon
#

nice

silent hinge
#

from image to video already, even if just a few seconds

#

crazy times haha

half hound
severe moon
#

here are my current value settings:

half hound
#

these 2 settings

silent hinge
#

nice. appreciate the example dragon

half hound
#

video_frames max for SDXT is 25

severe moon
#

these models were both trained on 1024x576 pixel videos so you don;t want to stray too far from those dimensions

half hound
#

I got some decent outputs using 1024x1024, but I haven't done that much testing on it and the renders took a lot longer.

severe moon
#

yeah sqaure ouput with sq inpiut seemed to do well. i tried vertical tiktok sized videos and things got weird

#

when i did sq i did it around 768x768

half hound
#

how did it look at that size?

severe moon
#

you want to multiply the numbers and get close to 1.5M

silent hinge
#

what setting sets the length?

#

video_frames id guess?

#

and what is motion bucket for/do?

teal harbor
#

video_frames * fps = length in seconds
motion_bucket_id = amount of movement (higher number means more movement)

severe moon
half hound
#

man I wish I tested out segment anything more. I am going to have to watch some tutorials on it

#

is there an SDXL segment anything model out?

severe moon
#

the parameters are covered on the official ComfyUI Video Examples page but here it is again for our discussion:
`Some explanations for the parameters:
video_frames: The number of video frames to generate.

motion_bucket_id: The higher the number the more motion will be in the video.

fps: The higher the fps the less choppy the video will be.

augmentation level: The amount of noise added to the init image, the higher it is the less the video will look like the init image. Increase it for more motion.`

teal harbor
severe moon
#

in my workflow I output at 8fps from the SVD node and then use interpolation to smooth the framerate back to 24fps:

bold wave
severe moon
harsh storm
bold wave
#

ahhh, that makes sense. Thanks!

harsh storm
#

The node for RIFE is RIFE VFi

severe moon
harsh storm
bold wave
#

OK so - for more realistic shots, upping the resolution to 1024 really helps. however, it gets super janky with artistic images.

also using upscale moved a 24MB file into a 700MB file thomas proooobably not viable lol.

severe moon
teal harbor
#

are you upscaling it to 16K or what?

half hound
bold wave
severe moon
#

oh i see. i've seen other methods where you take the first ksampler output and then run it through a basic pixel multiplier and controlnet to increase dimensions and then another ksampler with lower denoise value to get more detail. that was for animatediff workflows. i'll probably try that on the SVD workflow today

bold wave
#

oh yeah, ive done that before too, maybe i should give that a shot. this one is where A1111 feels like it has superiority, you just kinda run it.

at that, I could probably apply a little bit of a prompt too.

severe moon
#

yeah

sharp kestrel
#

not sure why, but all my recent ones were super shakey or strange

severe moon
sharp kestrel
#

using this model

severe moon
sharp kestrel
#

yea

#

now running one with the non compressed

severe moon
#

are you changing the seed randomly or using same seed for all of them?

sharp kestrel
#

randomly

severe moon
#

some seeds are just shaky

teal harbor
severe moon
#

what's your motion bucket set at?

thorny niche
sharp kestrel
sinful vine
thorny niche
#

SVD is nice but soooo slow

sharp kestrel
teal harbor
sharp kestrel
teal harbor
thorny niche
sharp kestrel
#

i got good 80 frames earlier

#

will try with normal euler

teal harbor
sinful vine
#

@sharp kestrel originaly yes but some of that was run through SD afterwards and then into SDV

sharp kestrel
#

just making sure i'm still on edge 😛

sinful vine
#

No i didnt take it that way 🙂

#

Thanks

sinful vine
sharp kestrel
severe moon
teal harbor
# severe moon

I'm looking forward to the day when we get AI that will make the gears and what not on her head move

sharp kestrel
#

i was trying so hard to get some hears with animated diff last night.

sand plinth
sharp kestrel
#

you get the blinking from SVD?

sand plinth
sharp kestrel
#

nice

sharp kestrel
bold wave
#

I feel like at 512x512 there's a bit of loss, though. Trying 768 and hoping for a happy medium haha

sand plinth
sharp kestrel
#

and it works?

sand plinth
sharp kestrel
#

lol

silent hinge
#

lol now i know why nobody responds to my statements... i was in the openai dalle discord talking about this haha

#

but damn this is wild, like moving lips, hands, not just pan and scans on some images

severe moon
bold wave
#

I did my son's school photo and his hair and fingers were moving, it was pretty cool.

silent hinge
#

yeah some of the motions surprise me, i love it. is there a default max length?

#

i wonder how far away we are from even 1 minute of video

bold wave
silent hinge
#

i'll give 64 a shot then

#

I was trying to generate a video but I got "ComfyUI_windows_portable>pause
Press any key to continue . . ." and when i press a key it stops comfyui what did i do wrong

#

this happens running both cpu and gpu

bold wave
severe moon
bold wave
# severe moon

dang dude. that's amazing. its crazy how a few seconds of video can make it so much more immersing.

severe moon
silent hinge
#

like what i'm new to comfyui

bold wave
severe moon
silent hinge
#

yeah dragon that is awesome.

#

did you use dalle for the image?

severe moon
#

Midjourney

silent hinge
#

noice

#

what is motion bucket id?

silent hinge
bold wave
# severe moon Midjourney

Mind sharing your workflow? I just want to compare notes. I feel like I am close, but yours feels just awesome.

silent hinge
#

give us the secret sauce dragon haha

severe moon
#

ummm... be good at midjourney... lol

silent hinge
#

im bing/dalle until 6.0 haha

bold wave
#

768 is significantly better, it seems. here's a side by side. only thing is though, lower res seems to move around more.

#

so maybe 512 with upscale would be the trick.

silent hinge
#

where do you upscale?

severe moon
#

i mean actually though midjourney does all the heavy lifting, I didn't even spell steampunk right:

silent hinge
#

what about your comfy settings though is what i think we meant

silent hinge
#

33 for motion bucket just a random you chose or reccommended?

severe moon
silent hinge
#

this shiz is fun as hell though

severe moon
#

just having a good source image is half the battle

#

like i was experimenting with text overlays in photoshop so this without text seems fine

silent hinge
#

nice

bold wave
# silent hinge Here's my workflow

I dont see anything that stands out. 14 FPS might be a little high - you can interpolate that. Also, I don't like WEBP at all, if you do video combine, you can save as MP4 or GIF. I do gif for now because it is the best for sharing online.

silent hinge
#

alr thx

bold wave
#

if you can screenshot the error - if you have one - it might help.

silent hinge
#

where do you change the file format?

bold wave
#

use that one instead of the webp one

silent hinge
#

i must be blind i dont see it

#

but im new to comfy

severe moon
bold wave
#

oh. do you have a setup for custom nodes?

#

load mine in... that should ahve the metadata in it, if discord doesnt delete it

silent hinge
#

im using a json i downloaded from somewhere, its been so much going on im not sure haha

severe moon
silent hinge
bold wave
#

if the metadata is there, itll load. if not, i can share the json output instead

sharp kestrel
silent hinge
#

yeah im not sure it changed if you could share the json thatd be tight

#

im sure im just doing something wrong

#

when i press a key to continue it just terminates comfyui

#

and hmm, should i have a VAE or not needed?

bold wave
sharp kestrel
silent hinge
#

awesome thanks raydestar

bold wave
silent hinge
#

looks like i need some files maybe

#

When loading the graph, the following node types were not found:
RIFE VFI
FaceRestoreCFWithModel
FaceRestoreModelLoader
VHS_VideoCombine

severe moon
silent hinge
#

not sure, i just dl'd comfy an hour or so ago so whatever is standard id have i guess

severe moon
bold wave
#

once you get the manager

severe moon
#

after Manager is installed when you load someone else's workflow it'll help you find all the Missing Custom Nodes and models

bold wave
#

and run updates. lol everything updates all the time, it's a result of being on bleeding edge tech

silent hinge
bold wave
silent hinge
#

after updating should i restart comfyui

severe moon
silent hinge
#

yesterday evening

#

thanks for all this help btw everyone

#

oh yea thx too

#

woot and the manager appeared

bold wave
severe moon
silent hinge
#

yes

#

cpu is unreasonably slow

#

hold on it might bee working

#

80 seconds/it lol

severe moon
#

I had problems with my older ComfyUI running the new update after they added SVD so I grabbed a newer build:

silent hinge
#

alr

#

I'll report back in a few mins when it either finishes or errors out

barren hound
#

Has there been made any locally ran video diffusion yet? thinky

severe moon
#

that one has newer torch/cuda built in which is needed to do run the SVD model

bold wave
#

OK this one is using dragons setup. At first I tried at 512x512, and booyyyyy it did not like that. Also, I was using SVD before

silent hinge
#

what model is that instead?

bold wave
#

SVD_XT

silent hinge
#

whats the image decoder version of the model for?

bold wave
half hound
#

I think I am close to figuring out being able to create layers in the SVD anyone have any insight in this error? @grim tangle @severe moon

severe moon
silent hinge
#

btw how can i speed up loading the SVD_img2vid model while keeping it in low vram mode?

bold wave
#

there's that fp16 version, you might try that. I dont know if it offers quicker load speeds, but it could.

once it's cached though, you don't need to wait so long.

silent hinge
#

where does it cache

bold wave
#

lol this turned out just awesome.

silent hinge
#

lol I love it

bold wave
severe moon
silent hinge
#

Also were do i download fp16 version

severe moon
#

hmm someone linked to it earlier

bold wave
severe moon
bold wave
#

theyre safetensors though, which is good

silent hinge
#

alr

wicked wren
bold wave
wicked wren
#

When this thing get easier UIs and better control of motion / expressions etc it will be revolutionary for sure

silent hinge
#

Just wondering, whats the lowest vram someone has been able to run SVD on so far?

severe moon
wicked wren
#

One thing that would be cool is embedded alpha channels to be able to use layers in a VFX environment

surreal haven
#

While I was running this piece of code, I encountered an error. Can you tell me what the issue is? How should I go about fixing it?

silent hinge
#

what does this mean? (You shouldn't move a model when it is dispatched on multiple devices.)

sharp kestrel
silent hinge
#

HOLY CRAP IT WORKED ON 6GB VRAM

silent hinge
#

I was dumb and didn't set the resolution right so it is cut off but it worked

surreal haven
silent hinge
#

only took 30 mins lol

sharp kestrel
half hound
#

I did it boys I figured out how to layer videos in ComfyUI. @severe moon

#

needs to do some cleanup

#

but the nodes are working

silent hinge
#

you mean 2 'videos' together? noice

half hound
#

yep

silent hinge
#

sweet

silent hinge
sharp kestrel
half hound
#

yeah something like that

surreal haven
barren hound
#

Shit, managed to reply to the wrong. Sorry bout that

half hound
#

I am going to try an example with 3 layers and cleanup the nodes then I'll share it

severe moon
sharp kestrel
silent hinge
#

If SVD can work on 6gb vram, could it theoretically, just very slowly, work on 4gb or even 2gb? I have some cards like that I could try if I wanna suffer lol.

bold wave
wicked wren
silent hinge
#

My cpu is much slower than my 1660 ti

barren hound
half hound
wicked wren
severe moon
silent hinge
#

nice

#

hae you guys played with samplers or is euler the only one video works on?

severe moon
severe moon
#

i even tried LCM it looks all blurry most of the time tho

silent hinge
#

which do you prefer dragon?

#

are you sticking with euler?

severe moon
#

i'll try a different one but i don't think it makes a huge difference so euler seems fine for testing stuff out

bold wave
silent hinge
#

sticking with euler raydestarr?

scarlet bay
#

https://www.youtube.com/watch?v=NN8jfMZVzZ8
runway x stable diffusion colab

#runway #mixtape #aianimation

This is the second track of my upcoming mixtape. This track is titled 'Lightyears' it's a bit of an experiment, both musically and audio-visually hehe.

I wanted to produce something a little spaced out and eclectic, fusing a few influences and lots of colours, cause we could use some colour in these grey saturate...

▶ Play video
barren hound
#

Can someone link to the custom nodes needed for video diffusion in the pins?

barren hound
#

Ah, so i'm simply just outdated :P

half hound
#

yep

bold wave
#

There's a fine line between really bad and really good catlook I hope I didnt just mess up the settings with my next iteration.

silent hinge
#

whoa

#

Why's it so blurry?

severe moon
silent hinge
#

holy shit

#

i cant believe how good some of the motions are

arctic reef
severe moon
teal harbor
bold wave
# severe moon

throw in facial restore, it might fix the janky eyes. that looks way good.

silent hinge
#

oh yeah how do i get facial restore again

#

now that i have the manager

#

oh i think i see

bold wave
#

I use the model, and the loader.

silent hinge
#

sweet. nice and ez

severe moon
silent hinge
#

lol

bold wave
severe moon
silent hinge
#

should i insall all facerestore? theres 3 options

arctic reef
bold wave
#

a lot of this is just experimentation and seeing what works best for you. you can always reload my model and then say install missing nodes. you might have to download the face restore model separately.

silent hinge
#

whoa

arctic reef
severe moon
silent hinge
#

those are wild jj

arctic reef
silent hinge
#

so if i installed face restore gfpgan 1.4 once its installed and i refresh im good? nothing to click or check in the ui right

arctic reef
#

I'm digging through my best generations from SD 1.5

barren hound
#

hmm, what can cause the blurryness?

bold wave
barren hound
#

Just set res to 1024, fps to 25, 5 second clip and motion to 255

sterile mesa
noble wolf
#

what do seeds control for img2vid, how it moves?

silent hinge
#

i think the way its moves is kind of random/rng, but that is more i believe motion-bucket-id, sampler, fps, steps

noble wolf
#

motiona-bucket id is how much it moves, fps is just how fast it goes through images, and steps is just how detailed each image is gonna be with newly generated parts no? That's why I was assuming seed had something to do with the "how" of it moving, maybe not idk. It takes too long for me to want to test each individual setting. I need to get more vrammmm

#

cause this one is only 20 steps, but 100 motion-bucket id, so its moving a bit, but the steps can't keep up to make it look decent, at least I think?

silent hinge
#

oh yeah, how do i change the format for the file again?

noble wolf
#

install that through comfy ui manager

#

do u have the manager?

#

and then there should just be a custom node called "video combine vhs"

#

has lots of formats

silent hinge
#

oh ok i think i see

arctic reef
bold wave
noble wolf
#

and where do I find both of those, face resto and upscale

arctic reef
silent hinge
barren hound
#

Wait.. I might have read the info wrong. when it states 25 frame gens, that means total frames? Or fps?

noble wolf
#

restart

#

always restart after new install

#

"refresh" button doesn't always work

#

specifically for custom nodes I think? maybe

silent hinge
#

oh ok. does it need a certain workflow json too?

broken storm
#

question here, how do i install SVD with the github repository? i downloaded the file the repository provides, but i dont know what to do after that

silent hinge
#

got it now. thanks

noble wolf
#

you can use the images here for workflows I believe

#

it has a basic setup for running video gens

#

or wait maybe not the right link

#

one sec

silent hinge
#

do you reccomend any other installs for video

noble wolf
#

sorry here

#

uhm any other installs? hmmm idk sorry honestly I downloaded it last night

bold wave
noble wolf
#

just been messing with it since then xD

silent hinge
#

ahah ok

noble wolf
#

I learned comfy ui last night at 1am 😛

broken storm
noble wolf
#

its included by default now I believe

#

click this

#

drag one of the video onto your comfyui for the workflow

#

it should be self explanatory, put image in the left, set the settings fps/frames/movement and stuff

silent hinge
#

duskal did you install any face upscalers?

noble wolf
#

Ah no not yet!

broken storm
noble wolf
#

I've been messing with forest and shits mostly

#

not with good results, only face I did was up close

silent hinge
#

not bad!

noble wolf
#

its only not warping the face because I'm super up close

urban linden
#

if anyone is finding a workflow to extend the video, please let me know, I'll light a candle for you or smth.

broken storm
#

so the SVD github download should include it by default

#

iirc for SD i had to do a lot of setting up for the gui to work

noble wolf
#

extend the vid...

#

how do you do that I wonder huh