onyx ore Nov 21, 2023, 7:15 PM

#

YAYY I'm firat

glass helm Nov 21, 2023, 7:15 PM

#

!!!!

hidden temple Nov 21, 2023, 7:15 PM

#

hi

jade vessel Nov 21, 2023, 7:15 PM

#

trim bone Nov 21, 2023, 7:15 PM

#

Yahoo

storm kettle Nov 21, 2023, 7:15 PM

#

whats up

iron dust Nov 21, 2023, 7:15 PM

#

onyx ore YAYY I'm firat

Nice to meet you, Firat!

onyx ore Nov 21, 2023, 7:15 PM

#

robust lagoon Nov 21, 2023, 7:15 PM

#

lol noice

silent hinge Nov 21, 2023, 7:16 PM

#

https://tenor.com/view/hollyweencandy-cat-cat-drink-milk-cat-eating-cat-drinking-gif-16978701194275077898

Tenor

glossy trellis Nov 21, 2023, 7:16 PM

#

https://tenor.com/view/mike-wazowski-cursed-terror-moving-move-gif-16644513

Tenor

tidal cosmos Nov 21, 2023, 7:16 PM

#

https://tenor.com/view/404-not-found-error-20th-century-fox-gif-24907780

Tenor

robust lagoon Nov 21, 2023, 7:16 PM

#

lmao

forest jay Nov 21, 2023, 7:16 PM

#

https://media.discordapp.net/attachments/829661123913056266/874437296718479420/gif-speed.gif

jade vessel Nov 21, 2023, 7:16 PM

#

https://cdn.discordapp.com/attachments/938798387766120498/1170842312830623784/meme.jpg?ex=655a82e0&is=65480de0&hm=9d2bfa257b377fe272d5f58317a36eb5021291298caa62b706d34ea358c54f81&=

grave mantle Nov 21, 2023, 7:16 PM

#

Woman_representing_Eve_with_a_black_gothic_style_dress_garden_of_red_and_purple_roses_and_emerald_green_snake_style-Analog_Film_width-768_height-1344_aspect-9-16_seed-0ts-1700588836_idx-0.png

nimble sapphire Nov 21, 2023, 7:16 PM

#

https://tenor.com/view/ultrakill-panopticon-live-panopticon-reaction-live-reaction-ultrakill-live-reaction-gif-9147875870853660310

Tenor

humble iris Nov 21, 2023, 7:16 PM

#

40vram required ? xD

silent hinge Nov 21, 2023, 7:16 PM

#

https://imgur.com/a/Ec206U5

Imgur

Untitled Album

▶ Play video

#

gib vram

jade vessel Nov 21, 2023, 7:16 PM

#

12kb vram required

glossy trellis Nov 21, 2023, 7:16 PM

#

silent hinge gib vram

just download some vram

onyx ore Nov 21, 2023, 7:16 PM

#

humble iris 40vram required ? xD

You can wait for bot

forest jay Nov 21, 2023, 7:16 PM

#

Where am going to get 40gb of VRAM

humble iris Nov 21, 2023, 7:16 PM

#

jade vessel 12kb vram required

oh what is this 40 ram

gentle nest Nov 21, 2023, 7:16 PM

#

how many 4090s is that?

onyx ore Nov 21, 2023, 7:16 PM

#

Like I do for hugging face spaces

quartz bolt Nov 21, 2023, 7:17 PM

#

404

silent hinge Nov 21, 2023, 7:17 PM

#

glossy trellis just download some vram

lol i cant afford a A200 gpu

thin sand Nov 21, 2023, 7:17 PM

#

o

forest jay Nov 21, 2023, 7:17 PM

#

https://tenor.com/view/epic-facepalm-fat-guy-fat-guy-facepalm-oh-no-noooo-gif-23718365

Tenor

silent hinge Nov 21, 2023, 7:17 PM

#

gentle nest how many 4090s is that?

2

flat crystal Nov 21, 2023, 7:17 PM

#

gentle nest how many 4090s is that?

At least 2 4090s, so $4000 lmao

silent hinge Nov 21, 2023, 7:17 PM

#

i got one

#

sadcat

mortal crypt Nov 21, 2023, 7:17 PM

#

Welcome in!

gentle nest Nov 21, 2023, 7:17 PM

#

silent hinge i got one

you can make half video

ebon salmon Nov 21, 2023, 7:17 PM

#

forest jay Where am going to get 40gb of VRAM

You can rent GPUs on services like runpod, 40gb VRAM costs under $0.8/h on it

wide lava Nov 21, 2023, 7:17 PM

#

i dont have 2 4090s 😢

thin sand Nov 21, 2023, 7:17 PM

#

woah, i love this!

forest jay Nov 21, 2023, 7:17 PM

#

ebon salmon You can rent GPUs on services like runpod, 40gb VRAM costs under $0.8/h on it

I see

wide lava Nov 21, 2023, 7:17 PM

#

i only have one 4070ti

thin sand Nov 21, 2023, 7:18 PM

#

wide lava i dont have 2 4090s 😢

wait you need that to run it!?

mighty sentinel Nov 21, 2023, 7:18 PM

#

40GB vram bruh

copper berry Nov 21, 2023, 7:18 PM

#

run it on your cpu xdd

humble iris Nov 21, 2023, 7:18 PM

#

onyx ore Like I do for hugging face spaces

how that

wide lava Nov 21, 2023, 7:18 PM

#

yup 40gb vram

flat crystal Nov 21, 2023, 7:18 PM

#

thin sand wait you need that to run it!?

For 40 GB VRAM yes

thin sand Nov 21, 2023, 7:18 PM

#

thin sand woah, i love this!

ok now i love it slightly less

flat crystal Nov 21, 2023, 7:18 PM

#

1x 4090 = 24GB VRAM

zenith spoke Nov 21, 2023, 7:18 PM

#

me with my 6700xt.. wonder if it is even worth trying

wraith vale Nov 21, 2023, 7:18 PM

#

🤔 hm.... I WANT THIS!

valid trail Nov 21, 2023, 7:18 PM

#

the vram from 2 gpus won't add up afaik

iron dust Nov 21, 2023, 7:18 PM

#

wraith vale 🤔 hm.... I WANT THIS!

Updated link in the announcement post!

thin sand Nov 21, 2023, 7:18 PM

#

i got an idea! we throw away 4/5 of all the weights in the model!!!

#

i am a genius

onyx ore Nov 21, 2023, 7:18 PM

#

wraith vale 🤔 hm.... I WANT THIS!

thin sand Nov 21, 2023, 7:19 PM

#

thin sand i got an idea! we throw away 4/5 of all the weights in the model!!!

we will only need 8gb

wraith vale Nov 21, 2023, 7:19 PM

#

flat crystal For 40 GB VRAM yes

Only for 6090!

humble iris Nov 21, 2023, 7:19 PM

#

wide lava i only have one 4070ti

nvidia smiles with 12GB : well 12-16 now is not a big deal

iron dust Nov 21, 2023, 7:19 PM

#

onyx ore

No spam please.

valid trail Nov 21, 2023, 7:19 PM

#

mods bot account

silent hinge Nov 21, 2023, 7:19 PM

#

40 GBS OF VRAM BRUHH

zenith spoke Nov 21, 2023, 7:19 PM

#

humble iris nvidia smiles with 12GB : well 12-16 now is not a big deal

12 is sure good for an NVIDIA

ebon salmon Nov 21, 2023, 7:19 PM

#

humble iris nvidia smiles with 12GB : well 12-16 now is not a big deal

12-16 now is even not enough at all, if you do ML stuff ^^'

left geode Nov 21, 2023, 7:20 PM

#

ML?

silent hinge Nov 21, 2023, 7:20 PM

#

you need rtx 6000 ada generation gpu for this video ai

hollow wren Nov 21, 2023, 7:20 PM

#

Yo anyone got 10k? https://www.amazon.com/NVIDIA-Tesla-A100-Ampere-Graphics/dp/B0BGZJ27SL

NVIDIA Tesla A100 Ampere 40 GB Graphics Card - PCIe 4.0 - Dual Slot

The NVIDIA A100 Tensor Core GPU delivers unprecedented acceleration at every scale for AI, data analytics, and high-performance computing (HPC) to tackle the world's toughest computing challenges. As the engine of the NVIDIA data center platform, A100 can efficiently scale to thousands of GPUs or...

ebon salmon Nov 21, 2023, 7:20 PM

#

left geode ML?

Machine Learning

humble iris Nov 21, 2023, 7:20 PM

#

zenith spoke 12 is sure good for an NVIDIA

i mean we were complaining about it while 40gb not even debatable xD

silent hinge Nov 21, 2023, 7:20 PM

#

ebon salmon 12-16 now is even not enough at all, if you do ML stuff ^^'

so u need rtx 4090 for machine learning?

rustic hinge Nov 21, 2023, 7:20 PM

#

can this run across two GPUs?

flat crystal Nov 21, 2023, 7:20 PM

#

wraith vale Only for 6090!

RTX 5090 is rumored to have 48 GB VRAM, but no confirmation on this yet 🙂

humble iris Nov 21, 2023, 7:20 PM

#

ebon salmon 12-16 now is even not enough at all, if you do ML stuff ^^'

yes i totally agree xD i hope they forgot a " zero " by mistake xD

ebon salmon Nov 21, 2023, 7:20 PM

#

silent hinge so u need rtx 4090 for machine learning?

Depends on what you do, but for training purposes you even want more... and several of them in parallel

knotty stream Nov 21, 2023, 7:20 PM

#

silent hinge so u need rtx 4090 for machine learning?

I think you need even more powerfull GPUs

valid trail Nov 21, 2023, 7:21 PM

#

flat crystal RTX 5090 is rumored to have 48 GB VRAM, but no confirmation on this yet 🙂

topping out at 36 from what i gather

humble iris Nov 21, 2023, 7:21 PM

#

flat crystal RTX 5090 is rumored to have 48 GB VRAM, but no confirmation on this yet 🙂

in 2 years from now ig hearing those numbers wont be impossible

silent hinge Nov 21, 2023, 7:21 PM

#

knotty stream I think you need even more powerfull GPUs

damn idk how i can I get it then

floral chasm Nov 21, 2023, 7:21 PM

#

Stability.AI should release their own GPU lineup with their models at this point

keen knot Nov 21, 2023, 7:21 PM

#

how to use this

thorny charm Nov 21, 2023, 7:21 PM

#

We'll probably see an optimized version of the model soon, able to run on 3090s at least

silent hinge Nov 21, 2023, 7:21 PM

#

I wonder how good it is

iron dust Nov 21, 2023, 7:21 PM

#

humble iris yes i totally agree xD i hope they forgot a " zero " by mistake xD

40GB of VRAM is correct for the local model - but there will be a web version for people to try out soontm !
Keep in mind this is targeted towards researchers & not the full commercial release so set expectations accordingly ❤️

tepid stream Nov 21, 2023, 7:21 PM

#

flat crystal RTX 5090 is rumored to have 48 GB VRAM, but no confirmation on this yet 🙂

Knowing Nvidia next gen will have the same amount as this dogsmile

humble iris Nov 21, 2023, 7:21 PM

#

silent hinge damn idk how i can I get it then

ig renting gpu

copper berry Nov 21, 2023, 7:21 PM

#

the pipeline can't finish xdd

silent hinge Nov 21, 2023, 7:21 PM

#

humble iris ig renting gpu

u can rent gpu now?

flat crystal Nov 21, 2023, 7:21 PM

#

valid trail topping out at 36 from what i gather

Yeah, just rumors ofc, it's most definitely up to change

ebon salmon Nov 21, 2023, 7:21 PM

#

humble iris in 2 years from now ig hearing those numbers wont be impossible

You can already buy an A100 with 80gb or vram... for like $20k ^^'

gentle nest Nov 21, 2023, 7:21 PM

#

21,000€

keen hazel Nov 21, 2023, 7:21 PM

#

Damn

#

A car

left geode Nov 21, 2023, 7:21 PM

#

guess I gotta wait for the 24gb patch

silent hinge Nov 21, 2023, 7:21 PM

#

iron dust 40GB of VRAM is correct for the local model - but there will be a web version fo...

damn

keen hazel Nov 21, 2023, 7:22 PM

#

Or some ai videos

wide lava Nov 21, 2023, 7:22 PM

#

humble iris nvidia smiles with 12GB : well 12-16 now is not a big deal

sadly 😭

supple peak Nov 21, 2023, 7:22 PM

#

thin sand i got an idea! we throw away 4/5 of all the weights in the model!!!

rickthink

humble iris Nov 21, 2023, 7:22 PM

#

iron dust 40GB of VRAM is correct for the local model - but there will be a web version fo...

thanx a lott for explaining, it makes sense for someone like me seeing those 40vram was a shock xD

rustic hinge Nov 21, 2023, 7:22 PM

#

iron dust 40GB of VRAM is correct for the local model - but there will be a web version fo...

can this run on two 24GB GPUs?

modest coyote Nov 21, 2023, 7:22 PM

#

40GB VRAM for inference, rip fine tuning already

iron dust Nov 21, 2023, 7:22 PM

#

humble iris thanx a lott for explaining, it makes sense for someone like me seeing those 40v...

For sure! I can understand why lol!

thin sand Nov 21, 2023, 7:22 PM

#

supple peak <:rickthink:360942637622099979>

WHY ARE YOU EVERYWHERE

keen hazel Nov 21, 2023, 7:22 PM

#

12gb so it can run in Google colab

ebon salmon Nov 21, 2023, 7:22 PM

#

keen hazel Or some ai videos

You can already do AI vids with way less than 40gb to be fair, with AnimateDiff

supple peak Nov 21, 2023, 7:22 PM

#

thin sand WHY ARE YOU EVERYWHERE

EvilLaugh

ebon salmon Nov 21, 2023, 7:22 PM

#

Result quality might vary though

tepid stream Nov 21, 2023, 7:22 PM

#

You can still settle with tools that will do animations with less VRAM cheems

keen hazel Nov 21, 2023, 7:23 PM

#

ebon salmon You can already do AI vids with way less than 40gb to be fair, with AnimateDiff

Yea

modest coyote Nov 21, 2023, 7:23 PM

#

ebon salmon You can already do AI vids with way less than 40gb to be fair, with AnimateDiff

those arent AI videos

ebon salmon Nov 21, 2023, 7:23 PM

#

modest coyote those arent AI videos

What are they if not that?

silent hinge Nov 21, 2023, 7:23 PM

#

I wanna see how good this one is

thin sand Nov 21, 2023, 7:23 PM

#

supple peak <:EvilLaugh:360845026609332224>

14 mutual servers be like

flat crystal Nov 21, 2023, 7:23 PM

#

All you gotta do is buy this bad boy: https://www.nvidia.com/en-us/data-center/a100/

NVIDIA

NVIDIA A100 GPUs Power the Modern Data Center

The fastest data center platform for AI and HPC.

silent hinge Nov 21, 2023, 7:23 PM

#

maybe in the future we will get ai so good it will replace animators

supple peak Nov 21, 2023, 7:23 PM

#

thin sand 14 mutual servers be like

yup, and so unrelated

modest coyote Nov 21, 2023, 7:23 PM

#

all you need is runpod, vastai, bananadev or something like that

keen hazel Nov 21, 2023, 7:23 PM

#

thin sand WHY ARE YOU EVERYWHERE

For real I share 7 servers with the guy

thin sand Nov 21, 2023, 7:23 PM

#

lol

prime echo Nov 21, 2023, 7:23 PM

#

lets go we're about to get some quality gifs up here

hollow wren Nov 21, 2023, 7:23 PM

#

Someone mentionned the rtx 6000 ada, they're legit cheaper: https://store.nvidia.com/en-us/nvidia-rtx/store/?page=1&limit=9&locale=en-us&category=GPU

@NVIDIA

Buy Professional Graphics Cards & Workstations | NVIDIA RTX

Buy professional graphics cards, desktop and mobile workstations, and more in the NVIDIA RTX Store

prime echo Nov 21, 2023, 7:23 PM

#

and videos of course

marble folio Nov 21, 2023, 7:24 PM

#

@ebon salmon I see you keep on top of the news

prime echo Nov 21, 2023, 7:24 PM

#

hollow wren Someone mentionned the rtx 6000 ada, they're legit cheaper: https://store.nvidia...

"cheaper"
sorry but i can't even afford a gumball wtf 😭

ebon salmon Nov 21, 2023, 7:24 PM

#

marble folio <@230763790323548161> I see you keep on top of the news

Glad to see you there 😛

supple peak Nov 21, 2023, 7:24 PM

#

keen hazel For real I share 7 servers with the guy

Ours are pretty much all related though

marble folio Nov 21, 2023, 7:24 PM

#

ebon salmon Glad to see you there 😛

rip my beta member role

iron dust Nov 21, 2023, 7:24 PM

#

silent hinge I wanna see how good this one is

There are some examples available on the website if you'd like to give them a look! https://stability.ai/news/stable-video-diffusion-open-ai-video-model

(Here's a video too!)

Stability AI

Introducing Stable Video Diffusion — Stability AI

Stable Video Diffusion is a proud addition to our diverse range of open-source models. Spanning across modalities including image, language, audio, 3D, and code, our portfolio is a testament to Stability AI’s dedication to amplifying human intelligence.

keen hazel Nov 21, 2023, 7:24 PM

#

supple peak Ours are pretty much all related though

Yea

wraith vale Nov 21, 2023, 7:24 PM

#

hollow wren Yo anyone got 10k? https://www.amazon.com/NVIDIA-Tesla-A100-Ampere-Graphics/dp/B...

No, thanks!

peak shore Nov 21, 2023, 7:24 PM

#

oh so you need 40GB VRAM? rip

silent hinge Nov 21, 2023, 7:24 PM

#

hollow wren Someone mentionned the rtx 6000 ada, they're legit cheaper: https://store.nvidia...

yes

hollow wren Nov 21, 2023, 7:24 PM

#

ebon salmon Glad to see you there 😛

Cheaper don't mean cheap lmao

silent hinge Nov 21, 2023, 7:24 PM

#

iron dust There are some examples available on the website if you'd like to give them a lo...

ok ima see

ebon salmon Nov 21, 2023, 7:25 PM

#

hollow wren Cheaper don't mean cheap lmao

I mean, even a 3090 today is nowhere near cheap >.<

#

And nowhere near enough either depending on what you do ^^'

wide lava Nov 21, 2023, 7:25 PM

#

iron dust There are some examples available on the website if you'd like to give them a lo...

this is absolutely absurd

flat crystal Nov 21, 2023, 7:25 PM

#

wraith vale No, thanks!

Isn't this the old model? I thought newer A100's have 80GB VRAM

haughty garden Nov 21, 2023, 7:25 PM

#

If I save money on a Ferrari is it still a deal?

wide lava Nov 21, 2023, 7:25 PM

#

thats amazing

ebon salmon Nov 21, 2023, 7:25 PM

#

flat crystal Isn't this the old model? I thought newer A100's have 80GB VRAM

They do, there are 40gb and 80gb models

knotty stream Nov 21, 2023, 7:25 PM

#

iron dust There are some examples available on the website if you'd like to give them a lo...

teal vigil Nov 21, 2023, 7:25 PM

#

Go now to waitlist... just have company name at hand XD https://stability.ai/contact

flat crystal Nov 21, 2023, 7:25 PM

#

Gotcha

#

I cannot for the life of me find the 80GB models available anywhere online..

ebon salmon Nov 21, 2023, 7:26 PM

#

flat crystal I cannot for the life of me find the 80GB models available anywhere online..

Runpod

humble iris Nov 21, 2023, 7:26 PM

#

but i cant get what is the different between this sd video and animated diffusion node ?

modest coyote Nov 21, 2023, 7:26 PM

#

iron dust There are some examples available on the website if you'd like to give them a lo...

why do they look like animated vector graphics

silent hinge Nov 21, 2023, 7:26 PM

#

I will compare same prompts in runway now

wide lava Nov 21, 2023, 7:26 PM

#

i might be able to convince our media production to get 80gb vram.

flat crystal Nov 21, 2023, 7:26 PM

#

ebon salmon Runpod

I mean locally buy one

#

Do they have that on that website/

rustic hinge Nov 21, 2023, 7:27 PM

#

does anyone know how to run this and if it'll work across two GPUs?

royal mauve Nov 21, 2023, 7:27 PM

#

Yoh, so how do we run this model ? Not obvious

floral chasm Nov 21, 2023, 7:27 PM

#

ebon salmon Runpod

Pls no don't make all of my workers throttled -_-

supple peak Nov 21, 2023, 7:27 PM

#

get 500gb and run the trillion param LLM

hollow wren Nov 21, 2023, 7:27 PM

#

flat crystal I mean locally buy one

Idk if they sell those locally

wide lava Nov 21, 2023, 7:27 PM

#

rustic hinge does anyone know how to run this and if it'll work across two GPUs?

apply for the waitlist

humble iris Nov 21, 2023, 7:27 PM

#

silent hinge I will compare same prompts in runway now

i would like to see if u can mention me with comparison

modest coyote Nov 21, 2023, 7:27 PM

#

rustic hinge does anyone know how to run this and if it'll work across two GPUs?

yes, it just released and i read through all documentation in like 3 minutes

rustic hinge Nov 21, 2023, 7:27 PM

#

where even is the documentation?

flat crystal Nov 21, 2023, 7:27 PM

#

hollow wren Idk if they sell those locally

I wish they would 😦

silent hinge Nov 21, 2023, 7:27 PM

#

humble iris i would like to see if u can mention me with comparison

ok I can show you the runway anims and u can check whether the runway is better or the stable diffusion is better

ebon salmon Nov 21, 2023, 7:27 PM

#

flat crystal I mean locally buy one

For local stuff, most of the time you're better off with multiple smaller cards, it would be way cheaper and not necessarily much slower

iron dust Nov 21, 2023, 7:28 PM

#

rustic hinge where even is the documentation?

https://github.com/Stability-AI/generative-models & https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt

And the research paper is here: https://stability.ai/research/stable-video-diffusion-scaling-latent-video-diffusion-models-to-large-datasets !

humble iris Nov 21, 2023, 7:28 PM

#

silent hinge ok I can show you the runway anims and u can check whether the runway is better ...

sure !

gilded reef Nov 21, 2023, 7:28 PM

#

Any idea of speeds for the 14/25 frame versions on a "budget" GPU that can handle this like an a6000?

modest coyote Nov 21, 2023, 7:28 PM

#

your move, openai

flat crystal Nov 21, 2023, 7:28 PM

#

ebon salmon For local stuff, most of the time you're better off with multiple smaller cards,...

Yeah fair enough, especially with most general tasks. That or workstation gpus

hollow wren Nov 21, 2023, 7:28 PM

#

flat crystal I wish they would 😦

Yeah but those gpus are mostly for data centers and enterprise workstations so b2b selling

rustic hinge Nov 21, 2023, 7:28 PM

#

iron dust https://github.com/Stability-AI/generative-models & https://huggingface.co/stabi...

is it part of a combined github repo?

iron dust Nov 21, 2023, 7:28 PM

#

gilded reef Any idea of speeds for the 14/25 frame versions on a "budget" GPU that can handl...

Budget GPUs will likely not be able to run this during the initial research preview. BUT, we have a web interface coming soon that everyone should be able to play around with! More details on this later~

royal mauve Nov 21, 2023, 7:29 PM

#

How do we run the model ? Can't see any code

hollow wren Nov 21, 2023, 7:29 PM

#

royal mauve How do we run the model ? Can't see any code

first you need the hardware lmao

gilded reef Nov 21, 2023, 7:29 PM

#

iron dust Budget GPUs will likely not be able to run this during the initial research prev...

An a6000 48GB VRAM won't run that?

silent hinge Nov 21, 2023, 7:29 PM

#

humble iris sure !

this is the astronaut walking on the moon

iron dust Nov 21, 2023, 7:29 PM

#

royal mauve How do we run the model ? Can't see any code

Some of the code and weights are just getting pushed through so they may take just a wee little longer to get going! Please bare w/ me!

modest coyote Nov 21, 2023, 7:29 PM

#

what does this mean

#

it just takes an image as input and thats it?

wraith vale Nov 21, 2023, 7:30 PM

#

silent hinge this is the astronaut walking on the moon

Pizza time?

iron dust Nov 21, 2023, 7:30 PM

#

modest coyote what does this mean

The model being released for research is a image-to-video model. The text-to-video portion will be included in the web interface to be released soontm

keen hazel Nov 21, 2023, 7:30 PM

#

modest coyote what does this mean

It showed it can to txt2vid

keen hazel Nov 21, 2023, 7:30 PM

#

iron dust The model being released for research is a image-to-video model. The text-to-vid...

Oh ok

knotty stream Nov 21, 2023, 7:30 PM

#

wraith vale Pizza time?

That's a nice magic trick

trail quiver Nov 21, 2023, 7:30 PM

#

wraith vale Pizza time?

Pizza on the moon?

flat crystal Nov 21, 2023, 7:30 PM

#

iron dust The model being released for research is a image-to-video model. The text-to-vid...

Thank you!

wraith vale Nov 21, 2023, 7:31 PM

#

knotty stream That's a nice magic trick

She made a slice of pizza... WTF... How did she do it?

teal vigil Nov 21, 2023, 7:31 PM

#

Is that how pizza is made?

knotty stream Nov 21, 2023, 7:32 PM

#

wraith vale She made a slice of pizza... WTF... How did she do it?

Yeah, I would love to know how to do that

empty oar Nov 21, 2023, 7:32 PM

#

teal vigil Is that how pizza is made?

yeah, how else?

teal vigil Nov 21, 2023, 7:32 PM

#

Neat

knotty stream Nov 21, 2023, 7:32 PM

#

Maybe pizza ovens are already running video models

silent hinge Nov 21, 2023, 7:32 PM

#

two blue jays on top of a building. looks scary

ebon salmon Nov 21, 2023, 7:32 PM

#

I'm curious to test this and see how it differs with AnimateDiff

humble iris Nov 21, 2023, 7:32 PM

#

silent hinge this is the astronaut walking on the moon

oh he didnt move at all, ignoring the unrealistic moon walk of sd video but as test i can feel it will be awesome later

ebon salmon Nov 21, 2023, 7:33 PM

#

Motion is definitively better, and from the examples I've seen there are less/no weird limbs suddenly appearing and disappearing

silent hinge Nov 21, 2023, 7:33 PM

#

humble iris oh he didnt move at all, ignoring the unrealistic moon walk of sd video but as t...

yeah runway is pretty bad for movement but maybe with the new motion thing it will be good

hexed oak Nov 21, 2023, 7:33 PM

#

It's been 15 minutes and we still lack a video of will smith eating spaghetti. come on people.

silent hinge Nov 21, 2023, 7:33 PM

#

im also gonna test pikalabs to see

humble iris Nov 21, 2023, 7:34 PM

#

silent hinge yeah runway is pretty bad for movement but maybe with the new motion thing it wi...

i hope so, also thanx for sharing

silent hinge Nov 21, 2023, 7:35 PM

#

humble iris i hope so, also thanx for sharing

yw

shadow osprey Nov 21, 2023, 7:35 PM

#

hexed oak It's been 15 minutes and we still lack a video of will smith eating spaghetti. c...

"keep my wife's spaghetti out your damn mouth"

ebon salmon Nov 21, 2023, 7:35 PM

#

I can already see animation makers raging 🤭

#

"They're taking our jobs!"

silent hinge Nov 21, 2023, 7:36 PM

#

soon in the future u can make a 20 min animation in 10 minutes 😆

haughty garden Nov 21, 2023, 7:36 PM

#

Next stop... 3D rendering and destructive enviroments in games.

viral wren Nov 21, 2023, 7:36 PM

#

Apple M chip should work😁

ebon salmon Nov 21, 2023, 7:37 PM

#

silent hinge soon in the future u can make a 20 min animation in 10 minutes 😆

Yeaaaah I wouldn't expect it too soon though... Currently on AnimateDiff, a much smaller model, it takes 2-3 minutes to generate 3seconds of animation on a 3090

silent hinge Nov 21, 2023, 7:37 PM

#

cold mulch Nov 21, 2023, 7:37 PM

#

haughty garden Next stop... 3D rendering and destructive enviroments in games.

We've already got 3d

silent hinge Nov 21, 2023, 7:37 PM

#

will smith isnt eating spaghetti

ebon salmon Nov 21, 2023, 7:37 PM

#

silent hinge

pain

shadow osprey Nov 21, 2023, 7:37 PM

#

silent hinge

This is horrifying

silent hinge Nov 21, 2023, 7:37 PM

#

ebon salmon Yeaaaah I wouldn't expect it too soon though... Currently on AnimateDiff, a much...

damn

#

3090 also has 24 gb of vram that a lot

ebon salmon Nov 21, 2023, 7:38 PM

#

silent hinge 3090 also has 24 gb of vram that a lot

It's not VRAM the bottleneck here ^^'

silent hinge Nov 21, 2023, 7:38 PM

#

ebon salmon It's not VRAM the bottleneck here ^^'

the bottleneck?

#

like cpu and gpu bottleneck

hexed oak Nov 21, 2023, 7:38 PM

#

will smith is eating the spacetime continuum

ebon salmon Nov 21, 2023, 7:38 PM

#

silent hinge the bottleneck?

Yeah, I generated animations on 3090, 4090 and A100s, and the speed different isn't really significant

#

Even if A100s have 80gb

silent hinge Nov 21, 2023, 7:39 PM

#

ebon salmon Yeah, I generated animations on 3090, 4090 and A100s, and the speed different is...

damn u got a100 bro

ebon salmon Nov 21, 2023, 7:39 PM

#

silent hinge damn u got a100 bro

No I use online GPU services xD

silent hinge Nov 21, 2023, 7:39 PM

#

ebon salmon No I use online GPU services xD

ohhh ok

ebon salmon Nov 21, 2023, 7:39 PM

#

You can even rent H100s nowadays

wraith vale Nov 21, 2023, 7:39 PM

#

Legendary....

ebon salmon Nov 21, 2023, 7:39 PM

#

ebon salmon You can even rent H100s nowadays

At $4/h tho

haughty garden Nov 21, 2023, 7:39 PM

#

Well Davinci Resolve (example) you can render on CPU (free version) or GPU (paid version) up to 32k resolution. A 4090 does it jsut fine.

silent hinge Nov 21, 2023, 7:40 PM

#

ebon salmon At $4/h tho

so that is 88 dollar a day right

#

wait

#

4 x 2 = 100

#

96 dollars a day

shadow osprey Nov 21, 2023, 7:40 PM

#

ebon salmon At $4/h tho

Is it billed by the second

silent hinge Nov 21, 2023, 7:40 PM

#

96 x 30

median helm Nov 21, 2023, 7:40 PM

#

69$ per frame

shadow osprey Nov 21, 2023, 7:40 PM

#

silent hinge 4 x 2 = 100

Me and you learned learned very different math

silent hinge Nov 21, 2023, 7:41 PM

#

2880 dollars a month

ebon salmon Nov 21, 2023, 7:41 PM

#

shadow osprey Is it billed by the second

Practically yeah, not sure but it's a relatively small time frame for price calculation

silent hinge Nov 21, 2023, 7:41 PM

#

2880 x 12

#

34k

#

34 560

#

so basically every year 34 560 I think

lean gate Nov 21, 2023, 7:41 PM

#

haughty garden Well Davinci Resolve (example) you can render on CPU (free version) or GPU (paid...

rendering pixels and inferring are very different beasts

shadow osprey Nov 21, 2023, 7:41 PM

#

silent hinge so that is 88 dollar a day right

Assuming he's running 100% 24/7

ebon salmon Nov 21, 2023, 7:42 PM

#

silent hinge so basically every year 34 560 I think

I only do sessions on 1-2h when I need to do some tests tho, I'm not rich enough to rent that h24 d365 😛

silent hinge Nov 21, 2023, 7:42 PM

#

shadow osprey Assuming he's running 100% 24/7

yeah assuming that

median helm Nov 21, 2023, 7:42 PM

#

silent hinge so basically every year 34 560 I think

5.8752e+110 USD if you ran it till the heat death of the universe

#

give or take

ebon salmon Nov 21, 2023, 7:42 PM

#

silent hinge so basically every year 34 560 I think

Also there is the option for serverless, GPU clusters that are idle most of the time and only wake up (and bill) when used

silent hinge Nov 21, 2023, 7:43 PM

#

ebon salmon Also there is the option for serverless, GPU clusters that are idle most of the ...

ohh ok

haughty garden Nov 21, 2023, 7:43 PM

#

lean gate rendering pixels and inferring are very different beasts

Resolve has AI infusion. It can't be that different. The node system plugs into Autodesk and UE5.

silent hinge Nov 21, 2023, 7:44 PM

#

artists lost their job now animators

#

and soon teachers

#

and pretty much everoyne will lose their jobs

haughty garden Nov 21, 2023, 7:44 PM

#

Nonsense.

#

Stop the sillyness.

ebon salmon Nov 21, 2023, 7:44 PM

#

silent hinge and soon teachers

All those jobs lost also created new jobs for AI whisperers tho 😛

silent hinge Nov 21, 2023, 7:44 PM

#

ebon salmon All those jobs lost also created new jobs for AI whisperers tho 😛

true

steady field Nov 21, 2023, 7:45 PM

#

there is still a major controllability and editability problem in AI

silent hinge Nov 21, 2023, 7:45 PM

#

steady field there is still a major controllability and editability problem in AI

ye

steady field Nov 21, 2023, 7:45 PM

#

editing generative images and videos still requires expert skillsets

ebon salmon Nov 21, 2023, 7:45 PM

#

steady field there is still a major controllability and editability problem in AI

It's been improving all over the board with the progress

flint obsidian Nov 21, 2023, 7:45 PM

#

silent hinge artists lost their job now animators

Which in turn will cause people to use their noggins to think of even more creative ways to make money which then will bring us further in our evolution. It's a cycle

ebon salmon Nov 21, 2023, 7:45 PM

#

steady field there is still a major controllability and editability problem in AI

Just look at SDXL, it's way better at following prompt than SD

steady field Nov 21, 2023, 7:46 PM

#

it will come with time, but it definitely lags behind

ebon salmon Nov 21, 2023, 7:46 PM

#

Even if yes it still requires quite a bit of expertise to get the most out of it

lean gate Nov 21, 2023, 7:46 PM

#

haughty garden Resolve has AI infusion. It can't be that different. The node system plugs into ...

for a 1024 x1024 image, that's a network of 1.048.576 "dimensions", the network has to decide what goes into each one of those, it's very different from rendering pixels, even in 3D you bounce a ray and it gives you a pixel color, it's less work

#

that's why in AI animation you need a lot of VRAM

silent hinge Nov 21, 2023, 7:46 PM

#

isnt microsoft developing chatgpt v with sdxl

lean gate Nov 21, 2023, 7:46 PM

#

you need to fit the model for inference into ram

silent hinge Nov 21, 2023, 7:47 PM

#

or developing sdxl with chatgpt v

ebon salmon Nov 21, 2023, 7:47 PM

#

silent hinge or developing sdxl with chatgpt v

Been there, done that 😛

rapid rapids Nov 21, 2023, 7:47 PM

#

So I'm reading paper for SVD and am confused, can anyone explain to me how many parameters does it have? Paper says that it's 1521M, but checkpoint size is 9 gigabytes. Did they ship optimizer state there as well?

shadow osprey Nov 21, 2023, 7:48 PM

#

silent hinge and pretty much everoyne will lose their jobs

I'm glad I'll be retired before they automate my job away

#

It's coming sooner or later

haughty garden Nov 21, 2023, 7:48 PM

#

Stop the sillyness.

eager sandal Nov 21, 2023, 7:48 PM

#

have anyone tried if this can run in 24G vram enviornment?

lean gate Nov 21, 2023, 7:49 PM

#

eager sandal have anyone tried if this can run in 24G vram enviornment?

it can't

#

for now

eager sandal Nov 21, 2023, 7:49 PM

#

with lowering to fp8 or fp16 to reduce vram consumption

ebon salmon Nov 21, 2023, 7:49 PM

#

haughty garden Stop the sillyness.

NO!

shut narwhal Nov 21, 2023, 7:49 PM

#

Hey! So the Stable video diffusion isnt available yet right ?

#

Really stocked to see what it uses, if it use some AnimateDiff, Deforum, Ip adapters, LCM or if it's fully dev by Stability 👀

tight fossil Nov 21, 2023, 7:50 PM

#

40gb for local inference

#

someone pin that

#

this question is gonna come up 300 million times in this chat

lean gate Nov 21, 2023, 7:50 PM

#

shut narwhal Hey! So the Stable video diffusion isnt available yet right ?

it is, you just need a hefty GPU (40 GB)

knotty stream Nov 21, 2023, 7:50 PM

#

it is in research preview

shut narwhal Nov 21, 2023, 7:50 PM

#

lean gate it is, you just need a hefty GPU (40 GB)

40Gb of VRAM ?!?!

eager sandal Nov 21, 2023, 7:50 PM

#

it is in announcement

lean gate Nov 21, 2023, 7:51 PM

#

shut narwhal 40Gb of VRAM ?!?!

yeah guys this is at research level for now

eager sandal Nov 21, 2023, 7:51 PM

#

but my focus here is if it can be inference in fp8/16 without losing the quality too much

#

as it is fp32 model

#

which is why it is costing so much vram

lean gate Nov 21, 2023, 7:51 PM

#

it will get optimized, it'll work on consumer GPUs down the road

shut narwhal Nov 21, 2023, 7:51 PM

#

Oh wow xD, require so much gpus ahahah

silk jolt Nov 21, 2023, 7:51 PM

#

silent hinge artists lost their job now animators

Correct

shut narwhal Nov 21, 2023, 7:51 PM

#

silk jolt Correct

Or they just upgrade it

silk jolt Nov 21, 2023, 7:51 PM

#

Can you use multigpu for inference?

outer trail Nov 21, 2023, 7:52 PM

#

Saying, "this will be on the web" isn't addressing some people's concerns, such as will this ever be runnable locally on consumer hardware.

ebon salmon Nov 21, 2023, 7:52 PM

#

silk jolt Correct

As I said, it's not a complete loss, it's more like a job shift ^^'

iron dust Nov 21, 2023, 7:52 PM

#

Anyone who doesn't have a spare 40GB of VRAM sitting on their shelf collecting dust should keep an eye out for the web experience coming soontm

lean gate Nov 21, 2023, 7:52 PM

#

silent hinge artists lost their job now animators

another way of looking into this is that animators now have TOOLS that expand their horizons, make them more efficient and they can do more and better work

distant epoch Nov 21, 2023, 7:52 PM

#

iron dust Anyone who doesn't have a spare 40GB of VRAM sitting on their shelf collecting d...

👁️ 👅 👁️

lean gate Nov 21, 2023, 7:52 PM

#

let's not be pessimistic, these tools are amazing

iron dust Nov 21, 2023, 7:53 PM

#

distant epoch 👁️ 👅 👁️

It's like looking in a mirror

knotty stream Nov 21, 2023, 7:53 PM

#

Yes, all these tools help a lot in the creative process

silent hinge Nov 21, 2023, 7:53 PM

#

lean gate another way of looking into this is that animators now have TOOLS that expand th...

true

ebon salmon Nov 21, 2023, 7:54 PM

#

knotty stream Yes, all these tools help a lot in the creative process

Still, it's a radical change of tools for those not willing to change their ways ^^' I think there is still space for both anyway

iron dust Nov 21, 2023, 7:54 PM

#

outer trail Saying, "this will be on the web" isn't addressing some people's concerns, such ...

True, but you also have to keep in mind this is an exclusive release targeted towards researchers while still in development. This is not a commercial release targeted towards both the hardware and use cases of the general public.

It's hard to apply one side's expectaitons to a different situation that wasn't intended to be the proper fit.

echo yew Nov 21, 2023, 7:55 PM

#

To me, the tools that will succeed the most in production settings are those that can use input images and work on top of them without destroying the intent from the artist. E.g. someone sketches a rough tree and adds the lighting direction and temperature, the AI output resembles the orignal sketch but rendered out faster.

outer trail Nov 21, 2023, 7:55 PM

#

Unspinned answer sounds like no. 🙂

knotty stream Nov 21, 2023, 7:55 PM

#

There is space for everyone, for those who want to continue creating as they always have and for those who are interested in testing new paths

#

and the ones who will mix everything too

echo yew Nov 21, 2023, 7:55 PM

#

Text to image isn't as usable in every production setting, its fine for things that are unimportant or as a starting point that artists have to work on top of afterwards

silk jolt Nov 21, 2023, 7:55 PM

#

2 x 3090?

eager sandal Nov 21, 2023, 7:55 PM

#

i do believe by reducing the model to fp8 inference it would be very likely that would run in 24G vram enviornment

#

but the only issue is if the characteristic of learnt data would be lost when its inferencing in lower precision

lean gate Nov 21, 2023, 7:56 PM

#

eager sandal i do believe by reducing the model to fp8 inference it would be very likely that...

not as simple as that

rapid rapids Nov 21, 2023, 7:57 PM

#

eager sandal i do believe by reducing the model to fp8 inference it would be very likely that...

Diffusion models don't respond that well to quantization

iron dust Nov 21, 2023, 7:57 PM

#

lean gate not as simple as that

But what if we ask it nicely?

unkempt mica Nov 21, 2023, 7:57 PM

#

who wants to go in on an A100 with me?

eager sandal Nov 21, 2023, 7:58 PM

#

rapid rapids Diffusion models don't respond that well to quantization

have tested by doing fp8 to checkpoint for lower vram consumption in ad generation, effectively cutting 18GB vram to less than 12GB, from fp16

#

without much quality lose

unkempt mica Nov 21, 2023, 7:58 PM

#

I will put in $20

eager sandal Nov 21, 2023, 7:58 PM

#

but idk if thats applicable here

lean gate Nov 21, 2023, 7:58 PM

#

eager sandal have tested by doing fp8 to checkpoint for lower vram consumption in ad generati...

are you talking about a motion module?

eager sandal Nov 21, 2023, 7:59 PM

#

lean gate are you talking about a motion module?

both checkpoint and motion module

distant epoch Nov 21, 2023, 7:59 PM

#

unkempt mica I will put in $20

If you want, you can give me money and I'll keep the RTX 6000

eager sandal Nov 21, 2023, 7:59 PM

#

its in the dev branch in A1111

#

fp8test branch by kohaku

distant epoch Nov 21, 2023, 8:00 PM

#

Alternatively, I can go on Tiktok live and start ebegging

short gulch Nov 21, 2023, 8:00 PM

#

This is an image to video model not a text to vid right? the subreddit is saying image to vid

eager sandal Nov 21, 2023, 8:00 PM

#

short gulch This is an image to video model not a text to vid right? the subreddit is saying...

iron dust Nov 21, 2023, 8:00 PM

#

short gulch This is an image to video model not a text to vid right? the subreddit is saying...

Correct! Image-To-Video with a text-to-video interface coming out on a web platform soon

eager sandal Nov 21, 2023, 8:01 PM

#

the paper documents it well

#

and the link is at https://static1.squarespace.com/static/6213c340453c3f502425776e/t/655ce779b9d47d342a93c890/1700587395994/stable_video_diffusion.pdf

unkempt mica Nov 21, 2023, 8:01 PM

#

distant epoch If you want, you can give me money and I'll keep the RTX 6000

I dont think that will work for me lol

distant epoch Nov 21, 2023, 8:04 PM

#

I don't know why, but on the website the example clips looks like a Robot 'post nut', now stuck in a crisis.

#

https://images.squarespace-cdn.com/content/v1/6213c340453c3f502425776e/f962c653-c6e8-4df5-b9c8-b50d30c5cfe7/stable_video_diffusion.gif?format=750w

keen hazel Nov 21, 2023, 8:04 PM

#

winged raft Nov 21, 2023, 8:18 PM

#

https://stability.ai/video doesn't work

sturdy reef Nov 21, 2023, 8:18 PM

#

I'm sure it's been asked, but how do I run the model locally? automatic?

olive wave Nov 21, 2023, 8:23 PM

#

arg, why did they used SVD as abbreviation 🤦‍♂️

grim tangle Nov 21, 2023, 8:28 PM

#

512x460 fits 24GB at least...results are interesting 😄

drifting glen Nov 21, 2023, 8:30 PM

#

silk jolt 2 x 3090?

Just buy an A6000 if you have that much money anyway.

unborn acorn Nov 21, 2023, 8:35 PM

#

Where is the ComfyUI nodes for svd_xt models? Models don't work in AnimateDiff loader.

glossy plover Nov 21, 2023, 8:45 PM

#

what is the difference between the xt model to the non-xt model?

iron dust Nov 21, 2023, 8:50 PM

#

glossy plover what is the difference between the xt model to the non-xt model?

non-xt was trained for 14 frame prediction & xt was trained for 24 frame prediction

ebon salmon Nov 21, 2023, 8:53 PM

#

At how many FPS? 8 like AnimateDiff?

#

(just wanted to mention for those that might find it's not enough: there are very good frame interpolation tools to transform 8fps into 64fps or more if you want, like RIFE)

lean gate Nov 21, 2023, 8:54 PM

#

ebon salmon At how many FPS? 8 like AnimateDiff?

variable rate up to 30fps

#

13 to 30 fps

#

AnimeDiff has a 16 frame context, you can combine it to any fps you like (via comfyUI, dont know about auto1111)

unborn acorn Nov 21, 2023, 8:56 PM

#

How to use the models?

lean gate Nov 21, 2023, 9:02 PM

#

unborn acorn How to use the models?

https://github.com/Stability-AI/generative-models

GitHub

GitHub - Stability-AI/generative-models: Generative Models by Stabi...

Generative Models by Stability AI. Contribute to Stability-AI/generative-models development by creating an account on GitHub.

#

it's mostly for research purposes right now, if your GPU can run it you can test it locally (you need at least 40GB vram)

#

it will be accessible via a website soon #▶｜stable-video-diffusion message

obtuse bridge Nov 21, 2023, 9:07 PM

#

The autoencoder for the video model is the same one as 2.1 isn't it? I'm going to end up testing it later to see if the temporal bits to the decoder work on other video models like AnimateDiff

grim tangle Nov 21, 2023, 9:11 PM

#

lean gate it's mostly for research purposes right now, if your GPU can run it you can test...

it runs on 24GB with system ram fallback on, just slow as hell

keen hazel Nov 21, 2023, 9:12 PM

#

Damn

sullen flower Nov 21, 2023, 9:13 PM

#

go into scripts/demo/streamlit_helpers.py and enable lowvram_mode = True for model offloading and it will run on 24gb

#

set frame decoding to 5 in the UI as well

lean gate Nov 21, 2023, 9:15 PM

#

but is it really usable like that?

#

there'll be optimizations down the road

lean gate Nov 21, 2023, 9:16 PM

#

sullen flower go into scripts/demo/streamlit_helpers.py and enable lowvram_mode = True for mod...

what card are you testing it on?

sullen flower Nov 21, 2023, 9:16 PM

#

3090

glossy plover Nov 21, 2023, 9:18 PM

#

sullen flower set frame decoding to 5 in the UI as well

UI? is there a UI already?

sullen flower Nov 21, 2023, 9:19 PM

#

the streamlit demo in the repo

nocturne star Nov 21, 2023, 9:19 PM

#

curious if the multi-view synthesis finetune of the video model will be avaliable at some point as well, thats far more interesting to me than the actual image to video, might be good for creating interesting 3d models or alphas

glossy plover Nov 21, 2023, 9:20 PM

#

sullen flower the streamlit demo in the repo

I'm getting ModuleNotFoundError: No module named 'imwatermark' with that script, even though I have that module installed

half hound Nov 21, 2023, 9:21 PM

#

will this get support for RTX 4090 cards?

sullen flower Nov 21, 2023, 9:21 PM

#

ehh, did not have that error

#

tiny cradle Nov 21, 2023, 9:24 PM

#

sullen flower go into scripts/demo/streamlit_helpers.py and enable lowvram_mode = True for mod...

where exactly should lowvram_mode = True be set?

sullen flower Nov 21, 2023, 9:25 PM

#

where its lowvram_mode = False now

tiny cradle Nov 21, 2023, 9:25 PM

#

ah I see I misread which file - thank you

glossy plover Nov 21, 2023, 9:25 PM

#

now I'm getting ModuleNotFoundError: No module named 'scripts.demo'

#

I managed to fix the invisible watermark thing

subtle agate Nov 21, 2023, 9:26 PM

#

sullen flower

nice, can you upload scene with human face in it? it's a crucial weak point in the most of competitiors, and it will be nice to see raw generation of the model

spring cedar Nov 21, 2023, 9:26 PM

#

can we have a room for just videos?

tiny cradle Nov 21, 2023, 9:27 PM

#

Setup Instructions (Python 3.10, 4090, working on Linux):

git clone the repo
- git clone git@github.com:Stability-AI/generative-models.git
- cd generative-models
pip install -r requirements/pt2.txt
double check that pip install actually worked. on windows you may need to comment out xformers and triton
pip install .
modify streamlit_helpers.py lowvram_mode = True
create a checkpoints folder in the root folder of the project
download the weights from https://huggingface.co/stabilityai/stable-video-diffusion-img2vid/tree/main to checkpoints folder
streamlit run scripts/demo/video_sampling.py
set "Decode t frames at a time)" to 2 or lower
click "Load Model"
upload image and go

sullen flower Nov 21, 2023, 9:28 PM

#

subtle agate nice, can you upload scene with human face in it? it's a crucial weak point in t...

this one turned a bit 3d but an example at least

subtle agate Nov 21, 2023, 9:29 PM

#

sullen flower this one turned a bit 3d but an example at least

thanks! wow, looks very good!

glossy plover Nov 21, 2023, 9:30 PM

#

tiny cradle **Setup Instructions** (Python 3.10, 4090, working on Linux): - git clone the re...

that assumes I have Linux. is the model only for Linux systems?

tiny cradle Nov 21, 2023, 9:31 PM

#

trying on m1 right now. lets see

glossy plover Nov 21, 2023, 9:37 PM

#

ok, I bypassed the thing that wanted Triton. now I'm getting:
C:\Users\joker\OneDrive\Desktop\A.I\generative-models\venv\lib\site-packages\torchaudio\backend\utils.py:74: UserWarning: No audio backend is available. warnings.warn("No audio backend is available.") 2023-11-21 23:36:09.160 Uncaught app exception Traceback (most recent call last): File "C:\Users\joker\OneDrive\Desktop\A.I\generative-models\venv\lib\site-packages\streamlit\runtime\scriptrunner\script_runner.py", line 534, in _run_script exec(code, module.__dict__) File "C:\Users\joker\OneDrive\Desktop\A.I\generative-models\scripts\demo\video_sampling.py", line 5, in <module> from scripts.demo.streamlit_helpers import * ModuleNotFoundError: No module named 'scripts'

tiny cradle Nov 21, 2023, 9:38 PM

#

its hard coded for cuda. I suspect the code can be made to run on m1

File "/Users/bryce/.pyenv/versions/3.10.13/envs/gen-mdls-3.10.13/lib/python3.10/site-packages/torch/cuda/__init__.py", line 239, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled

sullen flower Nov 21, 2023, 9:38 PM

#

you need to pip install . probably

glossy plover Nov 21, 2023, 9:40 PM

#

sullen flower you need to pip install . probably

I did do that, idk what's the error I'm getting

half hound Nov 21, 2023, 9:41 PM

#

how do you install?

sullen flower Nov 21, 2023, 9:42 PM

#

pretty sure that fixed it for me though i did it with the -e flag, but not sure that should make any difference

tiny cradle Nov 21, 2023, 9:43 PM

#

glossy plover that assumes I have Linux. is the model only for Linux systems?

wait why do you think this assumes linux systems? are you on mac or PC?

glossy plover Nov 21, 2023, 9:44 PM

#

tiny cradle wait why do you think this assumes linux systems? are you on mac or PC?

I'm on windows, when I followed the setup it errored out due to needing Triton, which is a Linux package

tiny cradle Nov 21, 2023, 9:45 PM

#

run your commands from the root of the project

glossy plover Nov 21, 2023, 9:45 PM

#

I changed the requirements file, continued as normal and it still errored out, a completely different error though

sullen flower Nov 21, 2023, 9:45 PM

#

i ran mine in wsl because of triton

glossy plover Nov 21, 2023, 9:46 PM

#

tiny cradle run your commands from the root of the project

that's what I did, I'll try in WSL I guess

tiny cradle Nov 21, 2023, 9:48 PM

#

glossy plover that's what I did, I'll try in WSL I guess

you could try PYTHONPATH=. streamlit run scripts/demo/video_sampling.py

#

not sure if thats how env vars work on windows. apologies if its wrong 🙂

quartz crescent Nov 21, 2023, 9:50 PM

#

glossy plover ok, I bypassed the thing that wanted Triton. now I'm getting: `C:\Users\joker\On...

you need to install the generative-models module

#

pip install .

glossy plover Nov 21, 2023, 9:51 PM

#

quartz crescent you need to install the `generative-models` module

yeah, I already did that

quartz crescent Nov 21, 2023, 9:52 PM

#

oh weird, am on win & that fixed it for me

tiny cradle Nov 21, 2023, 9:54 PM

#

i can only get 8 frames on a 4090 but I see some opportunities for better memory usage

glossy plover Nov 21, 2023, 9:59 PM

#

yeah, still getting ModuleNotFoundError: No module named 'scripts'

#

that's after the UI began loading itself in browser though

grim tangle Nov 21, 2023, 10:00 PM

#

SD image to SVD, with 3x interpolation in post

glossy plover Nov 21, 2023, 10:01 PM

#

glossy plover that's after the UI began loading itself in browser though

it already got to a point where it automatically opened my browser, but it still spits that error

half hound Nov 21, 2023, 10:04 PM

#

glossy plover it already got to a point where it automatically opened my browser, but it still...

just made it to the same step as you

quartz viper Nov 21, 2023, 10:06 PM

#

Congrats to Stability team for releasing a video model! GO JOE!

iron topaz Nov 21, 2023, 10:06 PM

#

new we just need to foce gpu manufactuers to solve the vram proplem

quartz viper Nov 21, 2023, 10:06 PM

#

also, maybe we can not call it SVD as that is kind of homophonic with another initialism

quartz viper Nov 21, 2023, 10:08 PM

#

iron topaz new we just need to foce gpu manufactuers to solve the vram proplem

software drives demands. While this might seem like it, gaming cards are made for games. So convince developers to make games that need 40gb cards :X

iron topaz Nov 21, 2023, 10:08 PM

#

quartz viper software drives demands. While this might seem like it, gaming cards are made f...

ai cards are to expensive for consumers

#

i have hered you can mod gpus to dubble the vram by switching the ram chipp

quartz viper Nov 21, 2023, 10:09 PM

#

thats right. but we use gaming cards to do it because gaming cards are priced for consumers. so we need to encourage developers to make games even more unoptimized and demanding on vram

#

start demanding 4k specular and normal maps

grim tangle Nov 21, 2023, 10:09 PM

#

generated with SVD, interpolated 3x

iron topaz Nov 21, 2023, 10:09 PM

#

ram is so inexpencive in comparison its just sad that thats the limiting factor

iron topaz Nov 21, 2023, 10:09 PM

#

quartz viper thats right. but we use gaming cards to do it because gaming cards are priced f...

yes

quartz viper Nov 21, 2023, 10:09 PM

#

iron topaz i have hered you can mod gpus to dubble the vram by switching the ram chipp

you can but then you have to also patch the bios. it generally is only done on models where there's an 8gb and a 16gb varient

#

since the bios already has 16gb code

iron topaz Nov 21, 2023, 10:10 PM

#

they shuld use uncompressed 8k textures for everything

quartz viper Nov 21, 2023, 10:10 PM

#

iron topaz they shuld use uncompressed 8k textures for everything

you're getting it!

#

i don't think we'll get LLMs running locally in our games. Games that use those will tie into openai's api or whatever exists since that's busy self immolating right now

iron topaz Nov 21, 2023, 10:11 PM

#

another option are games that will use ai feutres so nvida needs to add more vram for that

quartz viper Nov 21, 2023, 10:11 PM

#

iron topaz another option are games that will use ai feutres so nvida needs to add more vra...

yeah. i think that stuff is going to be server side 😦

iron topaz Nov 21, 2023, 10:11 PM

#

quartz viper yeah. i think that stuff is going to be server side 😦

naw that would be hell

#

but you are probably right 😦

quartz viper Nov 21, 2023, 10:11 PM

#

Microsoft just bought activision. do you think things will get betteR?

iron topaz Nov 21, 2023, 10:12 PM

#

i hope the sever costs are so extreme and the user so unpredictibil that it iwll not work

quartz viper Nov 21, 2023, 10:12 PM

#

You think GTA6 trailer is going to show off client side ai? lol

glossy plover Nov 21, 2023, 10:12 PM

#

gaming cards were 4-24gb for about a decade now. at this point it probably won't cost NVIDIA more to bump that up to higher VRAMs

iron topaz Nov 21, 2023, 10:13 PM

#

server costs are not free. and game companys that run llm 24/7 will notce that soon if they try to do that

quartz viper Nov 21, 2023, 10:13 PM

#

glossy plover gaming cards were 4-24gb for about a decade now. at this point it probably won't...

true, but nvidia likes money and there's no gaming demand for more than 24gb. games can't fill the 4090 or 3090 up and won't for a few years. especially as software optimizations like nanite and lumen are brought in

iron topaz Nov 21, 2023, 10:13 PM

#

even open ai seems to make a loss

quartz viper Nov 21, 2023, 10:13 PM

#

iron topaz server costs are not free. and game companys that run llm 24/7 will notce that s...

yeah but server costs are also a huge piracy stopper

lean gate Nov 21, 2023, 10:14 PM

#

quartz viper true, but nvidia likes money and there's no gaming demand for more than 24gb. g...

well even 3090/4090 struggled with cyberpunk before the last update/dlc

quartz viper Nov 21, 2023, 10:14 PM

#

iron topaz even open ai seems to make a loss

think open ai is a non profit that mandates their for profit arm to make as little money as they can

#

their server costs are basically donated by microsoft

iron topaz Nov 21, 2023, 10:15 PM

#

quartz viper yeah but server costs are also a huge piracy stopper

you are not understanding how high llm server costs are. open ai makes a loss with a 20/amothn subsciption

quartz viper Nov 21, 2023, 10:15 PM

#

lean gate well even 3090/4090 struggled with cyberpunk before the last update/dlc

you sure? cyberpunk has never filled more than 12gb of my vram

quartz viper Nov 21, 2023, 10:15 PM

#

iron topaz you are not understanding how high llm server costs are. open ai makes a loss wi...

you're not understanding tha microsoft donates azure compute to them

#

Microsoft, the games publisher, who has legions of servers

iron topaz Nov 21, 2023, 10:16 PM

#

quartz viper you're not understanding tha microsoft donates azure compute to them

yes but that shows that its not making profit. so toher comapys will not realy be able to do it unless they want to pay for gamers server credits

glossy plover Nov 21, 2023, 10:16 PM

#

also NVIDIA keep acknowledging the existence of the many users of their GPUs to run AI locally. they literally made the last few drivers specifically for Stable Diffusion. there could be a good chance that they'll bump up VRAM in the future

quartz viper Nov 21, 2023, 10:16 PM

#

i think corporate strategy is very much going to try to price home local AI out and consolidate it all locally. Thats why we need stability. They're the only major players releasing this stuff

iron topaz Nov 21, 2023, 10:16 PM

#

glossy plover also NVIDIA keep acknowledging the existence of the many users of their GPUs to ...

good copium

quartz viper Nov 21, 2023, 10:17 PM

#

Runway ML sure didn't take up the torch. Stability got an open model released before they ever put gen2 out

iron topaz Nov 21, 2023, 10:17 PM

#

stabillety ai gpu when 👀

quartz viper Nov 21, 2023, 10:17 PM

#

glossy plover also NVIDIA keep acknowledging the existence of the many users of their GPUs to ...

its true. it's a good halo feature to get people buying their brand over amd. AMD has sort of relied on pushing vram higher and higher over the years, as a way to stay brand relevant

#

another compnay making GPUs would still have to rely on the same silicon forges that all the processor makers do

#

microsoft making a processor design now even. everyone's getting into it

#

i'm hoping intel's new ML focused instruction sets will bring the kind of speed GPU's benefit with, to the CPU. Thhen we just need to stick a new dimm into an open bank

#

Tried and true scaling form

jade oar Nov 21, 2023, 10:23 PM

#

what the hell those results all look insane

half hound Nov 21, 2023, 10:24 PM

#

what directory are you supposed to put the safetensor files?

jade oar Nov 21, 2023, 10:27 PM

#

the temporal coherence here is much much better just looking at it, no?

lean gate Nov 21, 2023, 10:28 PM

#

Useful info: https://x.com/timudk/status/1727064128223855087?s=46&t=0WU4EHlrcScluYAy-xmVqQ

jade oar Nov 21, 2023, 10:28 PM

#

jade oar the temporal coherence here is much much better just looking at it, no?

a lot less trippy-looking glitchy visuals

timid storm Nov 21, 2023, 10:28 PM

#

lean gate Useful info: https://x.com/timudk/status/1727064128223855087?s=46&t=0WU4EHlrcScl...

tim has used this model far more than anyone else on earth

lean gate Nov 21, 2023, 10:31 PM

#

iron topaz Nov 21, 2023, 10:38 PM

#

when i inslal it do i need ti install the pt2 requerments or pt13 requermetns or something diffrent ?

tiny cradle Nov 21, 2023, 10:43 PM

#

grim tangle generated with SVD, interpolated 3x

those look really nice. how many frames are you getting? what are you using for interpolation?

grim tangle Nov 21, 2023, 10:43 PM

#

tiny cradle those look really nice. how many frames are you getting? what are you using for ...

default settings mostly, 14 frames and used topaz for this one

chilly nymph Nov 21, 2023, 10:44 PM

#

Is there a stable diffusion discord bot that I can add to my own server?

tiny cradle Nov 21, 2023, 10:44 PM

#

grim tangle default settings mostly, 14 frames and used topaz for this one

torch 2 or 1? I wonder why i can only get 8 frames on the 4090 with lowvram on.

grim tangle Nov 21, 2023, 10:45 PM

#

tiny cradle torch 2 or 1? I wonder why i can only get 8 frames on the 4090 with lowvram on.

2, didn't try using the streamlit

tiny cradle Nov 21, 2023, 10:45 PM

#

ah maybe that's it then - thanks

grim tangle Nov 21, 2023, 10:45 PM

#

https://cdn.discordapp.com/attachments/1138865343314530324/1176650724021633095/000025_apo8.mp4?ex=656fa461&is=655d2f61&hm=dd8d575d524b3abd80f36a81d66cd203fcd77ac7f13375a115569ce1dfe6290a&

▶ Play video

#

most impressive one to me so far

quartz viper Nov 21, 2023, 10:46 PM

#

https://tenor.com/view/muybridge-gif-25225141

Tenor

grim tangle Nov 21, 2023, 10:47 PM

#

tiny cradle ah maybe that's it then - thanks

I also reduced the decoding_t... using only 2 now

quartz viper Nov 21, 2023, 10:47 PM

#

what a time to be alive!

tiny cradle Nov 21, 2023, 10:47 PM

#

grim tangle I also reduced the decoding_t... using only 2 now

yeah that was the setting I was missing

iron topaz Nov 21, 2023, 10:47 PM

#

grim tangle https://cdn.discordapp.com/attachments/1138865343314530324/1176650724021633095/0...

how did u install it? i jsut need a overview idea

#

just clone the github and create a checkpoints folder and put them into it?

grim tangle Nov 21, 2023, 10:48 PM

#

iron topaz how did u install it? i jsut need a overview idea

mostly just followed the instructions here: https://github.com/Stability-AI/generative-models
moved the default sample script to the root of the project and put the model from hugginface to checkpoints -folder

iron topaz Nov 21, 2023, 10:48 PM

#

grim tangle mostly just followed the instructions here: https://github.com/Stability-AI/gene...

they are not realy detailed

half hound Nov 21, 2023, 10:49 PM

#

@iron topaz imagiAlrgy-Bryce posted this earlier.
setup (Python 3.10, 4090):
pip install -r requirements/pt2.txt
pip install .
-modify streamlit_helpers.py lowvram_mode = True
streamlit run scripts/demo/video_sampling.py

#

I set up a virtual environment first though

#

you also need to do pip install steamlit

grim tangle Nov 21, 2023, 10:49 PM

#

that's pretty much it yeah

half hound Nov 21, 2023, 10:49 PM

#

but I ran into an issue

iron topaz Nov 21, 2023, 10:49 PM

#

ok thanks

half hound Nov 21, 2023, 10:49 PM

#

still trying to figure it out

#

tiny cradle Nov 21, 2023, 10:50 PM

#

I've updated instructions. #▶｜stable-video-diffusion message

iron topaz Nov 21, 2023, 10:50 PM

#

nice i installed the wrong requerments file but i guess thats what conda is for xdD

half hound Nov 21, 2023, 10:50 PM

#

tiny cradle I've updated instructions. https://discord.com/channels/1002292111942635562/1176...

This should get pinned

tiny cradle Nov 21, 2023, 10:50 PM

#

half hound

not sure what causes that

half hound Nov 21, 2023, 10:51 PM

#

I am guessing because I can't figure out where to put the safetensors files lol

#

that might be the issue

iron topaz Nov 21, 2023, 10:51 PM

#

half hound I am guessing because I can't figure out where to put the safetensors files lol

i would jsut create a folder and called checkpoints in the main folder

#

if you look into the skript you see that it looks for it ther. but idk

half hound Nov 21, 2023, 10:52 PM

#

i'll try that

iron topaz Nov 21, 2023, 10:54 PM

#

why does everything happen when i am supost to sleep x-x

grim tangle Nov 21, 2023, 10:55 PM

#

iron topaz i would jsut create a folder and called checkpoints in the main folder

this is correct

grim tangle Nov 21, 2023, 10:56 PM

#

iron topaz why does everything happen when i am supost to sleep x-x

yeah just looked at the time, last 2 hours just gone 😄

iron topaz Nov 21, 2023, 10:56 PM

#

hahahah

timid storm Nov 21, 2023, 10:56 PM

#

the model is rly addictive to use

iron topaz Nov 21, 2023, 10:57 PM

#

i didint even use it jet x.x

half hound Nov 21, 2023, 10:57 PM

#

iron topaz i didint even use it jet x.x

same 😢

#

I feel like I am so close to getting it working lol

iron topaz Nov 21, 2023, 10:58 PM

#

just isntalling a ton of dependecies.

half hound Nov 21, 2023, 10:58 PM

#

are you on windows?

iron topaz Nov 21, 2023, 10:58 PM

#

no

#

kubuntu

half hound Nov 21, 2023, 10:58 PM

#

ah ok

iron topaz Nov 21, 2023, 10:59 PM

#

but my 3090 is halve broken let hope it works. becaseu a evil person sold it to me. in most tasks it works but at some intense ai stuff it blacksceens my pc

half hound Nov 21, 2023, 11:00 PM

#

darn

iron topaz Nov 21, 2023, 11:00 PM

#

i can play cyperpunk very well but animate diff for more then 20frames is to mutch

half hound Nov 21, 2023, 11:06 PM

#

https://tenor.com/view/dance-party-lets-gif-25999992

Tenor

#

finally got it

grim tangle Nov 21, 2023, 11:06 PM

#

as 3D modeler, this is kinda crazy to me

iron topaz Nov 21, 2023, 11:06 PM

#

cool

half hound Nov 21, 2023, 11:06 PM

#

so I ran into 3 issues

#

idk if you will run into them

#

but this is what I had to fix

#

in video_sampling.py I changed it to streamlit_helpers import *

iron topaz Nov 21, 2023, 11:09 PM

#

ok

tiny cradle Nov 21, 2023, 11:09 PM

#

iron topaz ok

what version of python are you using?

half hound Nov 21, 2023, 11:09 PM

#

3.10

iron topaz Nov 21, 2023, 11:10 PM

#

tiny cradle what version of python are you using?

same but i am still downloading the wights

tiny cradle Nov 21, 2023, 11:10 PM

#

and you're running from the root of the project right?

half hound Nov 21, 2023, 11:11 PM

#

had to do pip install torchvision and pip install opencv-python as well

grim tangle Nov 21, 2023, 11:21 PM

#

on windows I just copied the scripts from the demo folder to the root and ran from there

pastel storm Nov 21, 2023, 11:22 PM

#

Has anyone gotten this to run locally yet?

grim tangle Nov 21, 2023, 11:22 PM

#

pastel storm Has anyone gotten this to run locally yet?

yeah, it does run on 4090 at least... dips into system ram so it's slow, but it runs

pastel storm Nov 21, 2023, 11:23 PM

#

I’m guessing there will be optimizations in the future tho? I have a 3060

turbid rampart Nov 21, 2023, 11:23 PM

#

grim tangle yeah, it does run on 4090 at least... dips into system ram so it's slow, but it ...

someone mentioned they were able to generate 8 frames on a 4090, that's insane

nimble cape Nov 21, 2023, 11:24 PM

#

Hey! First time poster. I saw you guys just launched your video model. Super big step... congrats. I am intersted in working with it.... I am going to need a new system though. Other then the 4090 is there a card that runs the model well?

grim tangle Nov 21, 2023, 11:24 PM

#

turbid rampart someone mentioned they were able to generate 8 frames on a 4090, that's insane

I'm doing 14 at the default res on 4090, it's just slow

tiny cradle Nov 21, 2023, 11:24 PM

#

nimble cape Hey! First time poster. I saw you guys just launched your video model. Super big...

A $6000 A100 🙂 (non-official answer)

earnest tartan Nov 21, 2023, 11:25 PM

#

When this has matured, will the VRAM requirement be lower or greater?

turbid rampart Nov 21, 2023, 11:25 PM

#

grim tangle I'm doing 14 at the default res on 4090, it's just slow

i got my pc when all of this took off and i thought i would future proof myself with an rtx 3070. welp, look where we are now 💀

iron topaz Nov 21, 2023, 11:25 PM

#

i get this message 🤔

nimble cape Nov 21, 2023, 11:26 PM

#

tiny cradle A $6000 A100 🙂 (non-official answer)

What type of system would run this card well?

tiny cradle Nov 21, 2023, 11:26 PM

#

nimble cape What type of system would run this card well?

no idea. i've never run one locally. probably rent a cloud one first

iron topaz Nov 21, 2023, 11:26 PM

#

nimble cape What type of system would run this card well?

i think online renting would be smarter

grim tangle Nov 21, 2023, 11:27 PM

#

iron topaz i get this message 🤔

I got rid of that by just copying the scrips from the demo folder to the root, and running from there, dunno if there's more elegant solution

tiny cradle Nov 21, 2023, 11:27 PM

#

iron topaz i get this message 🤔

show the command you're running to get this

half hound Nov 21, 2023, 11:28 PM

#

pip install einops, pip install imwatermark, pip install invisible-watermark, pip install omegaconf ran all these other dependencies, thought it was weird I had to install so many, then it circled me back to the script issue again. I am going to make an issue post on the repo. I did get it to the main page, but as soon as I try and select a model it gives me an error

tiny cradle Nov 21, 2023, 11:28 PM

#

half hound pip install einops, pip install imwatermark, pip install invisible-watermark, pi...

sounds like you didn't pip install the requirements file

iron topaz Nov 21, 2023, 11:28 PM

#

witch one is needed the 2 or the 13?

half hound Nov 21, 2023, 11:28 PM

#

I ran the pt2.txt

#

yeah I guess I should have done 13 as well

#

i'll try that lol

tiny cradle Nov 21, 2023, 11:29 PM

#

iron topaz witch one is needed the 2 or the 13?

pt2 worked for me and i haven't tried 13

half hound Nov 21, 2023, 11:29 PM

#

which model did you use?

tiny cradle Nov 21, 2023, 11:29 PM

#

maybe it didn't succeed? or maybe windows is just hard. SVD

half hound Nov 21, 2023, 11:31 PM

#

got this error when trying to pip install pt13:
ERROR: Ignored the following versions that require a different python version: 1.6.2 Requires-Python >=3.7,<3.10; 1.6.3 Requires-Python >=3.7,<3.10; 1.7.0 Requires-Python >=3.7,<3.10; 1.7.1 Requires-Python >=3.7,<3.10
ERROR: Could not find a version that satisfies the requirement triton==2.0.0.post1 (from versions: none)
ERROR: No matching distribution found for triton==2.0.0.post1

tiny cradle Nov 21, 2023, 11:32 PM

#

yeah probably comment out triton - might be just for linux. not sure

west carbon Nov 21, 2023, 11:33 PM

#

I am on WSL2 and I am getting the same issue: No module named 'scripts'

#

tried different python versions, conda environment, venv, etc...

tiny cradle Nov 21, 2023, 11:33 PM

#

west carbon I am on WSL2 and I am getting the same issue: No module named 'scripts'

show the exact command you are running

iron topaz Nov 21, 2023, 11:34 PM

#

ok i solved it by moving the skript to the main directory. strage

west carbon Nov 21, 2023, 11:35 PM

#

#

this is after loading the page in the browser.. similar error in there

iron topaz Nov 21, 2023, 11:36 PM

#

west carbon

move the skript to the miain directory then it works

#

and start it with this comand "streamlit run video_sampling.py
"

tiny cradle Nov 21, 2023, 11:37 PM

#

west carbon

interesting. moving the script apparently works. wonder why I didn't have to do that

west carbon Nov 21, 2023, 11:37 PM

#

oh, I will try, thanks

tiny cradle Nov 21, 2023, 11:37 PM

#

you might try PYTHONPATH=. streamlit run scripts/demo/video_sampling.py

iron topaz Nov 21, 2023, 11:38 PM

#

tiny cradle interesting. moving the script apparently works. wonder why I didn't have to do...

gpt4 said it shuld work ahahha

#

what options do u use?

tiny cradle Nov 21, 2023, 11:39 PM

#

just svd should be fine

iron topaz Nov 21, 2023, 11:39 PM

#

why is ther no way to imput a prompt?

tiny cradle Nov 21, 2023, 11:39 PM

#

yeah thats not a feature of this

iron topaz Nov 21, 2023, 11:40 PM

#

image to video?

west carbon Nov 21, 2023, 11:40 PM

#

iron topaz move the skript to the miain directory then it works

I confirm this works for me as well, thanks man

iron topaz Nov 21, 2023, 11:41 PM

#

tiny cradle yeah thats not a feature of this

?

tiny cradle Nov 21, 2023, 11:41 PM

#

you input an image and get out a video. no prompt

iron topaz Nov 21, 2023, 11:42 PM

#

tiny cradle you input an image and get out a video. no prompt

ok it isdoing something

#

hels hope that it does not crash

#

i am at about 20gb vram

tiny cradle Nov 21, 2023, 11:43 PM

#

looks like it uses 15 gb during generation

iron topaz Nov 21, 2023, 11:43 PM

#

at what res are u genrating

tiny cradle Nov 21, 2023, 11:44 PM

#

and decoding i'm not sure but you can set it to decode 1 image at a time

#

1024x576

iron topaz Nov 21, 2023, 11:44 PM

#

it used 20 while genrating and tehn it crashed becseu out of mamorey.

#

how can i activate system memory fallback on linux?

tiny cradle Nov 21, 2023, 11:45 PM

#

make sure you did the low memory changes:
#▶｜stable-video-diffusion message

iron topaz Nov 21, 2023, 11:47 PM

#

tiny cradle **Setup Instructions** (Python 3.10, 4090, working on Linux): - git clone the re...

daim i wish i saw this erlyer nice gude

#

nice it worked thanks for all the help

grim tangle Nov 21, 2023, 11:53 PM

#

decoding 1 at the time actually makes this run fast on 4090, duh...

#

should've just done that from the start 😛

turbid rampart Nov 21, 2023, 11:54 PM

#

iron topaz

here is a 30fps version

#

i hope you don't mind me trying it out with your video

iron topaz Nov 21, 2023, 11:55 PM

#

dont google cat ! xD

turbid rampart Nov 21, 2023, 11:56 PM

#

iron topaz dont google cat ! xD

i get images like this, what are you getting? xD

iron topaz Nov 21, 2023, 11:56 PM

#

turbid rampart here is a 30fps version

how did u make the movment smoth?

iron topaz Nov 21, 2023, 11:57 PM

#

turbid rampart i get images like this, what are you getting? xD

images like this xD

turbid rampart Nov 21, 2023, 11:57 PM

#

iron topaz how did u make the movment smoth?

i'm using flowframes :) in a minute i will upload an interpolated video of the interpolated video to see if it can smoothen the inbetween frames

#

instead of it being partly choppy

iron topaz Nov 21, 2023, 11:58 PM

#

cool

turbid rampart Nov 22, 2023, 12:01 AM

#

#

here it is as a looping gif i think

half hound Nov 22, 2023, 12:01 AM

#

Thanks @tiny cradle @iron topaz for the help. Looks like it's finally generating. Crossing fingers this time.

iron topaz Nov 22, 2023, 12:02 AM

#

half hound Thanks <@613395195031191585> <@354223946414948353> for the help. Looks like it's...

nice did u use the low vram setting?

patent escarp Nov 22, 2023, 12:03 AM

#

stuck on ModuleNotFoundError: No module named 'imwatermark' unfortunately even though its installed

half hound Nov 22, 2023, 12:04 AM

#

Moving video_sampling.py to main dir worked. I ran into some other errors. Said I had python version 3.10.13 and said I should use v 3.10.11 so I created a new venv and then I also had to do pip install xformers and I had to take out triton==2.0.0 from the pt2.txt file to get it to work.

half hound Nov 22, 2023, 12:04 AM

#

iron topaz nice did u use the low vram setting?

yep

half hound Nov 22, 2023, 12:04 AM

#

patent escarp stuck on `ModuleNotFoundError: No module named 'imwatermark'` unfortunately even...

yeah I ran into that too

patent escarp Nov 22, 2023, 12:05 AM

#

i commented out triton since i'm on a mac and i moved the video_sampling.py to the root dir. still no go though. your vids look good tho! 😄

half hound Nov 22, 2023, 12:06 AM

#

#▶｜stable-video-diffusion message follow ImaginAlry-Bryce instuctions

#

isn't triton for unix systems?

#

I am on windows

patent escarp Nov 22, 2023, 12:09 AM

#

some forums say its for linux, not sure if its for mac though. i wasnt able to pip install it even with python 3.10. i'll keep pluggin away.

iron topaz Nov 22, 2023, 12:10 AM

#

i dont think it will work for mac

#

but i am sure that it will work in future

patent escarp Nov 22, 2023, 12:14 AM

#

yeah, you are probably right. i'll just sit tight for awhile and work on some other projects : P

iron topaz Nov 22, 2023, 12:30 AM

#

patent escarp yeah, you are probably right. i'll just sit tight for awhile and work on some ot...

u can do this https://www.youtube.com/watch?v=2Tv5ZfPabGM

YouTube

AI Readme

How to Setup LLaVA Locally Using llama.cpp - Apple Silicon Supported

Follow along and set up LLaVA: Large Language and Vision Assistant on your Silicon Mac and any other llama.cpp supported platforms. The performance of 4bit quantized 7B model is amazing and this can be your local ChatGPT Vision alternative and keep your data private.

Timestamps:
00:00 - Introduction
00:59 - Installation & building LLaVA
02:18 -...

▶ Play video

#

it can see images

glossy plover Nov 22, 2023, 2:31 AM

#

I've managed to fix stuff, now getting FileNotFoundError: [Errno 2] No such file or directory: 'outputs/demo/vid/svd\\samples\\000003_h264.mp4'

glossy plover Nov 22, 2023, 2:33 AM

#

iron topaz u can do this https://www.youtube.com/watch?v=2Tv5ZfPabGM

yeah, LLaVa was said to even be a possible future text/image encoder for SD3

gleaming tide Nov 22, 2023, 2:40 AM

#

so if you don't have a 3090, best to just wait for the website launch?

oak swift Nov 22, 2023, 2:45 AM

#

probably basic question, when I git clone git@github.com:Stability-AI/generative-models.git it says "Cloning into 'generative-models'...
git@github.com: Permission denied (publickey).
fatal: Could not read from remote repository.

Please make sure you have the correct access rights
and the repository exists."

quartz crescent Nov 22, 2023, 2:46 AM

#

grim tangle decoding 1 at the time actually makes this run fast on 4090, duh...

haha yeah, just barely able to squeeze it all into 24g vram with fp16 and decode_t=1

~2min for 25 frames with xt on my 3090

#

very cool

oak swift Nov 22, 2023, 2:47 AM

#

can you get the dogs to actually move and not just camera possibly, like if you says dogs running?

#

looks good though.

nova sparrow Nov 22, 2023, 2:56 AM

#

glossy plover it already got to a point where it automatically opened my browser, but it still...

did you ever figure this out 👉👈

quartz crescent Nov 22, 2023, 2:58 AM

#

oak swift can you get the dogs to actually move and not just camera possibly, like if you ...

no you cant prompt it unfortunately

west carbon Nov 22, 2023, 3:00 AM

#

you can try changing the seed

oak swift Nov 22, 2023, 3:09 AM

#

quartz crescent no you cant prompt it unfortunately

Do mean can't prompt specific motion like animatediff, but general prompts work?

glossy plover Nov 22, 2023, 3:10 AM

#

nova sparrow did you ever figure this out 👉👈

Yes, but it created more problems I wasn't able to solve.

nova sparrow Nov 22, 2023, 3:10 AM

#

me

glossy plover Nov 22, 2023, 3:10 AM

#

glossy plover I've managed to fix stuff, now getting `FileNotFoundError: [Errno 2] No such fil...

This

quartz crescent Nov 22, 2023, 3:29 AM

#

oak swift Do mean can't prompt specific motion like animatediff, but general prompts work?

you cant prompt at all(?) (in the current release. base model supports txt2vid/txt2img2vid according to the paper)

from hf page:

Limitations
The generated videos are rather short (<= 4sec), and the model does not achieve perfect photorealism.
The model may generate videos without motion, or very slow camera pans.
The model cannot be controlled through text.
The model cannot render legible text.
Faces and people in general may not be generated properly.
The autoencoding part of the model is lossy.

west carbon Nov 22, 2023, 3:30 AM

#

here is a character moving... it's funny because it seems like the character is trying out some camera lenses that distort him, and he is aware of that... haha

oak swift Nov 22, 2023, 3:30 AM

#

ohh, yeah my bad. I forgot txt to vid wasn't released yet.

quartz crescent Nov 22, 2023, 3:48 AM

#

west carbon here is a character moving... it's funny because it seems like the character is ...

i get a bunch of motion distortions at the default 6fps but barely any at 12+

west carbon Nov 22, 2023, 3:50 AM

#

good to know, thanks!. I would love more explanation about the rest of parameters, like for example: s_churn #1, etc...

quartz crescent Nov 22, 2023, 3:51 AM

#

6/12/24 same seed

west carbon Nov 22, 2023, 3:55 AM

#

have you tried to loopback it? take the last frame and feed it again at 24 fps using the same seed

quartz crescent Nov 22, 2023, 4:07 AM

#

west carbon good to know, thanks!. I would love more explanation about the rest of parameter...

yeah theres a lot to mess around with for sure

quartz crescent Nov 22, 2023, 4:07 AM

#

west carbon have you tried to loopback it? take the last frame and feed it again at 24 fps u...

im trying this rn kek

west carbon Nov 22, 2023, 4:11 AM

#

haha, me too

quartz crescent Nov 22, 2023, 4:20 AM

#

ooooh yeah the autoencoder is lossy indeed agony

gleaming tide Nov 22, 2023, 4:28 AM

#

#

from a dalle3 still image

quartz crescent Nov 22, 2023, 4:31 AM

#

quartz crescent ooooh yeah the autoencoder is lossy indeed <:agony:1002961183105634415>

timid storm Nov 22, 2023, 5:05 AM

#

strange pawn Nov 22, 2023, 5:06 AM

#

anyone got advice for running in windows with 4090 - i have to restart the streamlit app after every attempt; seams like a gpu mem leak. Yes i'm using "Decode t frames at a time (set small if you are low on VRAM) = 1"

rustic hinge Nov 22, 2023, 5:13 AM

#

does anyone have a published script to run this yet?

#

it looks like stable diffusion didn't publish a script to use it

gleaming tide Nov 22, 2023, 5:25 AM

#

#

doesn't work so great on 2d art but I'm fascinated by what it came up with here

quartz crescent Nov 22, 2023, 5:38 AM

#

strange pawn anyone got advice for running in windows with 4090 - i have to restart the strea...

there's no leak (unless you try to switch to the other model 99pepefakelaugh ), just hit sample again & it'll go without reallocating

torpid falcon Nov 22, 2023, 5:42 AM

#

Is there a way to run this in a Colab? Would it be hard for me to figure out how to create one? I have so many questions and so little vram 😭😭😭

tiny cradle Nov 22, 2023, 5:48 AM

#

A lot of my generations are just doing simple translations of the image. Annoying. Any tricks to prevent that?

echo ether Nov 22, 2023, 6:06 AM

#

gleaming tide

Dang looks pretty rough, that was my main interest

#

Maybe there will be a finetune or something

gleaming tide Nov 22, 2023, 6:08 AM

#

for 2D it most often does camera pans and doesn't really animate them at all

obtuse bridge Nov 22, 2023, 6:10 AM

#

haha, just ported the VAE to AnimateDiff. It works really well for it.

#

It sure loves eating VRAM though.

tiny cradle Nov 22, 2023, 6:12 AM

#

obtuse bridge haha, just ported the VAE to AnimateDiff. It works really well for it.

is this a comparison?

obtuse bridge Nov 22, 2023, 6:12 AM

#

tiny cradle is this a comparison?

Yes, first is the default decoder, second is the temporal decoder with timesteps=4 (which I am assuming is just the amount of cross-frame attention applied)

tiny cradle Nov 22, 2023, 6:18 AM

#

so the new vae is less noisy. nice

raw coral Nov 22, 2023, 6:38 AM

#

A small question, what's the difference between svd and svd_image_decoder?

shell plume Nov 22, 2023, 6:49 AM

#

Hello everyone

#

I created a Google Colab to test it, and it's working quite well with an A100. It downloads two models (svd and svd_xt) from Hugging Face. If it's useful to anyone, here's the link: https://bit.ly/stable-difussion-video

I'm also posting some results. Does anyone know which settings to adjust to make the movement smoother?

gleaming tide Nov 22, 2023, 6:50 AM

#

your link is not public

sinful vine Nov 22, 2023, 6:50 AM

#

peak shore Nov 22, 2023, 6:51 AM

#

gleaming tide

nice

shell plume Nov 22, 2023, 6:52 AM

#

gleaming tide your link is not public

u right, open now

#

unborn acorn Nov 22, 2023, 7:01 AM

#

Not very good quality with this type of images

shell plume Nov 22, 2023, 7:01 AM

#

echo ether Nov 22, 2023, 7:10 AM

#

shell plume

It seems like theres some sort of a bias for looping vids or small movements...

obtuse bridge Nov 22, 2023, 7:23 AM

#

raw coral A small question, what's the difference between svd and svd_image_decoder?

svd uses the new temporal VAE, svd_image_decoder uses the normal vae that SD1.5/2.1 uses. The new one should generate less noisy outputs

bold prawn Nov 22, 2023, 7:30 AM

#

shell plume Nov 22, 2023, 7:32 AM

#

bold prawn

is xt? did you change the default settings?

bold prawn Nov 22, 2023, 7:34 AM

#

shell plume is xt? did you change the default settings?

nope, only use The official website to generated(127motion) for free and then topaz it

unborn acorn Nov 22, 2023, 7:37 AM

#

20steps with EulerA. Faster generation, a bit less movement.

#

Same with 30steps.

shell plume Nov 22, 2023, 7:39 AM

#

fair otter Nov 22, 2023, 7:53 AM

#

Hi everyone. I’m trying to get this running on either a1111 or comfy and maybe I’m just tired but I feel like I’m missing something any help is appreciated.

tiny cradle Nov 22, 2023, 7:54 AM

#

fair otter Hi everyone. I’m trying to get this running on either a1111 or comfy and maybe I...

Unless you're doing to the integration yourself I doubt those support this yet.

fair otter Nov 22, 2023, 8:00 AM

#

tiny cradle Unless you're doing to the integration yourself I doubt those support this yet.

Ah well. I was hopeful 😅 not entirely sure how to do the GitHub pull to test this out as is. Thanks for the snappy response though.

pastel storm Nov 22, 2023, 8:05 AM

#

tiny cradle Unless you're doing to the integration yourself I doubt those support this yet.

I’m guessing it’s still unoptimized and would need a 4090 minimum?

tiny cradle Nov 22, 2023, 8:06 AM

#

pastel storm I’m guessing it’s still unoptimized and would need a 4090 minimum?

someone was running it on a 3090. I believe it uses 15 gb when run in lowvram mode

pastel storm Nov 22, 2023, 8:07 AM

#

So close to 12gb! I have a 3060 so I anticipate further optimizations

mental magnet Nov 22, 2023, 8:07 AM

#

pastel storm Has anyone gotten this to run locally yet?

i'm trying run on A100

tiny cradle Nov 22, 2023, 8:07 AM

#

number of frames increases memory usage so you might be able to run it just generating fewer frames

pastel storm Nov 22, 2023, 8:08 AM

#

Interesting, do you think there will be a ui for it soon?

royal nebula Nov 22, 2023, 8:08 AM

#

obtuse bridge svd uses the new temporal VAE, svd_image_decoder uses the normal vae that SD1.5/...

how to use the new temporal VAE for AnimateDiff? waow

obtuse bridge Nov 22, 2023, 8:09 AM

#

royal nebula how to use the new temporal VAE for AnimateDiff?<:waow:1017853838516035725>

Depends on how much effort you want to put in 😛

#

Just wait for diffusers to implement it

#

I just extracted the decoder weights from the model and hacked together something horrible in a jupyter notebook

#

Hopefully diffusers implements some optimizations too because... well, the VAE is requiring 24GB to run on this machine.

#

Assuming I didn't break something which is entirely a possibility still, this desperately needs some form of tiling

tiny cradle Nov 22, 2023, 8:11 AM

#

it doesnt require that if you set the number of images to decode at atime to 1

obtuse bridge Nov 22, 2023, 8:12 AM

#

tiny cradle it doesnt require that if you set the number of images to decode at atime to 1

Is that just turning cross-frame attention off entirely, though?

tiny cradle Nov 22, 2023, 8:12 AM

#

no its not

#

i think...

obtuse bridge Nov 22, 2023, 8:13 AM

#

Also I must emphasize that I have completely gutted the model, I am just using the decoder. I'm guessing it's the timesteps argument?

tiny cradle Nov 22, 2023, 8:14 AM

#

decoding_t

#

model.en_and_decode_n_samples_a_time = decoding_t

obtuse bridge Nov 22, 2023, 8:14 AM

#

Yeah that's the timesteps argument I think

tiny cradle Nov 22, 2023, 8:14 AM

#

doubt it

#

that usually means something else

obtuse bridge Nov 22, 2023, 8:14 AM

#

Setting it to 1 reduced peak memory consumption to under 20gb

obtuse bridge Nov 22, 2023, 8:15 AM

#

tiny cradle that usually means something else

I know it does but that is what it is referred to as in the code /shrug

#

...no, memory usage is still at 20 when using timesteps=16?

#

I'm just going to have to run the profiler PAIN

tiny cradle Nov 22, 2023, 8:17 AM

#

obtuse bridge I know it does but that is what it is referred to as in the code /shrug

yeah I'm not sure but strong hunch timesteps isn't doing what you think

solemn turtle Nov 22, 2023, 8:17 AM

#

iron topaz but my 3090 is halve broken let hope it works. becaseu a evil person sold it to ...

U should always buy a gpu new. Used gpus are often really used

tiny cradle Nov 22, 2023, 8:18 AM

#

what class is video_encoder

obtuse bridge Nov 22, 2023, 8:18 AM

#

tiny cradle yeah I'm not sure but strong hunch timesteps isn't doing what you think

I remember that line of code though that you mentioned, it's just been getting late at this point and i don't recall off hand where exactly it is

obtuse bridge Nov 22, 2023, 8:19 AM

#

tiny cradle what class is video_encoder

sgm.modules.autoencoding.temporal_ae.VideoDecoder

#

inherits from decoder which I think has the actual important methods in it

tiny cradle Nov 22, 2023, 8:23 AM

#

hmm i'm tracing how the argument gets to the decoder

obtuse bridge Nov 22, 2023, 8:24 AM

#

tiny cradle it doesnt require that if you set the number of images to decode at atime to 1

actually where does the pipeline where you use this value start? probably pretty far from where i am

tiny cradle Nov 22, 2023, 8:26 AM

#

n_samples = default(self.en_and_decode_n_samples_a_time, z.shape[0])

#

                if isinstance(self.first_stage_model.decoder, VideoDecoder):
                    kwargs = {"timesteps": len(z[n * n_samples : (n + 1) * n_samples])}

#

so timesteps is related

#

you might be right

#

so I think you're missing this logic thats in DiffusionEngine

    @torch.no_grad()
    def decode_first_stage(self, z):
        z = 1.0 / self.scale_factor * z
        n_samples = default(self.en_and_decode_n_samples_a_time, z.shape[0])

        n_rounds = math.ceil(z.shape[0] / n_samples)
        all_out = []
        with platform_appropriate_autocast(
            enabled=not self.disable_first_stage_autocast
        ):
            for n in range(n_rounds):
                if isinstance(self.first_stage_model.decoder, VideoDecoder):
                    kwargs = {"timesteps": len(z[n * n_samples : (n + 1) * n_samples])}
                else:
                    kwargs = {}
                out = self.first_stage_model.decode(
                    z[n * n_samples : (n + 1) * n_samples], **kwargs
                )
                all_out.append(out)
        out = torch.cat(all_out, dim=0)
        return out

obtuse bridge Nov 22, 2023, 8:29 AM

#

tiny cradle so I think you're missing this logic thats in `DiffusionEngine` ``` @torch.n...

yeah... just found that. that is what originally led me to believe it was timesteps that was important, since it was the only place I saw it being set

tiny cradle Nov 22, 2023, 8:29 AM

#

but this function is running the decoder multiple times, to decode all the frames, and then compiling them together

obtuse bridge Nov 22, 2023, 8:31 AM

#

so basically just what vae slicing does in diffusers

tiny cradle Nov 22, 2023, 8:32 AM

#

yes kind of

obtuse bridge Nov 22, 2023, 8:32 AM

#

thanks for the help

unborn acorn Nov 22, 2023, 8:40 AM

#

pastel storm I’m guessing it’s still unoptimized and would need a 4090 minimum?

I use 3090. This was funny and fast generation.

mental magnet Nov 22, 2023, 8:50 AM

#

Hello,

I'm currently experiencing a technical issue with a Python script that involves downloading a pretrained model using the open_clip module from Hugging Face Hub. However, I'm facing a LocalEntryNotFoundError as the script is unable to access the necessary files from the Hugging Face Hub due to network connectivity issues. The specific model in question is CLIP-ViT-H-14-laion2B-s32B-b79K, and the file it's trying to download is open_clip_pytorch_model.bin.

#

Given this situation, I'm considering manually downloading the model file and placing it in an appropriate location on my local system. However, I'm uncertain about the correct directory where the open_clip module expects to find this file. The default cache directory for huggingface_hub seems to be ~/.cache/huggingface/transformers/, but I'm not sure if this is where I should place the downloaded file.

Could you please advise on the correct procedure for manually downloading and placing the pretrained model file, so that my script can access it without needing to download it from the internet?

Thank you for your assistance.

ripe field Nov 22, 2023, 8:52 AM

#

mental magnet Hello, I'm currently experiencing a technical issue with a Python script that i...

panzhong?

mental magnet Nov 22, 2023, 9:08 AM

#

?

mental magnet Nov 22, 2023, 9:12 AM

#

ripe field panzhong?

person name？

#

nope

pure compass Nov 22, 2023, 9:13 AM

#

bold prawn

decoherence - is it done through decohereance?

finite breach Nov 22, 2023, 9:15 AM

#

Woohoo, new here

pure compass Nov 22, 2023, 9:34 AM

#

grim tangle as 3D modeler, this is kinda crazy to me

damm

wispy valley Nov 22, 2023, 10:48 AM

#

how much VRAM is needed for svd_xt? I tried it on 3090 24GB and OOM was reported...

solar willow Nov 22, 2023, 11:45 AM

#

Hello all!

#

Does anybody know anything about motion buckets and their id's? For now i just spam some random int from 1-255 but maybe it means something 😄 😄

solar willow Nov 22, 2023, 11:54 AM

#

wispy valley how much VRAM is needed for svd_xt? I tried it on 3090 24GB and OOM was reported...

same 😦
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 1.10 GiB. GPU 0 has a total capacty of 23.99 GiB of which 0 bytes is free. Of the allocated memory 18.91 GiB is allocated by PyTorch, and 3.34 GiB is reserved by PyTorch but unallocated.

unborn acorn Nov 22, 2023, 11:57 AM

#

wispy valley how much VRAM is needed for svd_xt? I tried it on 3090 24GB and OOM was reported...

Works just fine with 3090 with 24gb vram. You can set the resolution smaller to test like 512x512px. Takes less vramn and renders fasrer.

meager geode Nov 22, 2023, 12:04 PM

#

The generated resolution is 1024*576. Pictures that are not of this ratio will be automatically deformed and compressed to this ratio.🥹

onyx silo Nov 22, 2023, 12:20 PM

#

Look for decoding_t: int in the code and set it lower, defaults to 14 IIRC, can goes all the way down to 1, that should reduce VRAM reqirements.

#

Doesn't work on MPS though the stabilty-ai code assumes XFormers

nocturne magnetBOT Nov 22, 2023, 12:27 PM

#

FAQ: Where can I access the DreamStudio beta website?

https://beta.dreamstudio.ai

grim tangle Nov 22, 2023, 1:04 PM

#

comparing motion bucket id values, 1024x640, 25 frames with SVD-XT on 4090 (lowvram mode, decoding 1 at the time)

solar willow Nov 22, 2023, 1:06 PM

#

grim tangle comparing motion bucket id values, 1024x640, 25 frames with SVD-XT on 4090 (lowv...

wait what lowvram mode 🙂 i want SVD-XT on 4090 😄 😄

grim tangle Nov 22, 2023, 1:06 PM

#

solar willow wait what lowvram mode 🙂 i want SVD-XT on 4090 😄 😄

in the streamlit_helpers.py file:

solar willow Nov 22, 2023, 1:07 PM

#

grim tangle comparing motion bucket id values, 1024x640, 25 frames with SVD-XT on 4090 (lowv...

so basically the higher i put the number the more movement im expecting 🙂

grim tangle Nov 22, 2023, 1:07 PM

#

solar willow so basically the higher i put the number the more movement im expecting 🙂

yep, didn't think it would stay so consistent, need to test more, maybe lucky seed

solar willow Nov 22, 2023, 1:08 PM

#

Thank's DUDE! For the lowvram and for the comparison!!!!!

grim tangle Nov 22, 2023, 1:08 PM

#

yeah I didn't even dare to try XT at first... turns out it runs just fine...

#

1 min 22 seconds per gen

solar willow Nov 22, 2023, 1:12 PM

#

https://tenor.com/view/noice-nice-click-gif-8843762

Tenor

solar willow Nov 22, 2023, 1:23 PM

#

grim tangle in the streamlit_helpers.py file:

ok i did that but still got omme. Btw how are you running SVD-XT? Through simple_video_sample.py script? Is there some kind of UI for this?!

turbid sluice Nov 22, 2023, 1:24 PM

#

Interpolated afterwards. SVD_XT, fps 6, S_noise 1.08 (gives a bit sharper image), Decode_T is 1.

#

It lost it's face, but i think it's cool.

grim tangle Nov 22, 2023, 1:30 PM

#

solar willow ok i did that but still got omme. Btw how are you running SVD-XT? Through simple...

answering here too so others see: using the streamlit UI

shell plume Nov 22, 2023, 1:35 PM

#

Does anyone have any idea why, after the first second of animation, it hallucinates so much? I'm using a T 25 at 6 FPS, with the XT model?

#

Any idea?

last basin Nov 22, 2023, 1:37 PM

#

does this new model runs on a UI?

turbid sluice Nov 22, 2023, 1:40 PM

#

shell plume Does anyone have any idea why, after the first second of animation, it hallucina...

Try to raise s_noise to 1.05, 1.10 or 1.15... or something inbetween... with same seed from where you like the movement. I think that will give you sharper image. It will halucinate for sure, but I think your video is soft because of that. It is a trial and error until you get something really cool.

cosmic cobalt Nov 22, 2023, 1:51 PM

#

Note: 40GB of VRAM required -> LMAO

shell plume Nov 22, 2023, 1:53 PM

#

grim tangle answering here too so others see: using the streamlit UI

I made a Google Colab, is open to clone:
https://bit.ly/stable-difussion-video

Google Colaboratory

cosmic cobalt Nov 22, 2023, 1:53 PM

#

shell plume I made a Google Colab, is open to clone: https://bit.ly/stable-difussion-video

awsm thanks man

iron topaz Nov 22, 2023, 2:19 PM

#

last basin does this new model runs on a UI?

Yes

shell plume Nov 22, 2023, 2:19 PM

#

better

iron topaz Nov 22, 2023, 2:20 PM

#

cosmic cobalt Note: 40GB of VRAM required -> LMAO

It only needs 24 with some spesific settings

cosmic cobalt Nov 22, 2023, 2:20 PM

#

iron topaz It only needs 24 with some spesific settings

acknowledged, still way to much for my little 3060 with 12 ^^

iron topaz Nov 22, 2023, 2:21 PM

#

cosmic cobalt acknowledged, still way to much for my little 3060 with 12 ^^

Next gpu gen needs at least dubble the vram ! XD

cosmic cobalt Nov 22, 2023, 2:21 PM

#

iron topaz Next gpu gen needs at least dubble the vram ! XD

bet hahah

iron topaz Nov 22, 2023, 2:23 PM

#

I have q idea for later. U automatically grab the last frame of a video and make a new vid with it and make q long video with this 🤔

cosmic cobalt Nov 22, 2023, 2:24 PM

#

iron topaz I have q idea for later. U automatically grab the last frame of a video and make...

at least this worked with other text2video tools (pikalabs, runway)

solar willow Nov 22, 2023, 2:31 PM

#

Ok... am i correct that each seed is attached to a certain camera movement? Might be super wrong 🙂

#

or the other way... how tf the model selects the movement 🙂

unkempt mica Nov 22, 2023, 2:36 PM

#

solar willow or the other way... how tf the model selects the movement 🙂

I was wondering that too.

fallen wren Nov 22, 2023, 2:46 PM

#

iron topaz I have q idea for later. U automatically grab the last frame of a video and make...

that's a good concept but works weird in practice - the model will often not know what motion was happening in the video if you feed it only one frame, and you'll get sudden changes. If you input multiple frames at a time you can theoretically build an automatic continuation system.

fallen wren Nov 22, 2023, 2:47 PM

#

solar willow Ok... am i correct that each seed is attached to a certain camera movement? Migh...

seeds as always with diffusion strongly influence results, but there's not a 1-to-1 correlation where x seed always creates y motion in any video

#

i think if you edit some motion blur into an image that'll strongly influence what motion the model creates in a more controllable way

iron topaz Nov 22, 2023, 2:56 PM

#

fallen wren that's a good concept but works weird in practice - the model will often not kno...

I wish i would know how to imput 2 frames

fallen wren Nov 22, 2023, 2:57 PM

#

tbh a lot of the early days here is gonna be stuff for developers to play with moreso than end-users

#

normal people get to have the most fun with AI tech only after developers have figured out to how to make it work right and then build an interface around that

half hound Nov 22, 2023, 2:58 PM

#

I am trying to find the sweet spot for generations and I have a couple of questions for everyone. I put my answers on the side:

What OS are you using? Windows
What graphic card are you using? RTX 4090
What image size are you using for the image generation? 1024x1024 will be testing smaller ones now haha
What T value are you using? 48
What FPS are you using? 12
What Decode t frames at a time are you using? 24 but 48 is way faster but more unstable
How long does the generation take? 3 hours. decode t frames at 48 is good and takes about 10 mins, but crashes. Trying to find sweet spot for 48.
What errors do you run into when generating? I only get errors at the very end. Either low vram errors, Expected all tensors to be on the same device, but found at least two devices error. Its too bad I have to wait a long time before I can tell if it errored out or not.

copper lynx Nov 22, 2023, 3:24 PM

#

will SVD only run on machines with 4090 or greater? I have 3070ti and was wondering if it's even worth it to install. loving what I am seeing so far from what people are generating. thanks in advance

silent hinge Nov 22, 2023, 3:31 PM

#

Anyone running SVD with a new MacBook M3 Max? I'm attempting but having issue with Pytorch not working device=mps

half hound Nov 22, 2023, 3:35 PM

#

silent hinge Anyone running SVD with a new MacBook M3 Max? I'm attempting but having issue w...

I read that someone said that it's not possible on a mac yet.

silent hinge Nov 22, 2023, 3:36 PM

#

half hound I read that someone said that it's not possible on a mac yet.

Ooh okay — I saw cocktailpeanut (on twitter - https://twitter.com/cocktailpeanut/status/1727068314583670969) trying and got me excited

copper lynx Nov 22, 2023, 3:41 PM

#

grim tangle I also reduced the decoding_t... using only 2 now

how does one reduce this? I saw someone say look for it in the code but I dont know where

grim tangle Nov 22, 2023, 3:46 PM

#

copper lynx how does one reduce this? I saw someone say look for it in the code but I dont k...

in the streamlit ui it's the very last setting, with the command line script it's very clearly documented in side the script itself (simple_video_sample.py)

gleaming tide Nov 22, 2023, 4:47 PM

#

fallen wren that's a good concept but works weird in practice - the model will often not kno...

Was thinking about what a version of this model designed for low vram would look like
Seems like the minimum would be one that just takes in two frames and generates a single new frame from it

gleaming tide Nov 22, 2023, 4:48 PM

#

half hound I am trying to find the sweet spot for generations and I have a couple of questi...

1024x576 is the canonical resolution

fallen wren Nov 22, 2023, 4:49 PM

#

the model itself can definitely run in lower vram that the demo code has it

#

just, yknow, day 1 demo code is always built for the hardware it was trained on, not end-user/consumer tier hardware

#

(remember: SDv1 at launch required 24GiB of VRAM!)

silent hinge Nov 22, 2023, 5:02 PM

#

flint current Nov 22, 2023, 5:17 PM

#

Gen-2 / Stable diffusion video

solar willow Nov 22, 2023, 5:38 PM

#

Ooooooh my 🙂 This year the Holliday cards will be AWESOME 🙂

half hound Nov 22, 2023, 5:39 PM

#

wispy garnet Nov 22, 2023, 5:45 PM

#

SVD moves me lot!
https://x.com/o_ob/status/1727381562311012685?s=20
Original "singularity tunnel" image-to-video Demo
https://www.pixiv.net/artworks/113439831
↓
https://youtu.be/KumvEA6Wu2s

pixiv

しらいはかせ

singularity tunnel

物語のはじまりを語る上で
バックパックとマフラーを装備した
女子高生の背中が力強い事を知った

DCEXPO2023講演「クリエイティブAIとAIDXが拓く新市場 - メタバース・放送・メディアアートのその先に」シンギュラリティについての考察補足｜しらいはかせ(Hacker作家) @o_ob https://note.com/o_ob/n/n22d154730de2?sub_rt=share_pw #note

YouTube

AICU Inc.

SVD singularity tunnel

Original "singularity tunnel" image-to-video Demo
https://www.pixiv.net/artworks/113439831
↓
https://youtu.be/KumvEA6Wu2s

#SVD #StableVideoDiffusion

▶ Play video

solemn turtle Nov 22, 2023, 5:54 PM

#

shell plume I made a Google Colab, is open to clone: https://bit.ly/stable-difussion-video

thanks. it fixed my error

idle finch Nov 22, 2023, 6:03 PM

#

shell plume I made a Google Colab, is open to clone: https://bit.ly/stable-difussion-video

is the only way to run on Apple Silicon so far?

tiny cradle Nov 22, 2023, 6:09 PM

#

I've been trying to get apple silicon to work. Worked through a bunch of issues but finally hit one I think I can't work through:
RuntimeError: Conv3D is not supported on MPS
https://github.com/pytorch/pytorch/issues/77818

solemn turtle Nov 22, 2023, 6:35 PM

#

shell plume I made a Google Colab, is open to clone: https://bit.ly/stable-difussion-video

Hi, i tried your code, but it is giving error about OS. It seems like i have all your code as one cell, otherwise, it says os is undefined. Is there a way to run it in colab without offloading it to ngrok?

idle finch Nov 22, 2023, 6:47 PM

#

solemn turtle Hi, i tried your code, but it is giving error about OS. It seems like i have all...

as for me it's constant Connection error 502 while attempting of load a model

solemn turtle Nov 22, 2023, 6:54 PM

#

shell plume I made a Google Colab, is open to clone: https://bit.ly/stable-difussion-video

I'm gettting another error for your colab file. Error: Invalid value: File does not exist: video_sampling.py But I have the video_sampling.py loaded. Sometimes I think colab doesn't recognize the code before.

shell plume Nov 22, 2023, 6:54 PM

#

Hello

solemn turtle Nov 22, 2023, 6:56 PM

#

shell plume Hello

Hi!

shell plume Nov 22, 2023, 6:56 PM

#

I just updated it, can you try again? It should be fine, and separated

shell plume Nov 22, 2023, 6:56 PM

#

solemn turtle I'm gettting another error for your colab file. Error: Invalid value: File does...

There is one line, where I copy the script to the root, make sure you ran it

solemn turtle Nov 22, 2023, 6:58 PM

#

shell plume There is one line, where I copy the script to the root, make sure you ran it

"!cp '/content/generative-models/scripts/demo/video_sampling.py' '/content/generative-models/' " "!pip install -q streamlit
!pip install pyngrok " I ran it . still same error that video_sampling doesn't exist

shell plume Nov 22, 2023, 7:01 PM

#

check on the folder, there is any video-sampling.py on the root?

solemn turtle Nov 22, 2023, 7:04 PM

#

shell plume check on the folder, there is any video-sampling.py on the root?

I'm getting an error there is no such a file.

shell plume Nov 22, 2023, 7:05 PM

#

check if the file exists where is looking /content/generative-models/HERE

#

if is not there, copy it or move it there

solemn turtle Nov 22, 2023, 7:11 PM

#

shell plume check if the file exists where is looking /content/generative-models/HERE

It is not there

shell plume Nov 22, 2023, 7:12 PM

#

then the line where is copied is not working for some reason

#

maybe some of the lines didnt run correctly

#

you have the project in the folder?

solemn turtle Nov 22, 2023, 7:13 PM

#

shell plume then the line where is copied is not working for some reason

I copied /content/generative-models/HERE it is giving me error when i copy it, saying no /. so i wrote generative model, still doesn't work. Where is the actual file video sampling? /content/generative-models/HERE is just a path

wispy garnet Nov 22, 2023, 7:20 PM

#

Stable Video Diffusion - Good/Bad cases
https://youtu.be/v9DyHMmmxg4

https://twitter.com/AICUai/status/1727406576745787692

YouTube

AICU Inc.

Stable Video Diffusion - Good/Bad cases

Stable Video Diffusion - My first try at 23rd Nov 2023.

Article
https://note.com/aicu/n/n509bd1d01d91

Original Pictures
https://www.pixiv.net/users/1355931/illustrations

▶ Play video

onyx ore Nov 22, 2023, 7:22 PM

#

iron dust No spam please.

Sorry

pastel storm Nov 22, 2023, 7:29 PM

#

Has it been optimized for consumer hardware yet? I have a 3060 (12gb vram)

hazy gorge Nov 22, 2023, 7:32 PM

#

pastel storm Has it been optimized for consumer hardware yet? I have a 3060 (12gb vram)

This

hazy gorge Nov 22, 2023, 7:33 PM

#

pastel storm Has it been optimized for consumer hardware yet? I have a 3060 (12gb vram)

?

fallen wren Nov 22, 2023, 7:34 PM

#

gonna have to wait a lil more than a day for that lol

#

it'll probably be running on 3090s within a few days

#

3060s will take longer (weeks/months, not sure, definitely not soon)

tiny cradle Nov 22, 2023, 7:34 PM

#

its already running on 3090

#

i dont know an obvious way to get it down to 12gb though

#

maybe if you only generate 2 frames

fallen wren Nov 22, 2023, 7:35 PM

#

only posts of it "running" on a 3090 have been buffering all the mem to CPU and taking 3 hours, doesn't count

tiny cradle Nov 22, 2023, 7:35 PM

#

no it was running quick

fallen wren Nov 22, 2023, 7:35 PM

#

eh?

#

who's got it running at speed on a 3090 already?

drifting vortex Nov 22, 2023, 7:35 PM

#

It does takes some tries to get a good generation going, and too much dynamic a pose ends up deformed. But when it works, it does movements I haven't seen any other video generator pull off before

tiny cradle Nov 22, 2023, 7:35 PM

#

it didn't take more than lowvram = True and decoder_frames = 1

#

id have to scroll up a bit to find who it was 🙂

fallen wren Nov 22, 2023, 7:36 PM

#

oo

#

well, still. 3060 is gonna take a lot longer

tiny cradle Nov 22, 2023, 7:36 PM

#

3090 has 24gb ram so we'd expect it to work right?

#

it was @grim tangle

grim tangle Nov 22, 2023, 7:36 PM

#

Lowvram option seems to run it at fp16

solemn turtle Nov 22, 2023, 7:37 PM

#

idle finch is the only way to run on Apple Silicon so far?

dude, if you try to run it locally on a mac, it will take forever to generate a video. It takes like 20-25min on an m2pro to generate 120 frames using deforum in Automatic1111. I imagine it will be even slower since it requires more ram

grim tangle Nov 22, 2023, 7:37 PM

#

I actually got it running in comfy too, hacky way and not a proper implementation, but it works

#

25 frames at the default Res, with fp16, takes just bit under 20gb

solemn turtle Nov 22, 2023, 7:38 PM

#

grim tangle I actually got it running in comfy too, hacky way and not a proper implementatio...

Lol it is probably a lot easier to run it using comfy rather than straight up code in colab

grim tangle Nov 22, 2023, 7:38 PM

#

But I couldn't get it any lower even by reducing resolution...

tiny cradle Nov 22, 2023, 7:38 PM

#

reducing number of frames reduces memory needs

grim tangle Nov 22, 2023, 7:39 PM

#

Well easier to implement queues and stuff in comfy, can also generate the inits etc.

tiny cradle Nov 22, 2023, 7:39 PM

#

so we need to see what peak memory is at 4 frames

grim tangle Nov 22, 2023, 7:39 PM

#

Yeah frame count influences it greatly

hazy gorge Nov 22, 2023, 7:42 PM

#

Can this model be fine-tuned so that it does more specific things and works faster and with less vram?

tiny cradle Nov 22, 2023, 7:42 PM

#

they've already announced it will be finetuned to do a bazillion specific things

idle finch Nov 22, 2023, 7:45 PM

#

@shell plume everything runs great – downloading and copying files without any problems. but after I try to run it I see this in console:

VideoTransformerBlock is using checkpointing
^C

one time it have downloaded .bin file while attempted to load a model in app, but it was only once and now it keeps fail

copper lynx Nov 22, 2023, 7:46 PM

#

tiny cradle **Setup Instructions** (Python 3.10, 4090, working on Linux): - git clone the re...

I downloaded the weights. where is the checkpoints folder?

tiny cradle Nov 22, 2023, 7:46 PM

#

you make it in the root of the project

copper lynx Nov 22, 2023, 7:47 PM

#

so if I'm in Windows, in my generative-models folder I can create a folder called "checkpoints"?

tiny cradle Nov 22, 2023, 7:47 PM

#

yes. unrelated to OS

copper lynx Nov 22, 2023, 7:47 PM

#

ah, good to know. thank you. so once I create the folder called "checkpoints" I can continue with your install steps? thanks so much! have a great day 🙂 Im gonna try and get this to work on a 3070ti over the next few days

tiny cradle Nov 22, 2023, 7:48 PM

#

put the weights in the checkpoints folder yes

tiny cradle Nov 22, 2023, 7:49 PM

#

copper lynx ah, good to know. thank you. so once I create the folder called "checkpoints" I ...

I updated the instructions to include creating the checkpoints folder

copper lynx Nov 22, 2023, 7:50 PM

#

tiny cradle I updated the instructions to include creating the checkpoints folder

awesome! Im sure that will help other noobs like myself. I tried to run the streamlit and it returned this error

(.pt2) C:\Users\xxxxxxxxx\generative-models>streamlit run scripts/demo/video_sampling.py
'streamlit' is not recognized as an internal or external command,
operable program or batch file.

tiny cradle Nov 22, 2023, 7:50 PM

#

i guess pip install streamlit

#

but streamlit is already listed in the requirements file so it makes me think that step failed for you

copper lynx Nov 22, 2023, 7:51 PM

#

seems to be working now. thank you

#

installing streamlit that is.

#

Using cached smmap-5.0.1-py3-none-any.whl (24 kB)
Installing collected packages: pytz, zipp, watchdog, validators, urllib3, tzdata, typing-extensions, tornado, toolz, toml, tenacity, smmap, six, rpds-py, pygments, protobuf, pillow, packaging, numpy, mdurl, MarkupSafe, idna, colorama, charset-normalizer, certifi, cachetools, blinker, attrs, tzlocal, requests, referencing, python-dateutil, pyarrow, markdown-it-py, jinja2, importlib-metadata, gitdb, click, rich, pydeck, pandas, jsonschema-specifications, gitpython, jsonschema, altair, streamlit

idle finch Nov 22, 2023, 7:52 PM

#

solemn turtle dude, if you try to run it locally on a mac, it will take forever to generate a ...

makes sense... but still interesting how it would perform

copper lynx Nov 22, 2023, 7:52 PM

#

might have been because earlier I was having to convert the instructions from Unix based to Windows based with GPT help lol

tiny cradle Nov 22, 2023, 7:52 PM

#

i just added this step:

double check that pip install actually worked. on windows you may need to comment out xformers and triton

tiny cradle Nov 22, 2023, 7:53 PM

#

idle finch makes sense... but still interesting how it would perform

torch doesn't support the operations needed on apple silicon. so it doesn't perform at all

copper lynx Nov 22, 2023, 7:53 PM

#

tiny cradle i just added this step: - double check that pip install actually worked. on wind...

awesome. you're the man! it said it worked I think

Successfully installed MarkupSafe-2.1.3 altair-5.1.2 attrs-23.1.0 blinker-1.7.0 cachetools-5.3.2 certifi-2023.11.17 charset-normalizer-3.3.2 click-8.1.7 colorama-0.4.6 gitdb-4.0.11 gitpython-3.1.40 idna-3.4 importlib-metadata-6.8.0 jinja2-3.1.2 jsonschema-4.20.0 jsonschema-specifications-2023.11.1 markdown-it-py-3.0.0 mdurl-0.1.2 numpy-1.26.2 packaging-23.2 pandas-2.1.3 pillow-10.1.0 protobuf-4.25.1 pyarrow-14.0.1 pydeck-0.8.1b0 pygments-2.17.2 python-dateutil-2.8.2 pytz-2023.3.post1 referencing-0.31.0 requests-2.31.0 rich-13.7.0 rpds-py-0.13.1 six-1.16.0 smmap-5.0.1 streamlit-1.28.2 tenacity-8.2.3 toml-0.10.2 toolz-0.12.0 tornado-6.3.3 typing-extensions-4.8.0 tzdata-2023.3 tzlocal-5.2 urllib3-2.1.0 validators-0.22.0 watchdog-3.0.0 zipp-3.17.0

#

🫂

idle finch Nov 22, 2023, 7:54 PM

#

tiny cradle torch doesn't support the operations needed on apple silicon. so it doesn't perf...

pity

solemn turtle Nov 22, 2023, 7:54 PM

#

idle finch makes sense... but still interesting how it would perform

Just wait for Draw Things' developer to add this. It might take a bit of time.

onyx ore Nov 22, 2023, 7:56 PM

#

Hey

solemn turtle Nov 22, 2023, 7:56 PM

#

idle finch pity

did u get colab to work for stable vid dif? from ur older comments, u encountered some errors

copper lynx Nov 22, 2023, 7:57 PM

#

tiny cradle i guess pip install streamlit

when I tried to run streamlit it returned this

Traceback (most recent call last):
File "C:\Users\xxxxxxxx\generative-models.pt2\lib\site-packages\streamlit\runtime\scriptrunner\script_runner.py", line 534, in _run_script
exec(code, module.dict)
File "C:\Users\xxxxxxxxx\generative-models\scripts\demo\video_sampling.py", line 3, in <module>
from pytorch_lightning import seed_everything
ModuleNotFoundError: No module named 'pytorch_lightning'

tiny cradle Nov 22, 2023, 7:57 PM

#

yeah your pip install of the requirements file must not have worked

copper lynx Nov 22, 2023, 7:59 PM

#

tiny cradle yeah your pip install of the requirements file must not have worked

should I go back through and do the install again?

Clone the repo
git clone git@github.com:Stability-AI/generative-models.git
cd generative-models
Setting up the virtualenv
This is assuming you have navigated to the generative-models root after cloning it.

NOTE: This is tested under python3.10. For other python versions, you might encounter version conflicts.

PyTorch 2.0

install required packages from pypi

python3 -m venv .pt2
source .pt2/bin/activate
pip3 install -r requirements/pt2.txt
3. Install sgm
pip3 install .
4. Install sdata for training
pip3 install -e git+https://github.com/Stability-AI/datapipelines.git@main#egg=sdata

If not the whole thing, which parts?

tiny cradle Nov 22, 2023, 7:59 PM

#

pip3 install -r requirements/pt2.txt

copper lynx Nov 22, 2023, 8:16 PM

#

tiny cradle pip3 install -r requirements/pt2.txt

I did that. it returned this error

ERROR: Ignored the following versions that require a different python version: 0.55.2 Requires-Python <3.5; 1.6.2 Requires-Python >=3.7,<3.10; 1.6.3 Requires-Python >=3.7,<3.10; 1.7.0 Requires-Python >=3.7,<3.10; 1.7.1 Requires-Python >=3.7,<3.10
ERROR: Could not find a version that satisfies the requirement triton==2.0.0 (from versions: none)
ERROR: No matching distribution found for triton==2.0.0

tiny cradle Nov 22, 2023, 8:18 PM

#

copper lynx I did that. it returned this error ERROR: Ignored the following versions that r...

yes likely you need to comment triton and xformers out of the requirements file

idle finch Nov 22, 2023, 8:19 PM

#

solemn turtle did u get colab to work for stable vid dif? from ur older comments, u encountere...

it keeps disconnected for some reason, I dunno why

tired egret Nov 22, 2023, 8:22 PM

#

copper lynx Nov 22, 2023, 8:30 PM

#

tiny cradle i just added this step: - double check that pip install actually worked. on wind...

ah I see now., sorry I missed this.

requirements pt2 or pt13? and in commenting out do I just delete those rows and save or is there another process?

tiny cradle Nov 22, 2023, 8:30 PM

#

pt2

#

a # at the start of the line comments it out

copper lynx Nov 22, 2023, 8:31 PM

#

ok so save that pt2 file after commenting out and try to install the requirements again?

tiny cradle Nov 22, 2023, 8:32 PM

#

yes

half hound Nov 22, 2023, 8:35 PM

#

Hi all, I made instructions on how to install stable Video Diffusion on windows. Here is the text:

#

Setup Instructions (Python 3.10.11, 4090, working on Windows):
Go to user directory
right click git bash
git clone https://github.com/Stability-AI/generative-models.git

-modify streamlit_helpers.py
lowvram_mode = True

move video_sampling.py file to main dir
create a checkpoints folder in the main dir
download the SVD weights from https://huggingface.co/stabilityai/stable-video-diffusion-img2vid/tree/main
(optional) donwload SVD-XT weights from https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt/tree/main

-modify requirements/pt2.txt file
remove triton==2.0.0 line and save

-modify requirements/pt13.txt file
remove triton==2.0.0.post1 line and save

Open Anaconda
cd to user/generative-models
conda create -n genModelVideo python=3.10.11
conda activate genModelVideo

pip install https://huggingface.co/r4ziel/xformers_pre_built/resolve/main/triton-2.0.0-cp310-cp310-win_amd64.whl
pip install -r requirements/pt2.txt
pip install .
pip install -r requirements/pt13.txt

streamlit run video_sampling.py

click "Load Model"

upload image and there you go.

Will get a tensor error but you can ignore it. Still seems to work

*try 48 decode t frames for faster generation

#

and here's a video explaining it

#

https://youtu.be/HMW9hVoQa0M?si=92mPiYjrSVrzD40f

YouTube

My Why AI

Stable Video Diffusion Install

Setup Instructions (Python 3.10.11, 4090, working on Windows): https://pastebin.com/YpqNSHFy

Requirements:

A good GPU. RTX 3090, RTX 4090

Anaconda

Git

Generative-Models github

SVD or SVD_XT

Download Links:

Anaconda: https://www.anaconda.com/download

Git: https://git-scm.com/downloads

Generative-Models Github: https://github.com/Stabili...

▶ Play video

tiny cradle Nov 22, 2023, 8:36 PM

#

half hound https://youtu.be/HMW9hVoQa0M?si=92mPiYjrSVrzD40f

dont install both pt13 and pt2

half hound Nov 22, 2023, 8:36 PM

#

why?

tiny cradle Nov 22, 2023, 8:37 PM

#

makes no sense. you're supposed to pick whether you're installing pytorch 1.3 or 2.0

#

and svd only works with 2.0 i believe

rigid orchid Nov 22, 2023, 8:37 PM

#

has anyone tried to run this on CPU? it seems to be bypassing the GPU memory but I bet it will output slower

half hound Nov 22, 2023, 8:39 PM

#

tiny cradle and svd only works with 2.0 i believe

I'll test it out

#

I did run svd and pip install pt13 last and it generated a video

#

trying svd_xt right now

tiny cradle Nov 22, 2023, 8:41 PM

#

half hound trying svd_xt right now

well who knows what version of torch you had installed since you tried to install both

half hound Nov 22, 2023, 8:42 PM

#

ok i'll take a look which version I have installed in my venv

#

I tried the svd_xt and it also generated a video

copper lynx Nov 22, 2023, 8:50 PM

#

okay i installed all requirements without any errors.

when I ran streamlit it returned this

C:\Users\xxx\AppData\Local\Programs\Python\Python310\lib\site-packages\torchaudio\backend\utils.py:74: UserWarning: No audio backend is available.
warnings.warn("No audio backend is available.")
2023-11-22 12:49:08.964 Uncaught app exception
Traceback (most recent call last):
File "C:\Users\xxx\AppData\Local\Programs\Python\Python310\lib\site-packages\streamlit\runtime\scriptrunner\script_runner.py", line 534, in _run_script
exec(code, module.dict)
File "C:\Users\xxx\generative-models\scripts\demo\video_sampling.py", line 5, in <module>
from scripts.demo.streamlit_helpers import *
ModuleNotFoundError: No module named 'scripts'

half hound Nov 22, 2023, 8:50 PM

#

don't know if i'll run into issues later though everything seems to be working.

tiny cradle Nov 22, 2023, 8:51 PM

#

half hound I tried the svd_xt and it also generated a video

that is surprising. but either way, installing torch 2.0, only to unisntall it and then install 1.13 doesnt make sense

jade carbon Nov 22, 2023, 8:51 PM

#

I believe when you go to install 1.13, it ignores packages that are already installed.

#

Therefore, any matching dependencies are just skipped

half hound Nov 22, 2023, 8:52 PM

#

yeah I agree. I was running into lots of issues trying to install it so I was trying anything to get it to work lol

tiny cradle Nov 22, 2023, 8:52 PM

#

copper lynx okay i installed all requirements without any errors. when I ran streamlit it r...

now this one, i'd love to get to the bottom of this. everyone on windows runs into this

jade carbon Nov 22, 2023, 8:52 PM

#

Could cause issues with torch though, so I usually manually install this after to match my CUDA version

jade carbon Nov 22, 2023, 8:52 PM

#

copper lynx okay i installed all requirements without any errors. when I ran streamlit it r...

Move the script to the parent folder

#

That would be generative-models

tiny cradle Nov 22, 2023, 8:52 PM

#

copper lynx okay i installed all requirements without any errors. when I ran streamlit it r...

people have fixed it by copy pasting the video_sampling script to the project root. but that shouldn't be necessary. not sure whats going on

half hound Nov 22, 2023, 8:53 PM

#

I had to remove the triton from the requirements

#

and then pip install https://huggingface.co/r4ziel/xformers_pre_built/resolve/main/triton-2.0.0-cp310-cp310-win_amd64.whl

#

to get it to work

#

on windows

rigid orchid Nov 22, 2023, 8:53 PM

#

doesn't work with cpu because some of the layers only support cuda

jade carbon Nov 22, 2023, 8:54 PM

#

It has something to do with python path, though, even manually setting it to the folder, I'm not sure why it doesn't find the scripts folder

tiny cradle Nov 22, 2023, 8:54 PM

#

half hound and then pip install https://huggingface.co/r4ziel/xformers_pre_built/resolve/ma...

i suspect that if you use torch 2.1 then both triton and xformers are not needed

half hound Nov 22, 2023, 8:55 PM

#

No I tried last night. I tried to just do the install with just pt2

#

ran into errors then had to manually do pip installs

#

then just got into a loop of errors

#

then I went back and looked at any errors while I was installing it

tiny cradle Nov 22, 2023, 8:56 PM

#

in my custom fork i'm not using xformers but maybe I made some tweaks to get that working

half hound Nov 22, 2023, 8:56 PM

#

and triton gave me an error and I think it skipped the rest of the requirement installs

#

I think that was the issue

rigid orchid Nov 22, 2023, 9:05 PM

#

copper lynx okay i installed all requirements without any errors. when I ran streamlit it r...

PYTHONPATH=<<directory to git root>> streamlit . . .

solar willow Nov 22, 2023, 9:05 PM

#

Ok. After i generated 2394 nsfw animations i tried architecture... THIS IS IT.... 10x10. No cherrypicking.

half hound Nov 22, 2023, 9:06 PM

#

Looks good 👍

zenith spoke Nov 22, 2023, 9:11 PM

#

fallen wren (remember: SDv1 at launch required 24GiB of VRAM!)

wonder if i should bother with it using 6700xt at 12vram

fossil vine Nov 22, 2023, 9:11 PM

#

Stunningly beautiful oak tree, on the edge of a forest, in the foreground there is grass blowing in a gentle breeze, the tree is in the middle ground, summer, in the background a gently rising hill

grim tangle Nov 22, 2023, 9:12 PM

#

for anyone using Comfy and feeling brave, here's my very VERY early node to run SVD in Comfy:
https://github.com/kijai/ComfyUI-SVD

#

#

half hound Nov 22, 2023, 9:14 PM

#

grim tangle for anyone using Comfy and feeling brave, here's my very VERY early node to run ...

ok I'll try it

grim tangle Nov 22, 2023, 9:16 PM

#

still need to add rest of the settings and better memory management to allow larger workflows around it

tiny cradle Nov 22, 2023, 9:18 PM

#

grim tangle for anyone using Comfy and feeling brave, here's my very VERY early node to run ...

impressive

half hound Nov 22, 2023, 9:19 PM

#

how do I get the install via git url

#

my manager doesn't have it

zenith spoke Nov 22, 2023, 9:20 PM

#

latest comfyui?

#

i had it gone once but had to update comfyui and manager, dunno what fixed it really

half hound Nov 22, 2023, 9:20 PM

#

ok

#

i just updated it

#

ok yep

#

that was it, just need to restart it

#

can I drag and drop this workflow?

solar willow Nov 22, 2023, 9:29 PM

#

solar willow Ok. After i generated 2394 nsfw animations i tried architecture... THIS IS IT......

For me we entered different times... I feel so empowered. 10000000000000000% power up

copper lynx Nov 22, 2023, 9:34 PM

#

jade carbon Move the script to the parent folder

the video_sampling.py script?

copper lynx Nov 22, 2023, 9:35 PM

#

rigid orchid PYTHONPATH=<<directory to git root>> streamlit . . .

ELI5

gray sleet Nov 22, 2023, 9:40 PM

#

shell plume I created a Google Colab to test it, and it's working quite well with an A100. I...

i clickek on this, but i'm so dumb :((( haha i don't know if there's a elevated soul that could send the final method for using the model, installing, thanks ❤️

copper lynx Nov 22, 2023, 9:44 PM

#

half hound Hi all, I made instructions on how to install stable Video Diffusion on windows....

what is Anaconda? is that something i need to install?

#

pip install Anaconda? sorry im such a noob

half hound Nov 22, 2023, 9:48 PM

#

https://www.anaconda.com/download

Anaconda

Anaconda Team

Free Download | Anaconda

Anaconda's open-source Distribution is the easiest way to perform Python/R data science and machine learning on a single machine.

#

need to download it here

#

you also need git

#

https://git-scm.com/downloads

golden oyster Nov 22, 2023, 9:53 PM

#

What are the requirements for my pc to run this smoothly

half hound Nov 22, 2023, 9:54 PM

#

works on my rtx 4090 I've read other people where able to run it with rtx 3090

golden oyster Nov 22, 2023, 9:55 PM

#

I have rtx 3080ti is it enough? What about ram and storage

copper lynx Nov 22, 2023, 9:56 PM

#

half hound you also need git

I have Git. DLing Anaconda now. thank you

copper lynx Nov 22, 2023, 9:56 PM

#

half hound works on my rtx 4090 I've read other people where able to run it with rtx 3090

I need someone to optimize it for 3070ti and I will jump through my ceiling

half hound Nov 22, 2023, 10:02 PM

#

how much vram do you have? You might as well install and test it out. You can decrease the decode t frames

half hound Nov 22, 2023, 10:06 PM

#

golden oyster I have rtx 3080ti is it enough? What about ram and storage

might be not sure, just try the install. I read that it offloads ram when GPU memory is all used up, but it doesn't work on my windows machine I got an error, so it's best just to use the VRAM from your GPU

grim tangle Nov 22, 2023, 10:09 PM

#

half hound can I drag and drop this workflow?

the workflow is just an example, it's just the one node, I could add workflow with metadata tho

#

nice!

sterile mesa Nov 22, 2023, 10:24 PM

#

hey @shell plume , curious why you !pip install what you did in your collab:

!pip install -r requirements/pt2.txt
!pip install .
!pip install -e git+https://github.com/Stability-AI/datapipelines.git@main#egg=sdata

Wasn't clear on why the . and datapipelines.git?

iron topaz Nov 22, 2023, 10:40 PM

#

@tiny cradle is it theoratically possible to add a controllnet? i have hered that its baised on sd 2.1 is that true?

tiny cradle Nov 22, 2023, 10:41 PM

#

iron topaz <@613395195031191585> is it theoratically possible to add a controllnet? i have ...

i doubt that existing controlnets can be applied but who knows

tiny cradle Nov 22, 2023, 10:41 PM

#

iron topaz <@613395195031191585> is it theoratically possible to add a controllnet? i have ...

certainly new ones will be made that apply

iron topaz Nov 22, 2023, 10:42 PM

#

tiny cradle certainly new ones will be made that apply

yes i hope that too.

fallen wren Nov 22, 2023, 11:01 PM

#

zenith spoke wonder if i should bother with it using 6700xt at 12vram

video, on low-vram AMD -- uh, yeah, no, not right now. Wait for a wait for code upgrades n stuff to come out

zenith spoke Nov 22, 2023, 11:02 PM

#

fallen wren video, on low-vram AMD -- uh, yeah, no, not right now. Wait for a wait for code ...

yeah ig... animatediff is not working for me, only deforum does

fair otter Nov 22, 2023, 11:36 PM

#

grim tangle

how are you getting an h265/video output in VHS?

urban linden Nov 22, 2023, 11:36 PM

#

grim tangle for anyone using Comfy and feeling brave, here's my very VERY early node to run ...

Thanks mate! I suck at comfy, is there any way I can import your workflow in?

grim tangle Nov 22, 2023, 11:37 PM

#

fair otter how are you getting an h265/video output in VHS?

Doesn't it have it by default?

grim tangle Nov 22, 2023, 11:38 PM

#

urban linden Thanks mate! I suck at comfy, is there any way I can import your workflow in?

I'd need to make a better shareable one, this is just one node and I don't know yet the best way to use it, basically you can take any default comfy workflow and stick the node in

urban linden Nov 22, 2023, 11:39 PM

#

grim tangle I'd need to make a better shareable one, this is just one node and I don't know ...

I will try! Thanks!

grim tangle Nov 22, 2023, 11:41 PM

#

Video helper suite (VHS) has good nodes to make it into a video

urban linden Nov 22, 2023, 11:41 PM

#

grim tangle I'd need to make a better shareable one, this is just one node and I don't know ...

So I basically stick it to the very end of my workflow?

grim tangle Nov 22, 2023, 11:42 PM