#▶|stable-video-diffusion

onyx ore
#

YAYY I'm first

glass helm
#

!!!!

hidden temple
#

hi

jade vessel
trim bone
#

Yahoo

storm kettle
#

whats up

iron dust
onyx ore
robust lagoon
#

lol noice

robust lagoon
#

lmao

grave mantle
humble iris
#

40GB VRAM required? xD

silent hinge
#

gib vram

jade vessel
#

12kb vram required

glossy trellis
onyx ore
forest jay
#

Where am I going to get 40GB of VRAM

humble iris
gentle nest
#

how many 4090s is that?

onyx ore
#

Like I do for hugging face spaces

quartz bolt
#

404

silent hinge
thin sand
#

o

silent hinge
flat crystal
silent hinge
#

i got one

mortal crypt
#

Welcome in!

gentle nest
ebon salmon
wide lava
#

i dont have 2 4090s 😢

thin sand
#

woah, i love this!

wide lava
#

i only have one 4070ti

thin sand
mighty sentinel
#

40GB vram bruh

copper berry
#

run it on your cpu xdd

humble iris
wide lava
#

yup 40gb vram

flat crystal
thin sand
flat crystal
#

1x 4090 = 24GB VRAM

zenith spoke
#

me with my 6700xt.. wonder if it is even worth trying

wraith vale
#

🤔 hm.... I WANT THIS!

valid trail
#

the vram from 2 gpus won't add up afaik

iron dust
thin sand
#

i got an idea! we throw away 4/5 of all the weights in the model!!!

#

i am a genius

onyx ore
thin sand
wraith vale
humble iris
iron dust
valid trail
#

mods bot account

silent hinge
#

40 GBS OF VRAM BRUHH

zenith spoke
ebon salmon
left geode
#

ML?

silent hinge
#

you need rtx 6000 ada generation gpu for this video ai

hollow wren
ebon salmon
humble iris
silent hinge
rustic hinge
#

can this run across two GPUs?

flat crystal
humble iris
ebon salmon
knotty stream
valid trail
humble iris
silent hinge
floral chasm
#

Stability.AI should release their own GPU lineup with their models at this point

keen knot
#

how to use this

thorny charm
#

We'll probably see an optimized version of the model soon, able to run on 3090s at least

silent hinge
#

I wonder how good it is

iron dust
tepid stream
humble iris
copper berry
#

the pipeline can't finish xdd

silent hinge
flat crystal
ebon salmon
gentle nest
#

21,000€

keen hazel
#

Damn

#

A car

left geode
#

guess I gotta wait for the 24gb patch

keen hazel
#

Or some ai videos

humble iris
rustic hinge
modest coyote
#

40GB VRAM for inference, rip fine tuning already

iron dust
thin sand
keen hazel
#

12gb so it can run in Google colab

ebon salmon
supple peak
ebon salmon
#

Result quality might vary though

tepid stream
#

You can still settle with tools that will do animations with less VRAM cheems

ebon salmon
silent hinge
#

I wanna see how good this one is

thin sand
flat crystal
silent hinge
#

maybe in the future we will get ai so good it will replace animators

supple peak
modest coyote
#

all you need is runpod, vastai, bananadev or something like that

keen hazel
thin sand
#

lol

prime echo
#

lets go we're about to get some quality gifs up here

hollow wren
prime echo
#

and videos of course

marble folio
#

@ebon salmon I see you keep on top of the news

prime echo
ebon salmon
supple peak
marble folio
iron dust
# silent hinge I wanna see how good this one is

There are some examples available on the website if you'd like to give them a look! https://stability.ai/news/stable-video-diffusion-open-ai-video-model

(Here's a video too!)

keen hazel
peak shore
#

oh so you need 40GB VRAM? rip

hollow wren
ebon salmon
#

And nowhere near enough either depending on what you do ^^'

flat crystal
haughty garden
#

If I save money on a Ferrari is it still a deal?

wide lava
#

thats amazing

ebon salmon
teal vigil
flat crystal
#

Gotcha

#

I cannot for the life of me find the 80GB models available anywhere online..

humble iris
#

but i can't see what the difference is between this sd video and the animated diffusion node?

modest coyote
silent hinge
#

I will compare same prompts in runway now

wide lava
#

i might be able to convince our media production to get 80gb vram.

flat crystal
#

Do they have that on that website?

rustic hinge
#

does anyone know how to run this and if it'll work across two GPUs?

royal mauve
#

Yoh, so how do we run this model ? Not obvious

floral chasm
supple peak
#

get 500gb and run the trillion param LLM

hollow wren
wide lava
humble iris
modest coyote
rustic hinge
#

where even is the documentation?

flat crystal
silent hinge
ebon salmon
gilded reef
#

Any idea of speeds for the 14/25 frame versions on a "budget" GPU that can handle this like an a6000?

modest coyote
#

your move, openai

flat crystal
hollow wren
rustic hinge
iron dust
royal mauve
#

How do we run the model ? Can't see any code

hollow wren
gilded reef
silent hinge
iron dust
modest coyote
#

what does this mean

#

it just takes an image as input and thats it?

iron dust
# modest coyote what does this mean

The model being released for research is an image-to-video model. The text-to-video portion will be included in the web interface to be released soontm

keen hazel
knotty stream
trail quiver
wraith vale
teal vigil
#

Is that how pizza is made?

knotty stream
empty oar
teal vigil
#

Neat

knotty stream
#

Maybe pizza ovens are already running video models

silent hinge
ebon salmon
#

I'm curious to test this and see how it differs with AnimateDiff

humble iris
ebon salmon
#

Motion is definitely better, and from the examples I've seen there are fewer/no weird limbs suddenly appearing and disappearing

silent hinge
hexed oak
#

It's been 15 minutes and we still lack a video of will smith eating spaghetti. come on people.

silent hinge
#

im also gonna test pikalabs to see

humble iris
silent hinge
shadow osprey
ebon salmon
#

I can already see animation makers raging 🤭

#

"They're taking our jobs!"

silent hinge
#

soon in the future u can make a 20 min animation in 10 minutes 😆

haughty garden
#

Next stop... 3D rendering and destructible environments in games.

viral wren
#

Apple M chip should work😁

ebon salmon
cold mulch
silent hinge
#

will smith isnt eating spaghetti

ebon salmon
shadow osprey
silent hinge
#

3090 also has 24 GB of VRAM, that's a lot

ebon salmon
silent hinge
#

like cpu and gpu bottleneck

hexed oak
#

will smith is eating the spacetime continuum

ebon salmon
#

Even if A100s have 80gb

ebon salmon
silent hinge
ebon salmon
#

You can even rent H100s nowadays

wraith vale
ebon salmon
haughty garden
#

Well, with DaVinci Resolve (for example) you can render on CPU (free version) or GPU (paid version) at up to 32k resolution. A 4090 does it just fine.

silent hinge
#

wait

#

4 x 2 = 100

#

96 dollars a day

shadow osprey
silent hinge
#

96 x 30

median helm
#

69$ per frame

shadow osprey
silent hinge
#

2880 dollars a month

ebon salmon
silent hinge
#

2880 x 12

#

34k

#

34 560

#

so basically every year 34 560 I think
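
The running total above can be checked with a few lines of Python. This is only a sketch: the 4 USD/hour rate is an assumption inferred from the "96 dollars a day" figure (96 / 24) and is not stated in the chat, and real rental prices vary by provider.

```python
# Rough cost of renting a GPU around the clock.
# ASSUMPTION: 4 USD/hour, inferred from "96 dollars a day" (96 / 24).
HOURLY_RATE_USD = 4
HOURS_PER_DAY = 24
DAYS_PER_MONTH = 30    # the chat uses a flat 30-day month
MONTHS_PER_YEAR = 12


def rental_cost(hourly_rate):
    """Return the daily, monthly and yearly cost for 24/7 rental."""
    daily = hourly_rate * HOURS_PER_DAY
    monthly = daily * DAYS_PER_MONTH
    yearly = monthly * MONTHS_PER_YEAR
    return {"daily": daily, "monthly": monthly, "yearly": yearly}


costs = rental_cost(HOURLY_RATE_USD)
print(costs)  # {'daily': 96, 'monthly': 2880, 'yearly': 34560}
```

Using a flat 30-day month slightly understates a 365-day year, which fits the "give or take" that follows.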

lean gate
shadow osprey
ebon salmon
silent hinge
median helm
#

give or take

ebon salmon
haughty garden
silent hinge
#

artists lost their job now animators

#

and soon teachers

#

and pretty much everyone will lose their jobs

haughty garden
#

Nonsense.

#

Stop the silliness.

ebon salmon
steady field
#

there is still a major controllability and editability problem in AI

steady field
#

editing generative images and videos still requires expert skillsets

ebon salmon
flint obsidian
ebon salmon
steady field
#

it will come with time, but it definitely lags behind

ebon salmon
#

Even if yes it still requires quite a bit of expertise to get the most out of it

lean gate
#

that's why in AI animation you need a lot of VRAM

silent hinge
#

isnt microsoft developing chatgpt v with sdxl

lean gate
#

you need to fit the model for inference into ram

silent hinge
#

or developing sdxl with chatgpt v

ebon salmon
rapid rapids
#

So I'm reading the paper for SVD and am confused, can anyone explain to me how many parameters it has? The paper says it's 1521M, but the checkpoint size is 9 gigabytes. Did they ship the optimizer state there as well?
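
A quick back-of-the-envelope check of that question, as a sketch only. It takes the 1521M figure quoted from the paper and multiplies by the bytes each parameter needs at common precisions. At fp32 that comes to roughly 6 GB, so a 9 GB file presumably contains more than those parameters. One plausible explanation, which is a guess and not confirmed in this chat, is that other sub-models are bundled in the same checkpoint.

```python
# Raw weight size for a given parameter count at different precisions.
PARAMS = 1_521_000_000  # 1521M, the figure quoted from the paper

# Bytes needed to store one parameter.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "fp8": 1}


def weights_size_gb(num_params, precision):
    """Size of the raw weights in gigabytes (1 GB = 10**9 bytes)."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


for precision in BYTES_PER_PARAM:
    print(f"{precision}: {weights_size_gb(PARAMS, precision):.2f} GB")
```

This counts stored weights only. Memory used during inference is higher because activations and intermediate buffers also have to fit.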

shadow osprey
#

It's coming sooner or later

haughty garden
#

Stop the silliness.

eager sandal
#

has anyone tried whether this can run in a 24GB VRAM environment?

lean gate
#

for now

eager sandal
#

by lowering it to fp8 or fp16 to reduce VRAM consumption

ebon salmon
shut narwhal
#

Hey! So the Stable video diffusion isnt available yet right ?

#

Really stoked to see what it uses, if it uses some AnimateDiff, Deforum, IP-Adapters, LCM or if it's fully developed by Stability 👀

tight fossil
#

40gb for local inference

#

someone pin that

#

this question is gonna come up 300 million times in this chat

lean gate
knotty stream
#

it is in research preview

shut narwhal
eager sandal
#

it is in announcement

lean gate
eager sandal
#

but my focus here is whether it can run inference in fp8/fp16 without losing too much quality

#

as it is fp32 model

#

which is why it is costing so much vram

lean gate
#

it will get optimized, it'll work on consumer GPUs down the road

shut narwhal
#

Oh wow xD, it requires so many GPUs ahahah

silk jolt
shut narwhal
silk jolt
#

Can you use multigpu for inference?

outer trail
#

Saying, "this will be on the web" isn't addressing some people's concerns, such as will this ever be runnable locally on consumer hardware.

ebon salmon
iron dust
#

Anyone who doesn't have a spare 40GB of VRAM sitting on their shelf collecting dust should keep an eye out for the web experience coming soontm

lean gate
lean gate
#

let's not be pessimistic, these tools are amazing

iron dust
knotty stream
#

Yes, all these tools help a lot in the creative process

ebon salmon
iron dust
echo yew
#

To me, the tools that will succeed the most in production settings are those that can use input images and work on top of them without destroying the intent of the artist. E.g. someone sketches a rough tree and adds the lighting direction and temperature, and the AI output resembles the original sketch but is rendered out faster.

outer trail
#

Unspinned answer sounds like no. 🙂

knotty stream
#

There is space for everyone, for those who want to continue creating as they always have and for those who are interested in testing new paths

#

and the ones who will mix everything too

echo yew
#

Text to image isn't as usable in every production setting, it's fine for things that are unimportant or as a starting point that artists have to work on top of afterwards

silk jolt
#

2 x 3090?

eager sandal
#

i do believe that by reducing the model to fp8 inference it would very likely run in a 24GB VRAM environment

#

but the only issue is whether the characteristics of the learnt data would be lost when it's running inference in lower precision

rapid rapids
iron dust
unkempt mica
#

who wants to go in on an A100 with me?

eager sandal
#

without much quality loss

unkempt mica
#

I will put in $20

eager sandal
#

but idk if thats applicable here

lean gate
eager sandal
distant epoch
eager sandal
#

its in the dev branch in A1111

#

fp8test branch by kohaku

distant epoch
#

Alternatively, I can go on Tiktok live and start ebegging

short gulch
#

This is an image to video model not a text to vid right? the subreddit is saying image to vid

iron dust
unkempt mica
distant epoch
#

I don't know why, but on the website the example clips looks like a Robot 'post nut', now stuck in a crisis.

keen hazel
winged raft
sturdy reef
#

I'm sure it's been asked, but how do I run the model locally? automatic?

olive wave
#

arg, why did they use SVD as the abbreviation 🤦‍♂️

grim tangle
drifting glen
unborn acorn
#

Where are the ComfyUI nodes for svd_xt models? Models don't work in AnimateDiff loader.

glossy plover
#

what is the difference between the xt model to the non-xt model?

iron dust
ebon salmon
#

At how many FPS? 8 like AnimateDiff?

#

(just wanted to mention for those that might find it's not enough: there are very good frame interpolation tools to transform 8fps into 64fps or more if you want, like RIFE)

lean gate
#

13 to 30 fps

#

AnimateDiff has a 16 frame context, you can combine it to any fps you like (via ComfyUI, don't know about auto1111)

unborn acorn
#

How to use the models?

lean gate
#

it's mostly for research purposes right now, if your GPU can run it you can test it locally (you need at least 40GB vram)

obtuse bridge
#

The autoencoder for the video model is the same one as 2.1 isn't it? I'm going to end up testing it later to see if the temporal bits to the decoder work on other video models like AnimateDiff

grim tangle
keen hazel
#

Damn

sullen flower
#

go into scripts/demo/streamlit_helpers.py and enable lowvram_mode = True for model offloading and it will run on 24gb

#

set frame decoding to 5 in the UI as well

lean gate
#

but is it really usable like that?

#

there'll be optimizations down the road

sullen flower
#

3090

glossy plover
sullen flower
#

the streamlit demo in the repo

nocturne star
#

curious if the multi-view synthesis finetune of the video model will be available at some point as well, that's far more interesting to me than the actual image to video, might be good for creating interesting 3d models or alphas

glossy plover
half hound
#

will this get support for RTX 4090 cards?

sullen flower
#

ehh, did not have that error

tiny cradle
sullen flower
#

where its lowvram_mode = False now

tiny cradle
#

ah I see I misread which file - thank you

glossy plover
#

now I'm getting ModuleNotFoundError: No module named 'scripts.demo'

#

I managed to fix the invisible watermark thing

subtle agate
# sullen flower

nice, can you upload a scene with a human face in it? it's a crucial weak point in most of the competitors, and it would be nice to see a raw generation from the model

spring cedar
#

can we have a room for just videos?

tiny cradle
#

Setup Instructions (Python 3.10, 4090, working on Linux):

  • git clone the repo
    • git clone git@github.com:Stability-AI/generative-models.git
    • cd generative-models
  • pip install -r requirements/pt2.txt
  • double check that pip install actually worked. on windows you may need to comment out xformers and triton
  • pip install .
  • modify streamlit_helpers.py lowvram_mode = True
  • create a checkpoints folder in the root folder of the project
  • download the weights from https://huggingface.co/stabilityai/stable-video-diffusion-img2vid/tree/main to checkpoints folder
  • streamlit run scripts/demo/video_sampling.py
  • set "Decode t frames at a time" to 2 or lower
  • click "Load Model"
  • upload image and go
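
The "modify streamlit_helpers.py lowvram_mode = True" step can also be scripted instead of edited by hand. The sketch below assumes the file contains a line reading lowvram_mode = False, as mentioned elsewhere in this chat; the helper name enable_lowvram and the example text are made up for illustration.

```python
import re


def enable_lowvram(source):
    """Return `source` with `lowvram_mode = False` switched to True."""
    pattern = r"^(\s*lowvram_mode\s*=\s*)False\b"
    return re.sub(pattern, r"\g<1>True", source, flags=re.MULTILINE)


# Hypothetical excerpt; the real file contains much more than this.
example = "import torch\nlowvram_mode = False\n"
print(enable_lowvram(example))
```

To apply it to the real file, read scripts/demo/streamlit_helpers.py as text, pass the contents through enable_lowvram, and write the result back. Text that already says True is left unchanged.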
sullen flower
subtle agate
glossy plover
tiny cradle
#

trying on m1 right now. lets see

glossy plover
#

ok, I bypassed the thing that wanted Triton. now I'm getting:
C:\Users\joker\OneDrive\Desktop\A.I\generative-models\venv\lib\site-packages\torchaudio\backend\utils.py:74: UserWarning: No audio backend is available.
  warnings.warn("No audio backend is available.")
2023-11-21 23:36:09.160 Uncaught app exception
Traceback (most recent call last):
  File "C:\Users\joker\OneDrive\Desktop\A.I\generative-models\venv\lib\site-packages\streamlit\runtime\scriptrunner\script_runner.py", line 534, in _run_script
    exec(code, module.__dict__)
  File "C:\Users\joker\OneDrive\Desktop\A.I\generative-models\scripts\demo\video_sampling.py", line 5, in <module>
    from scripts.demo.streamlit_helpers import *
ModuleNotFoundError: No module named 'scripts'

tiny cradle
#

its hard coded for cuda. I suspect the code can be made to run on m1

File "/Users/bryce/.pyenv/versions/3.10.13/envs/gen-mdls-3.10.13/lib/python3.10/site-packages/torch/cuda/__init__.py", line 239, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
AssertionError: Torch not compiled with CUDA enabled
sullen flower
#

you need to pip install . probably

glossy plover
half hound
#

how do you install?

sullen flower
#

pretty sure that fixed it for me though i did it with the -e flag, but not sure that should make any difference

tiny cradle
glossy plover
tiny cradle
#

run your commands from the root of the project

glossy plover
#

I changed the requirements file, continued as normal and it still errored out, a completely different error though

sullen flower
#

i ran mine in wsl because of triton

glossy plover
tiny cradle
#

not sure if thats how env vars work on windows. apologies if its wrong 🙂

quartz crescent
#

pip install .

glossy plover
quartz crescent
#

oh weird, am on win & that fixed it for me

tiny cradle
#

i can only get 8 frames on a 4090 but I see some opportunities for better memory usage

glossy plover
#

yeah, still getting ModuleNotFoundError: No module named 'scripts'

#

that's after the UI began loading itself in browser though

grim tangle
glossy plover
half hound
quartz viper
#

Congrats to Stability team for releasing a video model! GO JOE!

iron topaz
#

now we just need to force gpu manufacturers to solve the vram problem

quartz viper
#

also, maybe we can not call it SVD as that is kind of homophonic with another initialism

quartz viper
iron topaz
#

i have heard you can mod gpus to double the vram by swapping the ram chips

quartz viper
#

thats right. but we use gaming cards to do it because gaming cards are priced for consumers. so we need to encourage developers to make games even more unoptimized and demanding on vram

#

start demanding 4k specular and normal maps

grim tangle
iron topaz
#

ram is so inexpensive in comparison, it's just sad that that's the limiting factor

quartz viper
#

since the bios already has 16gb code

iron topaz
#

they should use uncompressed 8k textures for everything

quartz viper
#

i don't think we'll get LLMs running locally in our games. Games that use those will tie into openai's api or whatever exists since that's busy self immolating right now

iron topaz
#

another option is games that use ai features, so nvidia needs to add more vram for that

quartz viper
iron topaz
#

but you are probably right 😦

quartz viper
#

Microsoft just bought Activision. do you think things will get better?

iron topaz
#

i hope the server costs are so extreme and the users so unpredictable that it will not work

quartz viper
#

You think GTA6 trailer is going to show off client side ai? lol

glossy plover
#

gaming cards were 4-24gb for about a decade now. at this point it probably won't cost NVIDIA more to bump that up to higher VRAMs

iron topaz
#

server costs are not free. and game companies that run an llm 24/7 will notice that soon if they try to do that

quartz viper
iron topaz
#

even open ai seems to make a loss

quartz viper
lean gate
quartz viper
#

their server costs are basically donated by microsoft

iron topaz
quartz viper
quartz viper
#

Microsoft, the games publisher, who has legions of servers

iron topaz
glossy plover
#

also NVIDIA keep acknowledging the existence of the many users of their GPUs to run AI locally. they literally made the last few drivers specifically for Stable Diffusion. there could be a good chance that they'll bump up VRAM in the future

quartz viper
#

i think corporate strategy is very much going to try to price home local AI out and consolidate it all locally. Thats why we need stability. They're the only major players releasing this stuff

quartz viper
#

Runway ML sure didn't take up the torch. Stability got an open model released before they ever put gen2 out

iron topaz
#

stabillety ai gpu when 👀

quartz viper
#

another company making GPUs would still have to rely on the same silicon foundries that all the processor makers do

#

microsoft making a processor design now even. everyone's getting into it

#

i'm hoping intel's new ML focused instruction sets will bring the kind of speed GPUs benefit from to the CPU. Then we just need to stick a new dimm into an open bank

#

Tried and true scaling form

jade oar
#

what the hell those results all look insane

half hound
#

what directory are you supposed to put the safetensor files?

jade oar
#

the temporal coherence here is much much better just looking at it, no?

jade oar
timid storm
lean gate
iron topaz
#

when i install it do i need to install the pt2 requirements or the pt13 requirements or something different?

tiny cradle
grim tangle
chilly nymph
#

Is there a stable diffusion discord bot that I can add to my own server?

tiny cradle
grim tangle
tiny cradle
#

ah maybe that's it then - thanks

quartz viper
grim tangle
quartz viper
#

what a time to be alive!

tiny cradle
iron topaz
#

just clone the github and create a checkpoints folder and put them into it?

grim tangle
half hound
#

@iron topaz imagiAlrgy-Bryce posted this earlier.
setup (Python 3.10, 4090):
pip install -r requirements/pt2.txt
pip install .
-modify streamlit_helpers.py lowvram_mode = True
streamlit run scripts/demo/video_sampling.py

#

I set up a virtual environment first though

#

you also need to do pip install streamlit

grim tangle
#

that's pretty much it yeah

half hound
#

but I ran into an issue

iron topaz
#

ok thanks

half hound
#

still trying to figure it out

tiny cradle
iron topaz
#

nice i installed the wrong requirements file but i guess that's what conda is for xdD

tiny cradle
half hound
#

I am guessing because I can't figure out where to put the safetensors files lol

#

that might be the issue

iron topaz
#

if you look into the script you see that it looks for it there. but idk

half hound
#

i'll try that

iron topaz
#

why does everything happen when i am supposed to sleep x-x

grim tangle
iron topaz
#

hahahah

timid storm
#

the model is rly addictive to use

iron topaz
#

i didn't even use it yet x.x

half hound
#

I feel like I am so close to getting it working lol

iron topaz
#

just installing a ton of dependencies.

half hound
#

are you on windows?

iron topaz
#

no

#

kubuntu

half hound
#

ah ok

iron topaz
#

but my 3090 is half broken, let's hope it works. because an evil person sold it to me. in most tasks it works but with some intense ai stuff it black-screens my pc

half hound
#

darn

iron topaz
#

i can play cyberpunk very well but animate diff for more than 20 frames is too much

half hound
#

finally got it

grim tangle
iron topaz
#

cool

half hound
#

so I ran into 3 issues

#

idk if you will run into them

#

but this is what I had to fix

#

in video_sampling.py I changed the import to from streamlit_helpers import *

iron topaz
#

ok

tiny cradle
half hound
#

3.10

iron topaz
tiny cradle
#

and you're running from the root of the project right?

half hound
#

had to do pip install torchvision and pip install opencv-python as well

grim tangle
#

on windows I just copied the scripts from the demo folder to the root and ran from there

pastel storm
#

Has anyone gotten this to run locally yet?

grim tangle
pastel storm
#

I’m guessing there will be optimizations in the future tho? I have a 3060

turbid rampart
nimble cape
#

Hey! First time poster. I saw you guys just launched your video model. Super big step... congrats. I am interested in working with it.... I am going to need a new system though. Other than the 4090 is there a card that runs the model well?

grim tangle
tiny cradle
earnest tartan
#

When this has matured, will the VRAM requirement be lower or greater?

turbid rampart
iron topaz
#

i get this message 🤔

nimble cape
tiny cradle
iron topaz
grim tangle
tiny cradle
half hound
#

pip install einops, pip install imwatermark, pip install invisible-watermark, pip install omegaconf ran all these other dependencies, thought it was weird I had to install so many, then it circled me back to the script issue again. I am going to make an issue post on the repo. I did get it to the main page, but as soon as I try and select a model it gives me an error

tiny cradle
iron topaz
#

which one is needed, the 2 or the 13?

half hound
#

I ran the pt2.txt

#

yeah I guess I should have done 13 as well

#

i'll try that lol

tiny cradle
half hound
#

which model did you use?

tiny cradle
#

maybe it didn't succeed? or maybe windows is just hard. SVD

half hound
#

got this error when trying to pip install pt13:
ERROR: Ignored the following versions that require a different python version: 1.6.2 Requires-Python >=3.7,<3.10; 1.6.3 Requires-Python >=3.7,<3.10; 1.7.0 Requires-Python >=3.7,<3.10; 1.7.1 Requires-Python >=3.7,<3.10
ERROR: Could not find a version that satisfies the requirement triton==2.0.0.post1 (from versions: none)
ERROR: No matching distribution found for triton==2.0.0.post1

tiny cradle
#

yeah probably comment out triton - might be just for linux. not sure

west carbon
#

I am on WSL2 and I am getting the same issue: No module named 'scripts'

#

tried different python versions, conda environment, venv, etc...

tiny cradle
iron topaz
#

ok i solved it by moving the script to the main directory. strange

west carbon
#

this is after loading the page in the browser.. similar error in there

iron topaz
#

and start it with this command "streamlit run video_sampling.py"

tiny cradle
# west carbon

interesting. moving the script apparently works. wonder why I didn't have to do that

west carbon
#

oh, I will try, thanks

tiny cradle
#

you might try PYTHONPATH=. streamlit run scripts/demo/video_sampling.py
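
Why that can help, as a self-contained sketch: an import like scripts.demo.streamlit_helpers only resolves if the directory that contains scripts/ is on Python's module search path. The demo below builds a throwaway project in a temp directory and shows the import failing, then succeeding once the project root is added to sys.path, which is roughly what PYTHONPATH=. arranges at startup. The package name is hypothetical and chosen so it cannot clash with a real one.

```python
import importlib
import sys
import tempfile
from pathlib import Path


def make_demo_project(root, package):
    """Create <root>/<package>/demo/helpers.py with a single constant."""
    demo_dir = root / package / "demo"
    demo_dir.mkdir(parents=True)
    (root / package / "__init__.py").write_text("")
    (demo_dir / "__init__.py").write_text("")
    (demo_dir / "helpers.py").write_text("ANSWER = 'imported'\n")


def try_import(module_name):
    """Return True if `module_name` can be imported right now."""
    importlib.invalidate_caches()
    try:
        importlib.import_module(module_name)
        return True
    except ModuleNotFoundError:
        return False


# Hypothetical package name standing in for the repo's `scripts` folder.
PACKAGE = "svd_chat_sketch_pkg"
project_root = Path(tempfile.mkdtemp())
make_demo_project(project_root, PACKAGE)

before = try_import(f"{PACKAGE}.demo.helpers")  # root is not on sys.path yet
sys.path.insert(0, str(project_root))           # put the project root on the path
after = try_import(f"{PACKAGE}.demo.helpers")
print(before, after)
```

Moving the script to the project root, as others here did, presumably works for a related reason: the folder of the script being run normally ends up on the search path.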

iron topaz
#

what options do u use?

tiny cradle
#

just svd should be fine

iron topaz
#

why is there no way to input a prompt?

tiny cradle
#

yeah thats not a feature of this

iron topaz
#

image to video?

west carbon
iron topaz
tiny cradle
#

you input an image and get out a video. no prompt

iron topaz
#

lets hope that it does not crash

#

i am at about 20gb vram

tiny cradle
#

looks like it uses 15 gb during generation

iron topaz
#

at what res are u generating

tiny cradle
#

and decoding i'm not sure but you can set it to decode 1 image at a time

#

1024x576

iron topaz
#

it used 20 while generating and then it crashed because it ran out of memory.

#

how can i activate system memory fallback on linux?

tiny cradle
iron topaz
#

nice it worked thanks for all the help

grim tangle
#

decoding 1 at a time actually makes this run fast on 4090, duh...

#

should've just done that from the start 😛

turbid rampart
#

i hope you don't mind me trying it out with your video

iron topaz
#

dont google cat ! xD

turbid rampart
iron topaz
iron topaz
turbid rampart
#

instead of it being partly choppy

iron topaz
#

cool

turbid rampart
#

here it is as a looping gif i think

half hound
#

Thanks @tiny cradle @iron topaz for the help. Looks like it's finally generating. Crossing fingers this time.

iron topaz
patent escarp
#

stuck on ModuleNotFoundError: No module named 'imwatermark' unfortunately even though its installed

half hound
#

Moving video_sampling.py to main dir worked. I ran into some other errors. Said I had python version 3.10.13 and said I should use v 3.10.11 so I created a new venv and then I also had to do pip install xformers and I had to take out triton==2.0.0 from the pt2.txt file to get it to work.

half hound
patent escarp
#

i commented out triton since i'm on a mac and i moved the video_sampling.py to the root dir. still no go though. your vids look good tho! 😄

half hound
#

isn't triton for unix systems?

#

I am on windows

patent escarp
#

some forums say it's for linux, not sure if it's for mac though. i wasn't able to pip install it even with python 3.10. i'll keep plugging away.

iron topaz
#

i dont think it will work for mac

#

but i am sure that it will work in future

patent escarp
#

yeah, you are probably right. i'll just sit tight for awhile and work on some other projects : P

iron topaz
# patent escarp yeah, you are probably right. i'll just sit tight for awhile and work on some ot...

Follow along and set up LLaVA: Large Language and Vision Assistant on your Silicon Mac and any other llama.cpp supported platforms. The performance of 4bit quantized 7B model is amazing and this can be your local ChatGPT Vision alternative and keep your data private.

Timestamps:
00:00 - Introduction
00:59 - Installation & building LLaVA
02:18 -...

#

it can see images

glossy plover
#

I've managed to fix stuff, now getting FileNotFoundError: [Errno 2] No such file or directory: 'outputs/demo/vid/svd\\samples\\000003_h264.mp4'

glossy plover
gleaming tide
#

so if you don't have a 3090, best to just wait for the website launch?

oak swift
#

probably basic question, when I git clone git@github.com:Stability-AI/generative-models.git it says "Cloning into 'generative-models'...
git@github.com: Permission denied (publickey).
fatal: Could not read from remote repository.

Please make sure you have the correct access rights
and the repository exists."
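
The "Permission denied (publickey)" message usually means no SSH key linked to a GitHub account was available for the git@github.com: style URL. A public repository can be cloned over HTTPS without a key. As a small sketch, the hypothetical helper below (ssh_to_https is not part of any tool mentioned here) rewrites the SSH form into the HTTPS form:

```python
import re


def ssh_to_https(url):
    """Convert git@host:owner/repo.git to https://host/owner/repo.git."""
    match = re.fullmatch(r"git@([^:]+):(.+)", url)
    if match is None:
        return url  # not an scp-style SSH URL; leave it untouched
    host, path = match.groups()
    return f"https://{host}/{path}"


print(ssh_to_https("git@github.com:Stability-AI/generative-models.git"))
# https://github.com/Stability-AI/generative-models.git
```

The printed URL can then be passed to git clone in place of the SSH one.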

quartz crescent
oak swift
#

can you get the dogs to actually move and not just the camera, like if you say dogs running?

#

looks good though.

nova sparrow
quartz crescent
west carbon
#

you can try changing the seed

oak swift
glossy plover
nova sparrow
#

me

quartz crescent
# oak swift Do mean can't prompt specific motion like animatediff, but general prompts work?

you cant prompt at all(?) (in the current release. base model supports txt2vid/txt2img2vid according to the paper)

from hf page:

Limitations
The generated videos are rather short (<= 4sec), and the model does not achieve perfect photorealism.
The model may generate videos without motion, or very slow camera pans.
The model cannot be controlled through text.
The model cannot render legible text.
Faces and people in general may not be generated properly.
The autoencoding part of the model is lossy.

west carbon
#

here is a character moving... it's funny because it seems like the character is trying out some camera lenses that distort him, and he is aware of that... haha

oak swift
#

ohh, yeah my bad. I forgot txt to vid wasn't released yet.

quartz crescent
west carbon
#

good to know, thanks!. I would love more explanation about the rest of parameters, like for example: s_churn #1, etc...

west carbon
#

have you tried to loopback it? take the last frame and feed it again at 24 fps using the same seed

quartz crescent
west carbon
#

haha, me too

quartz crescent
#

ooooh yeah the autoencoder is lossy indeed agony

gleaming tide
#

from a dalle3 still image

strange pawn
#

anyone got advice for running on windows with a 4090 - i have to restart the streamlit app after every attempt; seems like a gpu mem leak. Yes i'm using "Decode t frames at a time (set small if you are low on VRAM) = 1"

rustic hinge
#

does anyone have a published script to run this yet?

#

it looks like stable diffusion didn't publish a script to use it

gleaming tide
#

doesn't work so great on 2d art but I'm fascinated by what it came up with here

quartz crescent
torpid falcon
#

Is there a way to run this in a Colab? Would it be hard for me to figure out how to create one? I have so many questions and so little vram 😭😭😭

tiny cradle
#

A lot of my generations are just doing simple translations of the image. Annoying. Any tricks to prevent that?

echo ether
#

Maybe there will be a finetune or something

gleaming tide
#

for 2D it most often does camera pans and doesn't really animate them at all

obtuse bridge
#

haha, just ported the VAE to AnimateDiff. It works really well for it.

#

It sure loves eating VRAM though.

obtuse bridge
# tiny cradle is this a comparison?

Yes, first is the default decoder, second is the temporal decoder with timesteps=4 (which I am assuming is just the amount of cross-frame attention applied)

tiny cradle
#

so the new vae is less noisy. nice

raw coral
#

A small question, what's the difference between svd and svd_image_decoder?

shell plume
#

Hello everyone

#

I created a Google Colab to test it, and it's working quite well with an A100. It downloads two models (svd and svd_xt) from Hugging Face. If it's useful to anyone, here's the link: https://bit.ly/stable-difussion-video

I'm also posting some results. Does anyone know which settings to adjust to make the movement smoother?

gleaming tide
#

your link is not public

sinful vine
peak shore
unborn acorn
echo ether
# shell plume

It seems like there's some sort of a bias for looping vids or small movements...

obtuse bridge
bold prawn
shell plume
bold prawn
unborn acorn
#

20 steps with EulerA. Faster generation, a bit less movement.

fair otter
#

Hi everyone. I'm trying to get this running on either A1111 or Comfy, and maybe I'm just tired, but I feel like I'm missing something. Any help is appreciated.

tiny cradle
fair otter
pastel storm
tiny cradle
pastel storm
#

So close to 12gb! I have a 3060 so I anticipate further optimizations

mental magnet
tiny cradle
#

number of frames increases memory usage so you might be able to run it just generating fewer frames

pastel storm
#

Interesting, do you think there will be a ui for it soon?

royal nebula
obtuse bridge
#

Just wait for diffusers to implement it

#

I just extracted the decoder weights from the model and hacked together something horrible in a jupyter notebook

#

Hopefully diffusers implements some optimizations too because... well, the VAE is requiring 24GB to run on this machine.

#

Assuming I didn't break something which is entirely a possibility still, this desperately needs some form of tiling

tiny cradle
#

it doesn't require that if you set the number of images to decode at a time to 1

obtuse bridge
tiny cradle
#

no its not

#

i think...

obtuse bridge
#

Also I must emphasize that I have completely gutted the model, I am just using the decoder. I'm guessing it's the timesteps argument?

tiny cradle
#

decoding_t

#

model.en_and_decode_n_samples_a_time = decoding_t

obtuse bridge
#

Yeah that's the timesteps argument I think

tiny cradle
#

doubt it

#

that usually means something else

obtuse bridge
#

Setting it to 1 reduced peak memory consumption to under 20gb

obtuse bridge
#

...no, memory usage is still at 20 when using timesteps=16?

#

I'm just going to have to run the profiler PAIN

tiny cradle
solemn turtle
tiny cradle
#

what class is video_encoder

obtuse bridge
obtuse bridge
#

inherits from decoder which I think has the actual important methods in it

tiny cradle
#

hmm i'm tracing how the argument gets to the decoder

obtuse bridge
tiny cradle
#

n_samples = default(self.en_and_decode_n_samples_a_time, z.shape[0])

#
                if isinstance(self.first_stage_model.decoder, VideoDecoder):
                    kwargs = {"timesteps": len(z[n * n_samples : (n + 1) * n_samples])}
#

so timesteps is related

#

you might be right

#

so I think you're missing this logic that's in DiffusionEngine

    @torch.no_grad()
    def decode_first_stage(self, z):
        z = 1.0 / self.scale_factor * z
        n_samples = default(self.en_and_decode_n_samples_a_time, z.shape[0])

        n_rounds = math.ceil(z.shape[0] / n_samples)
        all_out = []
        with platform_appropriate_autocast(
            enabled=not self.disable_first_stage_autocast
        ):
            for n in range(n_rounds):
                if isinstance(self.first_stage_model.decoder, VideoDecoder):
                    kwargs = {"timesteps": len(z[n * n_samples : (n + 1) * n_samples])}
                else:
                    kwargs = {}
                out = self.first_stage_model.decode(
                    z[n * n_samples : (n + 1) * n_samples], **kwargs
                )
                all_out.append(out)
        out = torch.cat(all_out, dim=0)
        return out
obtuse bridge
tiny cradle
#

but this function is running the decoder multiple times, to decode all the frames, and then compiling them together
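A rough sketch of that chunking, with the real VAE swapped for a stand-in so it runs anywhere. `fake_decode` and `decode_in_chunks` are made-up names for illustration, and `default` here is a simplified version of the helper; only the slicing and the per-chunk `timesteps` count mirror the snippet above.

```python
import math


def default(value, fallback):
    """Return value unless it is None (simplified stand-in for the sgm helper)."""
    return fallback if value is None else value


def fake_decode(latents, timesteps):
    """Stand-in for first_stage_model.decode: just tags each latent as decoded."""
    assert timesteps == len(latents)
    return [f"frame({z})" for z in latents]


def decode_in_chunks(latents, n_samples_a_time=None):
    """Decode latents a few at a time and join the results in order."""
    n_samples = default(n_samples_a_time, len(latents))
    n_rounds = math.ceil(len(latents) / n_samples)
    frames = []
    chunk_sizes = []
    for n in range(n_rounds):
        chunk = latents[n * n_samples : (n + 1) * n_samples]
        chunk_sizes.append(len(chunk))
        # Each call only sees len(chunk) frames, which is what bounds peak memory.
        frames.extend(fake_decode(chunk, timesteps=len(chunk)))
    return frames, chunk_sizes
```

With 14 latents and `n_samples_a_time=4` the decoder is called four times on chunks of 4, 4, 4 and 2 frames. Leaving it unset decodes all 14 in one call, which is presumably where the large peak comes from.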

obtuse bridge
#

so basically just what vae slicing does in diffusers

tiny cradle
#

yes kind of

obtuse bridge
#

thanks for the help

unborn acorn
mental magnet
#

Hello,

I'm currently experiencing a technical issue with a Python script that involves downloading a pretrained model using the open_clip module from Hugging Face Hub. However, I'm facing a LocalEntryNotFoundError as the script is unable to access the necessary files from the Hugging Face Hub due to network connectivity issues. The specific model in question is CLIP-ViT-H-14-laion2B-s32B-b79K, and the file it's trying to download is open_clip_pytorch_model.bin.

#

Given this situation, I'm considering manually downloading the model file and placing it in an appropriate location on my local system. However, I'm uncertain about the correct directory where the open_clip module expects to find this file. The default cache directory for huggingface_hub seems to be ~/.cache/huggingface/transformers/, but I'm not sure if this is where I should place the downloaded file.

Could you please advise on the correct procedure for manually downloading and placing the pretrained model file, so that my script can access it without needing to download it from the internet?

Thank you for your assistance.
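For reference, a small sketch of where huggingface_hub looks by default, as far as its documented cache layout goes: the hub cache lives under `~/.cache/huggingface/hub` (not `transformers/`), and each repo gets a `models--<org>--<name>` folder. The function names are made up for this sketch, and it ignores `XDG_CACHE_HOME` for brevity. Note that files inside a repo folder sit under `snapshots/<commit>/`, so dropping a file in by hand is fiddly.

```python
import os
from pathlib import Path


def hub_cache_dir(env=None):
    """Resolve the hub cache directory from HF_HUB_CACHE / HF_HOME, else the default."""
    env = os.environ if env is None else env
    if env.get("HF_HUB_CACHE"):
        return Path(env["HF_HUB_CACHE"])
    hf_home = env.get("HF_HOME") or str(Path.home() / ".cache" / "huggingface")
    return Path(hf_home) / "hub"


def repo_cache_folder(repo_id, repo_type="model", env=None):
    """Folder for one repo, e.g. models--<org>--<name> under the hub cache."""
    folder = "--".join([f"{repo_type}s", *repo_id.split("/")])
    return hub_cache_dir(env) / folder


if __name__ == "__main__":
    print(repo_cache_folder("laion/CLIP-ViT-H-14-laion2B-s32B-b79K"))
```

It may be simpler to point open_clip at the downloaded file directly, since its `pretrained` argument is generally described as accepting a local checkpoint path. That is an assumption to verify against the installed version.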

mental magnet
#

?

mental magnet
#

nope

pure compass
finite breach
#

Woohoo, new here

pure compass
wispy valley
#

how much VRAM is needed for svd_xt? I tried it on 3090 24GB and OOM was reported...

solar willow
#

Hello all!

#

Does anybody know anything about motion buckets and their IDs? For now I just spam some random int from 1-255, but maybe it means something 😄 😄

solar willow
unborn acorn
meager geode
#

The generated resolution is 1024*576. Pictures that are not of this ratio will be automatically deformed and compressed to this ratio.🥹
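One way around the squashing is to crop the input to the same aspect ratio before handing it over. A minimal sketch, assuming the 1024x576 target mentioned above; `center_crop_box` is a made-up helper, and the tuple it returns is in the (left, top, right, bottom) order that Pillow's `Image.crop` expects.

```python
def center_crop_box(width, height, target_w=1024, target_h=576):
    """Return (left, top, right, bottom) of the largest centered box with the target aspect ratio."""
    # Compare aspect ratios by cross-multiplying, which avoids float rounding.
    if width * target_h > height * target_w:
        # Source is wider than the target: keep full height, trim the sides.
        crop_w = height * target_w // target_h
        crop_h = height
    else:
        # Source is taller (or already matches): keep full width, trim top and bottom.
        crop_w = width
        crop_h = width * target_h // target_w
    left = (width - crop_w) // 2
    top = (height - crop_h) // 2
    return left, top, left + crop_w, top + crop_h
```

For a 1024x1024 image this keeps the middle 1024x576 band and throws away the top and bottom instead of stretching everything.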

onyx silo
#

Look for decoding_t: int in the code and set it lower. It defaults to 14 IIRC and can go all the way down to 1, which should reduce VRAM requirements.

#

Doesn't work on MPS though; the Stability AI code assumes xformers

nocturne magnetBOT
grim tangle
#

comparing motion bucket id values, 1024x640, 25 frames with SVD-XT on 4090 (lowvram mode, decoding 1 at a time)

solar willow
grim tangle
solar willow
grim tangle
solar willow
#

Thanks DUDE! For the lowvram and for the comparison!!!!!

grim tangle
#

yeah I didn't even dare to try XT at first... turns out it runs just fine...

#

1 min 22 seconds per gen

solar willow
turbid sluice
#

It lost its face, but I think it's cool.

grim tangle
shell plume
#

Any idea?

last basin
#

does this new model run on a UI?

turbid sluice
cosmic cobalt
#

Note: 40GB of VRAM required -> LMAO

iron topaz
shell plume
iron topaz
cosmic cobalt
iron topaz
cosmic cobalt
iron topaz
#

I have an idea for later. You automatically grab the last frame of a video, make a new vid with it, and build a long video that way 🤔
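A sketch of that chaining loop, with the actual model call left out. `generate_clip` is a hypothetical callable that takes one image and returns a list of frames; nothing here is tied to the real SVD API, and quality would probably drift the further the chain gets from the original image.

```python
def chain_clips(first_image, generate_clip, n_clips):
    """Build one long frame list by seeding each clip with the previous clip's last frame."""
    frames = []
    seed_image = first_image
    for _ in range(n_clips):
        clip = list(generate_clip(seed_image))
        if not clip:
            break  # nothing came back, so stop rather than reuse a stale seed
        frames.extend(clip)
        seed_image = clip[-1]  # the last frame becomes the next starting image
    return frames
```

Plugging in a toy generator such as `lambda img: [img + i for i in range(1, 4)]` with a start of 0 and two clips gives `[1, 2, 3, 4, 5, 6]`, which shows the second clip picking up from where the first ended.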

cosmic cobalt
solar willow
#

Ok... am i correct that each seed is attached to a certain camera movement? Might be super wrong 🙂

#

or the other way... how tf the model selects the movement 🙂

unkempt mica
fallen wren
fallen wren
#

i think if you edit some motion blur into an image that'll strongly influence what motion the model creates in a more controllable way

iron topaz
fallen wren
#

tbh a lot of the early days here is gonna be stuff for developers to play with moreso than end-users

#

normal people get to have the most fun with AI tech only after developers have figured out how to make it work right and then built an interface around that

half hound
#

I am trying to find the sweet spot for generations and I have a couple of questions for everyone. I put my answers on the side:

  • What OS are you using? Windows

  • What graphic card are you using? RTX 4090

  • What image size are you using for the image generation? 1024x1024 will be testing smaller ones now haha

  • What T value are you using? 48

  • What FPS are you using? 12

  • What Decode t frames at a time are you using? 24 but 48 is way faster but more unstable

  • How long does the generation take? 3 hours. decode t frames at 48 is good and takes about 10 mins, but crashes. Trying to find sweet spot for 48.

  • What errors do you run into when generating? I only get errors at the very end: either low VRAM errors or the "Expected all tensors to be on the same device, but found at least two devices" error. It's too bad I have to wait a long time before I can tell if it errored out or not.

copper lynx
#

will SVD only run on machines with 4090 or greater? I have 3070ti and was wondering if it's even worth it to install. loving what I am seeing so far from what people are generating. thanks in advance

silent hinge
#

Anyone running SVD on a new MacBook M3 Max? I'm attempting it but having issues with PyTorch not working with device=mps

half hound
silent hinge
copper lynx
grim tangle
gleaming tide
gleaming tide
fallen wren
#

the model itself can definitely run in lower VRAM than the demo code needs

#

just, yknow, day 1 demo code is always built for the hardware it was trained on, not end-user/consumer tier hardware

#

(remember: SDv1 at launch required 24GiB of VRAM!)

silent hinge
flint current
solar willow
#

Ooooooh my 🙂 This year the holiday cards will be AWESOME 🙂

half hound
wispy garnet
#

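In telling the beginning of a story,
I learned how powerful the back of a high school girl
equipped with a backpack and a scarf can be

DCEXPO2023 talk "New markets opened up by creative AI and AIDX: beyond the metaverse, broadcasting, and media art", supplementary thoughts on the singularity | しらいはかせ (Hacker author) @o_ob https://note.com/o_ob/n/n22d154730de2?sub_rt=share_pw #note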

Original "singularity tunnel" image-to-video Demo
https://www.pixiv.net/artworks/113439831

https://youtu.be/KumvEA6Wu2s

#SVD #StableVideoDiffusion

solemn turtle
idle finch
tiny cradle
solemn turtle
idle finch
solemn turtle
shell plume
#

Hello

solemn turtle
shell plume
#

I just updated it, can you try again? It should be fine, and separated

shell plume
solemn turtle
shell plume
solemn turtle
shell plume
#

check if the file exists where it's looking: /content/generative-models/HERE

#

if it's not there, copy it or move it there

shell plume
#

then the line where it's copied is not working for some reason

#

maybe some of the lines didnt run correctly

#

you have the project in the folder?

solemn turtle
wispy garnet
onyx ore
pastel storm
#

Has it been optimized for consumer hardware yet? I have a 3060 (12gb vram)

fallen wren
#

gonna have to wait a lil more than a day for that lol

#

it'll probably be running on 3090s within a few days

#

3060s will take longer (weeks/months, not sure, definitely not soon)

tiny cradle
#

its already running on 3090

#

i dont know an obvious way to get it down to 12gb though

#

maybe if you only generate 2 frames

fallen wren
#

only posts of it "running" on a 3090 have been buffering all the mem to CPU and taking 3 hours, doesn't count

tiny cradle
#

no it was running quick

fallen wren
#

eh?

#

who's got it running at speed on a 3090 already?

drifting vortex
#

It does take some tries to get a good generation going, and too dynamic a pose ends up deformed. But when it works, it does movements I haven't seen any other video generator pull off before

tiny cradle
#

it didn't take more than lowvram = True and decoder_frames = 1

#

id have to scroll up a bit to find who it was 🙂

fallen wren
#

oo

#

well, still. 3060 is gonna take a lot longer

tiny cradle
#

3090 has 24gb ram so we'd expect it to work right?

#

it was @grim tangle

grim tangle
#

Lowvram option seems to run it at fp16

solemn turtle
grim tangle
#

I actually got it running in comfy too, hacky way and not a proper implementation, but it works

#

25 frames at the default Res, with fp16, takes just bit under 20gb

solemn turtle
grim tangle
#

But I couldn't get it any lower even by reducing resolution...

tiny cradle
#

reducing number of frames reduces memory needs

grim tangle
#

Well easier to implement queues and stuff in comfy, can also generate the inits etc.

tiny cradle
#

so we need to see what peak memory is at 4 frames

grim tangle
#

Yeah frame count influences it greatly

hazy gorge
#

Can this model be fine-tuned so that it does more specific things and works faster and with less vram?

tiny cradle
#

they've already announced it will be finetuned to do a bazillion specific things

idle finch
#

@shell plume everything runs great – downloading and copying files without any problems. but after I try to run it I see this in console:

VideoTransformerBlock is using checkpointing
^C

one time it downloaded the .bin file while attempting to load a model in the app, but that was only once and now it keeps failing

copper lynx
tiny cradle
#

you make it in the root of the project

copper lynx
#

so if I'm in Windows, in my generative-models folder I can create a folder called "checkpoints"?

tiny cradle
#

yes. unrelated to OS

copper lynx
#

ah, good to know. thank you. so once I create the folder called "checkpoints" I can continue with your install steps? thanks so much! have a great day 🙂 Im gonna try and get this to work on a 3070ti over the next few days

tiny cradle
#

put the weights in the checkpoints folder yes

tiny cradle
copper lynx
tiny cradle
#

i guess pip install streamlit

#

but streamlit is already listed in the requirements file so it makes me think that step failed for you

copper lynx
#

seems to be working now. thank you

#

installing streamlit that is.

#

Using cached smmap-5.0.1-py3-none-any.whl (24 kB)
Installing collected packages: pytz, zipp, watchdog, validators, urllib3, tzdata, typing-extensions, tornado, toolz, toml, tenacity, smmap, six, rpds-py, pygments, protobuf, pillow, packaging, numpy, mdurl, MarkupSafe, idna, colorama, charset-normalizer, certifi, cachetools, blinker, attrs, tzlocal, requests, referencing, python-dateutil, pyarrow, markdown-it-py, jinja2, importlib-metadata, gitdb, click, rich, pydeck, pandas, jsonschema-specifications, gitpython, jsonschema, altair, streamlit

idle finch
copper lynx
#

might have been because earlier I was having to convert the instructions from Unix based to Windows based with GPT help lol

tiny cradle
#

i just added this step:

  • double check that pip install actually worked. on windows you may need to comment out xformers and triton
tiny cradle
copper lynx
# tiny cradle i just added this step: - double check that pip install actually worked. on wind...

awesome. you're the man! it said it worked I think

Successfully installed MarkupSafe-2.1.3 altair-5.1.2 attrs-23.1.0 blinker-1.7.0 cachetools-5.3.2 certifi-2023.11.17 charset-normalizer-3.3.2 click-8.1.7 colorama-0.4.6 gitdb-4.0.11 gitpython-3.1.40 idna-3.4 importlib-metadata-6.8.0 jinja2-3.1.2 jsonschema-4.20.0 jsonschema-specifications-2023.11.1 markdown-it-py-3.0.0 mdurl-0.1.2 numpy-1.26.2 packaging-23.2 pandas-2.1.3 pillow-10.1.0 protobuf-4.25.1 pyarrow-14.0.1 pydeck-0.8.1b0 pygments-2.17.2 python-dateutil-2.8.2 pytz-2023.3.post1 referencing-0.31.0 requests-2.31.0 rich-13.7.0 rpds-py-0.13.1 six-1.16.0 smmap-5.0.1 streamlit-1.28.2 tenacity-8.2.3 toml-0.10.2 toolz-0.12.0 tornado-6.3.3 typing-extensions-4.8.0 tzdata-2023.3 tzlocal-5.2 urllib3-2.1.0 validators-0.22.0 watchdog-3.0.0 zipp-3.17.0

#

🫂

solemn turtle
onyx ore
#

Hey

solemn turtle
# idle finch pity

did u get colab to work for stable vid dif? from ur older comments, u encountered some errors

copper lynx
# tiny cradle i guess pip install streamlit

when I tried to run streamlit it returned this

Traceback (most recent call last):
File "C:\Users\xxxxxxxx\generative-models.pt2\lib\site-packages\streamlit\runtime\scriptrunner\script_runner.py", line 534, in _run_script
exec(code, module.__dict__)
File "C:\Users\xxxxxxxxx\generative-models\scripts\demo\video_sampling.py", line 3, in <module>
from pytorch_lightning import seed_everything
ModuleNotFoundError: No module named 'pytorch_lightning'

tiny cradle
#

yeah your pip install of the requirements file must not have worked

copper lynx
# tiny cradle yeah your pip install of the requirements file must not have worked

should I go back through and do the install again?

  1. Clone the repo
    git clone git@github.com:Stability-AI/generative-models.git
    cd generative-models
  2. Setting up the virtualenv
    This is assuming you have navigated to the generative-models root after cloning it.
    NOTE: This is tested under python3.10. For other python versions, you might encounter version conflicts.
    PyTorch 2.0
    install required packages from pypi
    python3 -m venv .pt2
    source .pt2/bin/activate
    pip3 install -r requirements/pt2.txt
  3. Install sgm
    pip3 install .
  4. Install sdata for training
    pip3 install -e git+https://github.com/Stability-AI/datapipelines.git@main#egg=sdata

If not the whole thing, which parts?

tiny cradle
#

pip3 install -r requirements/pt2.txt

copper lynx
# tiny cradle pip3 install -r requirements/pt2.txt

I did that. it returned this error

ERROR: Ignored the following versions that require a different python version: 0.55.2 Requires-Python <3.5; 1.6.2 Requires-Python >=3.7,<3.10; 1.6.3 Requires-Python >=3.7,<3.10; 1.7.0 Requires-Python >=3.7,<3.10; 1.7.1 Requires-Python >=3.7,<3.10
ERROR: Could not find a version that satisfies the requirement triton==2.0.0 (from versions: none)
ERROR: No matching distribution found for triton==2.0.0

tiny cradle
idle finch
tired egret
copper lynx
tiny cradle
#

pt2

#

a # at the start of the line comments it out
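If you'd rather script it than edit by hand, here is a small sketch that does the commenting on the file's text. `comment_out` is a made-up helper, and the sample lines in the usage note are illustrative, not the real contents of pt2.txt.

```python
def comment_out(requirements_text, packages):
    """Prefix matching requirement lines with '# ' and leave every other line untouched."""
    wanted = {name.lower() for name in packages}
    out = []
    for line in requirements_text.splitlines():
        stripped = line.strip()
        # The package name is whatever comes before the first version specifier.
        name = stripped
        for sep in ("==", ">=", "<=", "~=", ">", "<", "[", " "):
            name = name.split(sep)[0]
        if stripped and not stripped.startswith("#") and name.lower() in wanted:
            out.append("# " + line)
        else:
            out.append(line)
    return "\n".join(out)
```

For example, `comment_out("torch>=2.0.1\ntriton==2.0.0", ["triton", "xformers"])` leaves the torch line alone and turns the second line into `# triton==2.0.0`. Read the file, pass its text through, write it back, then rerun the pip install.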

copper lynx
#

ok so save that pt2 file after commenting out and try to install the requirements again?

tiny cradle
#

yes

half hound
#

Hi all, I made instructions on how to install stable Video Diffusion on windows. Here is the text:

#

Setup Instructions (Python 3.10.11, 4090, working on Windows):
Go to user directory
right click git bash
git clone https://github.com/Stability-AI/generative-models.git

-modify streamlit_helpers.py
lowvram_mode = True

move video_sampling.py file to main dir
create a checkpoints folder in the main dir
download the SVD weights from https://huggingface.co/stabilityai/stable-video-diffusion-img2vid/tree/main
(optional) download SVD-XT weights from https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt/tree/main

-modify requirements/pt2.txt file
remove triton==2.0.0 line and save

-modify requirements/pt13.txt file
remove triton==2.0.0.post1 line and save

Open Anaconda
cd to user/generative-models
conda create -n genModelVideo python=3.10.11
conda activate genModelVideo

pip install https://huggingface.co/r4ziel/xformers_pre_built/resolve/main/triton-2.0.0-cp310-cp310-win_amd64.whl
pip install -r requirements/pt2.txt
pip install .
pip install -r requirements/pt13.txt

streamlit run video_sampling.py

click "Load Model"

upload image and there you go.

Will get a tensor error but you can ignore it. Still seems to work

*try 48 decode t frames for faster generation

#

and here's a video explaining it

tiny cradle
half hound
#

why?

tiny cradle
#

makes no sense. you're supposed to pick whether you're installing pytorch 1.13 or 2.0

#

and svd only works with 2.0 i believe

rigid orchid
#

has anyone tried to run this on CPU? it seems to be bypassing the GPU memory but I bet it will output slower

half hound
#

I did run svd and pip install pt13 last and it generated a video

#

trying svd_xt right now

tiny cradle
half hound
#

ok i'll take a look which version I have installed in my venv

#

I tried the svd_xt and it also generated a video

copper lynx
#

okay i installed all requirements without any errors.

when I ran streamlit it returned this

C:\Users\xxx\AppData\Local\Programs\Python\Python310\lib\site-packages\torchaudio\backend\utils.py:74: UserWarning: No audio backend is available.
warnings.warn("No audio backend is available.")
2023-11-22 12:49:08.964 Uncaught app exception
Traceback (most recent call last):
File "C:\Users\xxx\AppData\Local\Programs\Python\Python310\lib\site-packages\streamlit\runtime\scriptrunner\script_runner.py", line 534, in _run_script
exec(code, module.__dict__)
File "C:\Users\xxx\generative-models\scripts\demo\video_sampling.py", line 5, in <module>
from scripts.demo.streamlit_helpers import *
ModuleNotFoundError: No module named 'scripts'

half hound
#

don't know if i'll run into issues later though everything seems to be working.

tiny cradle
jade carbon
#

I believe when you go to install 1.13, it ignores packages that are already installed.

#

Therefore, any matching dependencies are just skipped

half hound
#

yeah I agree. I was running into lots of issues trying to install it so I was trying anything to get it to work lol

tiny cradle
jade carbon
#

Could cause issues with torch though, so I usually manually install this after to match my CUDA version

jade carbon
#

That would be generative-models

tiny cradle
half hound
#

I had to remove the triton from the requirements

#

to get it to work

#

on windows

rigid orchid
#

doesn't work with cpu because some of the layers only support cuda

jade carbon
#

It has something to do with python path, though, even manually setting it to the folder, I'm not sure why it doesn't find the scripts folder
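One possible reading, offered as a guess: Python can only resolve `scripts.demo...` if the folder that contains `scripts` is on `sys.path`. The sketch below builds a throwaway package in a temp folder to show the import failing, then working once the parent folder is added. The package name `scripts_sketch` and the helper names are made up to avoid clashing with anything real.

```python
import importlib
import sys
import tempfile
from pathlib import Path


def make_fake_repo(root, package):
    """Create <root>/<package>/demo/streamlit_helpers.py as a tiny importable package."""
    demo = Path(root) / package / "demo"
    demo.mkdir(parents=True)
    (Path(root) / package / "__init__.py").write_text("")
    (demo / "__init__.py").write_text("")
    (demo / "streamlit_helpers.py").write_text("LOADED = True\n")


def can_import(module_name):
    """Return True if module_name imports with the current sys.path."""
    importlib.invalidate_caches()
    try:
        importlib.import_module(module_name)
    except ModuleNotFoundError:
        return False
    return True


def demo():
    package = "scripts_sketch"  # hypothetical stand-in for the repo's 'scripts' folder
    with tempfile.TemporaryDirectory() as root:
        make_fake_repo(root, package)
        target = f"{package}.demo.streamlit_helpers"
        before = can_import(target)  # parent folder is not on sys.path yet
        sys.path.insert(0, root)
        try:
            after = can_import(target)
        finally:
            # Undo the path change and forget the throwaway modules.
            sys.path.remove(root)
            for name in [m for m in sys.modules if m.split(".")[0] == package]:
                del sys.modules[name]
    return before, after
```

`demo()` returns `(False, True)`. For the real repo, the equivalent would likely be launching streamlit from the generative-models root with `PYTHONPATH` set to that root, though I haven't confirmed that fixes this exact case.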

tiny cradle
half hound
#

No I tried last night. I tried to just do the install with just pt2

#

ran into errors then had to manually do pip installs

#

then just got into a loop of errors

#

then I went back and looked at any errors while I was installing it

tiny cradle
#

in my custom fork i'm not using xformers but maybe I made some tweaks to get that working

half hound
#

and triton gave me an error and I think it skipped the rest of the requirement installs

#

I think that was the issue

rigid orchid
solar willow
#

Ok. After i generated 2394 nsfw animations i tried architecture... THIS IS IT.... 10x10. No cherrypicking.

half hound
#

Looks good 👍

zenith spoke
fossil vine
#

Stunningly beautiful oak tree, on the edge of a forest, in the foreground there is grass blowing in a gentle breeze, the tree is in the middle ground, summer, in the background a gently rising hill

grim tangle
grim tangle
#

still need to add rest of the settings and better memory management to allow larger workflows around it

half hound
#

how do I get the install via git url

#

my manager doesn't have it

zenith spoke
#

latest comfyui?

#

i had it gone once but had to update comfyui and manager, dunno what fixed it really

half hound
#

ok

#

i just updated it

#

ok yep

#

that was it, just need to restart it

#

can I drag and drop this workflow?

solar willow
copper lynx
gray sleet
copper lynx
#

pip install Anaconda? sorry im such a noob

half hound
#

need to download it here

#

you also need git

golden oyster
#

What are the requirements for my pc to run this smoothly

half hound
#

works on my RTX 4090. I've read other people were able to run it with an RTX 3090

golden oyster
#

I have rtx 3080ti is it enough? What about ram and storage

copper lynx
copper lynx
half hound
#

how much vram do you have? You might as well install and test it out. You can decrease the decode t frames

half hound
grim tangle
#

nice!

sterile mesa
#

hey @shell plume , curious why you !pip install what you did in your Colab:

!pip install -r requirements/pt2.txt
!pip install .
!pip install -e git+https://github.com/Stability-AI/datapipelines.git@main#egg=sdata

Wasn't clear on why the . and datapipelines.git?

iron topaz
#

@tiny cradle is it theoretically possible to add a ControlNet? I have heard that it's based on SD 2.1, is that true?

tiny cradle
tiny cradle
iron topaz
fallen wren
zenith spoke
fair otter
urban linden
grim tangle
grim tangle
grim tangle
#

Video helper suite (VHS) has good nodes to make it into a video

urban linden
grim tangle
#

Yeah if you are generating a single image

#

Just make sure the resolution of the image is compatible, needs to be divisible by 64
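A tiny sketch for picking a compatible size, assuming the multiple-of-64 rule as stated here. `snap_down` and `compatible_size` are made-up helper names, and rounding down (rather than to nearest) is just one choice.

```python
def snap_down(value, multiple=64):
    """Round value down to the nearest multiple, but never below one multiple."""
    return max(multiple, (value // multiple) * multiple)


def compatible_size(width, height, multiple=64):
    """Return a (width, height) pair where both sides are multiples of `multiple`."""
    return snap_down(width, multiple), snap_down(height, multiple)
```

So 1024x576 passes through unchanged, while something like 1000x600 becomes 960x576.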

rustic hinge
#

how do I set the streamlit demo to use multiple GPUs for shared VRAM?

fair otter
tiny cradle
#

the streamlit version outputs two videos for me every time

grim tangle
sterile mesa
#

ive got the ui set up and running, but no clue what settings to change lol

ionic palm
quick finch
quick finch
sterile mesa