#šŸ’¬ļ½œgeneral-chat

1 messages Ā· Page 120 of 1

topaz parcel
#

good analogy. I think most of us are guilty of abusing the 'abundance' we have now in resources. Maybe having less like you say would result in more creativity and creative thinking. (and much less pixar versions of Marvel characters in our streams )

true canopy
#

it would get me 600 hours for the gpu i just bought

#

id rather get the gpu

karmic cedar
#

simplicity is a constant in pretty much every domain of life, it’s no surprise that it pays off with AI as well

true canopy
#

but if u can get rly cheap eletricity, like is iceland, then u could get into selling gpu power, its in“tresting

karmic cedar
#

there is a potential game plan for all of this…but it involves fusion energy, which is going to involve lots of helium for cooling…but once we have zero point energy, we will be able to AI all we want.

true canopy
#

i would rly like to know what kinda hardware they are running sora on, must be something massive

karmic cedar
#

considering it’s estimated that GPT 4 took upwards of 3-400 million dollars to train…

#

and that’s just a language model

true canopy
#

no 1$ an hour there šŸ˜›

karmic cedar
#

I know I’m just sayin…I’m sure those numbers are connected

true canopy
#

imagine messing up prompt and stuff on that hardware, expensive

karmic cedar
#

yeah seriously

#

that’s the most 21st century problem like…..ever lol

#

ā€œwhen you mess up a prompt on a multi-billion dollar state of the art reality distortion algorithmā€

storm shard
#

Get https://drawthings.ai/ it's perfect for apple silicon and very powerful once you get to know the interface
There's also a discord which is very helpful, it's full with tips and extras and the people over there are also helpful as well as friendly

#

And it's totally free

karmic cedar
#

Does Draw Things run outside of iOS/iPadOS?

#

I remember it being the first app to drop with diffusion support.

storm shard
#

yes, it runs on iPadOs, MacOs and iPhone

karmic cedar
#

Nice!

#

Does it support img2img and control nets now?

storm shard
#

yes

karmic cedar
#

whoa

broken cave
#

what is your goal? what images do you want to make? and why?

storm shard
#

also lora training (from 8gB)

#

and video, but you need 16gb ram for that
and ipadapter
just a lot, have a look @karmic cedar @topaz parcel

broken cave
#

why do you need a "LORA"?

karmic cedar
#

I stopped using it once my needs grew, but I’m really impressed at the scale of development. That dev is a one-man band

broken cave
#

what is a specific image that you want to create, as an example?

karmic cedar
#

His issue is more medial I think

waxen tapir
#

Hey where is the promt option to generate image

waxen tapir
green mauve
#

thanks

oblique jay
#

Does anyone know of AI models that can do TTS locally? I am looking for results similar to ElevenLabs, but I'd like to run it locally.

karmic cedar
#

not yet

fallen gale
#

has anyone figured out how to use champ well yet

#

im getting bad results

astral goblet
#

whats champ

fallen gale
#

the dance thing

#

controllable human video generation

astral goblet
#

neat!

violet fern
#

Someone access to the SD3 model? How do you run it? Asking since it probably wont work with automatic1111 or comfy?

astral goblet
#

no weights are released yet

#

its a web interface

#

people internally at stability are probably using comfyui custom version. he works for them

violet fern
#

So i've got the invitation email, linking to their huggingface with the model-files...

true canopy
#

ouch, people are going to be upset lol, getting the inv and asking here how it works

still glacier
astral goblet
#

seems made up lol

#

if you got invited to download weights, you're not going to be someone whose asking how to use them

#

or maybe the new CTO/CEO is incompetent in his direction

violet fern
#

I just did sign up today and almost instantly got the email.

#

Anyway. I will figure it out then.

charred mesa
#

so it is that way

astral goblet
#

maybe i'm wrong. this guy says he got the weights immediately after signing up

astral goblet
#

why would someone go online and lie?

#

you really think?

charred mesa
#

absolutely not

true canopy
#

its even worse then when people come in here talking about religion

charred mesa
#

Its all factual information

charred mesa
astral goblet
karmic cedar
#

Those go down in the dm tho

charred mesa
topaz parcel
# storm shard yes

I used to use DFrawthings -and it's currently installed. I stopped using it cuz, at the time, you could not use Controlnet, etc. But I may have to give that another go, too. If i recall though, it was very slow. maybe its got better?

topaz parcel
# broken cave what is your goal? what images do you want to make? and why?

I just like to control my images, with a control net - but as I say, XL models and control net on a silicon Mac just do not play very well together. was fine with SD 1.5 models. and running standard txt2img XL models is no prob - and very fast. But img2img and controlnet - not so much fun on a Mac M2.

karmic cedar
#

As fast as the M-series is, nothing compares to running big data models on mythical cloud hardware

bleak drift
#

Is it normal for a rtx 3060 gpu to generate an image in under 10 seconds that is supposed to be super detailed?

#

Could it be the Lora I am using?

storm shard
broken cave
broken cave
astral goblet
#

i dont think he's confused. i think he knows exactly what he's saying

sweet fox
#

It's normal. RTX3060 is nearly the slowest rtx3000+ gpu but it's not that slow. Also not having to offload because you have 12GB of vram to work with is really nice.

sudden hedge
#

Is reactor still a good faceswapping tool? All my images after face swapping come out blurry. Ive looked into this and cant find any definitives as to why. Some say my input image isnt detailed enough.... ive upscaled it and i can literally see the pores, and the bags under the eyes. Lol

astral goblet
#

No. Reactor is one of the old crusty versions of face reconstruciton. It's just pasting the new face on top of the old one, and then polishing it with codeformer. Very crusty. Very old.

IP adapters use insightface to do face swaps now. They bring the reconstructed face in at each step of the diffusion process, so that it's blended into the final pic much better. They still use the insightface 128px non commercial model though

#

@sudden hedge

#

you can only go so far with pasting a face on top of another

sudden hedge
astral goblet
#

i reocmmend ipadapter face swapping. make sure you use the lora with it

astral goblet
#

huh?

dusk canopy
#

There is a plugin that's great with face swap

#

Refractor

trail lion
#

refractor or reactor?

astral goblet
#

Reactor is absolute crap compared to the IP adapter or Instant ID

crude notch
broken cave
loud solar
#

P*rn šŸ˜„

broken cave
#

there is already a lot of it so i don't know what the point is

#

it's like who the hell cares

loud solar
#

I don't ^^

astral goblet
#

just go full nihilist

#

whats the point of music? theres so much already. who cares?

loud solar
#

I'm not sure I know what art is ... I'm just doing stuff šŸ˜„

astral goblet
#

whats even the point of doing stuff. it's all been done

loud solar
#

I should do more exhibitions ^^

#

At least it's your stuff ...

timid island
#

what upscaler do people use? i've been using 4x_foolhardy_Remacri

karmic cedar
#

SUPIR

split kestrel
#

My brain hurts

cedar salmon
#

different models and methods depending on desired effect

#

that foolhardy remacri seems to do well for me with nature type things

split kestrel
#

Do you guys jump around on different model sets or generally right tool for right job stuff?

dusk canopy
#

Ye

#

I settled on Proteus with Loras

timid island
#

i was happy with 1.5 but everything is slower now on my poor gtx1070 on sdxl 😢

astral goblet
#

theres tons of 1.5 models yet. just use those.

#

cyber realistic just put out a few new versions that are really good

timid island
#

just been getting some nice details on sdxl

#

oh the new cyber realistic one looks good, there is always new ones of that one coming out

karmic cedar
#

1.5’s value will last a relatively long time on account of the power of upscalers

trail lion
#

anyone know of a way in auto to generate a bunch of depth maps in batch? for auto1111

timid island
#

DPM++ 2M Karras or DPM++ 3M Karras? is 3M just better or only in certain situations?

karmic cedar
arctic sedge
#

Why would they think it's not useful?
From what we've seen it's amazing.

astral goblet
#

ego? i don't know.

#

Everyone is sleeping on lavi-bridge

timid island
#

one thing i can't get a solid answer for from reading about it, will a full fp32 model get better results than the reduced fp16 version?

astral goblet
#

no. i use fp8 and get results nobody would guess wasn't full precision

karmic cedar
#

I would think so?

timid island
karmic cedar
#

Good to know

astral goblet
#

base models are trained in full precision. inference doesn't really need it

timid island
#

so full precision is more needed for training then?

astral goblet
#

hard to say if refining weights even benefits from full precision. negligible imo

#

i train loras in half precision

timid island
#

i think my 1070 would cry at being trained on lol

split kestrel
#

I got my card to crash yesterday šŸ™‚

#

240 frame batch.

astral goblet
#

over heat?

karmic cedar
#

My hunch is that precision would come into play more as prompt complexity increases

split kestrel
#

Maybe it was 512z

#

Naw, just ran it dry of resource

#

Windows got angry

astral goblet
#

how? more frames doesn't increase memory use. just the context batch size. šŸ˜®ā€šŸ’Ø

#

i'm exhausted trying to convey this to people

split kestrel
#

lol

#

I was upscaling to death I think

pastel turtle
#

Will SD3's 8B model (w/o T5) fit into 20GB VRAM on FP16? I'm sure the T5 encoder model is optional... right?

split kestrel
#

No idea how I got it to choke

arctic sedge
split kestrel
#

It crashed when it was on the final part where it was making a gif

karmic cedar
#

I wonder if Sora runs on 64-bit precision…

#

I wonder if Sora has that 16-bit blast processing chip!!!! iykyk

timid island
#

any reason to use vae decode as tiled? am looking at a workflow and have no idea why there is a tiled version

split kestrel
#

Keep in mind friend. I am noob and do not know what I’m doing 90% of my life

karmic cedar
split kestrel
#

It made sense when I looked at it lol

#

Animated likes to use a high step count I’ve noticed

#

24+

astral goblet
#

i animate at 4cfg and 20 steps. 1 cfg and 14 steps with lcm lora

split kestrel
#

Lora is just a different type of checkpoint model right?

#

All that shi* i crammed in my brain yesterday lol gone

timid island
#

no lora is more of an addon

#

add something that isn't in the model

#

so like mini-model you use alongside the base model

crude notch
#

3m better with lower cfg

crude notch
karmic cedar
#

😮

crude notch
#

on the h200s

#

sora takes too long to be viable

timid island
crude notch
#

i dont change cfg much lol

split kestrel
#

H200’s?

#

So is Sora a render farm type thing?

crude notch
split kestrel
#

What’s the length of video render?

#

5 hours of render would not be bad if you got what you asked lol

#

But if it’s 5 hours per 24 frames…. I’m out lol

#

Ask for sad movie about a penguin and a monkey… then sit back and wait for 2 years

karmic cedar
#

peddling hollywood with it right away is a sure fire means to drive human despair downward

low moon
#

So Midjourney and Leonardo came out with "consistent characters" When is SD doing it? xD

karmic cedar
#

SD is slamming the brakes

low moon
#

And why is Sora not open sourc eif its mad eby a company called OPENai

#

nothign makes sense

karmic cedar
#

it’ll be up to the community to develop this stuff

#

because economy

#

that’s why

low moon
#

but there is more debt in the world than actualy money

karmic cedar
#

indeed

low moon
#

so the economy is clearly a very reasonable standard

karmic cedar
#

the economy defines debt

timid island
#

i honestly don't know why they called themselves OpenAI when they've never had any intention of making their stuff open

karmic cedar
#

but…we define money

low moon
#

exactly

karmic cedar
#

in theory.

split kestrel
#

So… when SD3 arrives. Do we think it’s gonna break all current workflow type stuff?

low moon
#

yes

timid island
#

probably

karmic cedar
#

Depends on how quantized it is.

low moon
#

probably wotn even work in A1111

karmic cedar
#

er…neutered

#

er…lobotomized

low moon
#

it will first work on comfy of course...

karmic cedar
#

yep

low moon
#

fooocus will add it last

karmic cedar
#

But it’ll be great when they do

low moon
#

SDXL matured really well

timid island
#

i really hope 3.0 isn't neutered like 2.0 is. SDXL isn't neutered i don't think

low moon
#

i remember some sayign a fe wmonths after ir eleased "oh it failed no oen cares people prefer 1.5"

#

well now SDXL stuff is everyhwree and really refined

amber bloom
#

I think all the major frontend developers have got access to the SD3 beta, so hopefully we'll have full support from day 1

timid island
#

what resolution is SD3 at?

split kestrel
#

SDXL has been fun

low moon
#

Oh yeah will SD3 still have boobs?

split kestrel
#

lol

low moon
#

anywya they cna be added with loars

#

loras

split kestrel
#

Cat girl people losing minds

dusk canopy
#

someone give me prompt

#

im bored

karmic cedar
#

Time will tell. I think what we’re seeing—based on Sam Altman’s movements in D.C. as well as Sora’s apparent ultra-high end commercial appeal—is that the continued development of img2vid models will break apart similarly to what’s happening now with Stability

dusk canopy
#

ill generate anything tbf

split kestrel
#

Add a NSFW filter

#

If the prompts are better

timid island
#

don't add a NSFW filter

dusk canopy
#

i put this as a prompt

split kestrel
#

Little joker in that

#

Mech warrior cat girl zombie horde

#

, big googly eyes

teal pagoda
#

yo

#

Anyone here seeing a "dark" future for the open-source AI?

dusk canopy
#

no

charred mesa
#

in what way

dusk canopy
#

aslong as they release sd3

charred mesa
#

šŸ‘€

dusk canopy
#

i dcouldnt give a shit

#

šŸ’€

charred mesa
#

same fr

dusk canopy
#

i have a 7b model llm ive been finetuning

charred mesa
#

SD3 will generate infinite amounts of fun

dusk canopy
#

and a image model ive been fine tuning

#

i actually dont care aslong as they release sd3

#

because id have a good llm and image gen setup locally

charred mesa
#

and peopel will further refine it with additional DPO and datasets and create perfect models for actions, expressions, arststyle, ETC

teal pagoda
# charred mesa in what way

In a way that we'll enter into some dark times for the free AI art generation especially, everyone thinking about profits

karmic cedar
charred mesa
#

^

#

if SD3 ends up being uncensored by the community we'll be golden then they can't do crap to us offline folks

#

but we need to act quick

karmic cedar
#

They will do crap to the offline folks by other means

#

šŸ™‚

charred mesa
karmic cedar
#

Regulation of AI will be a very big deal once it hits.

charred mesa
#

yes

karmic cedar
#

I’m old enough and pessimistic enough to know how this shit is going to go down, and it disappoints me.

charred mesa
#

I mean in the current state, using Comfy with our SD3 models

karmic cedar
#

sure sure

dusk canopy
#

im bored

charred mesa
#

for the future models and etc, absolutely

dusk canopy
#

someoen gimme prompts

karmic cedar
#

I mean, get your creativity out while you can…I guess.

dusk canopy
#

i deadass ran out of ideas

astral goblet
#

after studying a bunch of the diffusers manual, trying to figure out how to weild the python code effectively, i have come to the conclusion that covid has brain damaged me and i can't code anymore

charred mesa
#

but with our current UIs and models we'll be safe

karmic cedar
#

It’s not just COVID…it’s age.

#

lol

astral goblet
#

so frustrating to see every other dev with talent just completely sleep on lavi-bridge. how awesome would using t5 for sd15 models be?

charred mesa
#

exactly

karmic cedar
#

yeah, that’s odd.

charred mesa
#

I'm surprised that there's no Forge or Comfy plugin yet

#

its been like week(s)?

karmic cedar
#

are we certain folks are sleeping on it?

astral goblet
#

covid accelerated the aging. i'm only 40. shouldn't be this inept

karmic cedar
#

also 40

#

šŸ™‚

#

okay, maybe it’s COVID

#

lol

astral goblet
#

i'm willing to accept i'm old but studying shouldn't be this hard lol

trail lion
#

speaking of forge, been using it for 2 days, it really is better, I was skeptical

low moon
#

maybe open AI stuff will go the way of the pirate ships if too much economy considerations and cencorship come sin

#

imagine downlaoding SORa on piratebay

#

leaks make thw world go around

trail lion
#

not sure why they had to fork it, maybe the auto1111 guy is stubborn

astral goblet
karmic cedar
#

there’s already an Open Sora project going

astral goblet
#

a lot of the extensions seem to break forge too. its neat if you got older hardware i guess

karmic cedar
#

it’s…pretty silly atm but it’s got potential

dusk canopy
#

we need torrenting buit for training ai models

charred mesa
#

man, is there a way to improve prompt adherence with Stable Cascade

dusk canopy
#

everyones gpus work together to train model

trail lion
#

hrm, maybe, I've tried ipadapter, reference, depth and various other controlnet models, and I havent gotten a single oom, so I'm pretty happy right now. Dont even need the medvram flag I had to use before

low moon
#

im addicted to foooooooooooooooocus lately, i looked down on ti when i heard about it thinkign somethign so simple cant be good probably its a stripped down version of A1111 but no, its pretty good and memory efficient

karmic cedar
#

yep, and it knows how to use a GPT2 prompter

dusk canopy
#

which was eh

#

i got good selfies from it tho

astral goblet
buoyant merlin
#

lmao i cant get to gen a face that doesnt look like the eyes were smooshed out

astral goblet
#

someone coming into your house and changing everything.. resisting that isn't exactly stubborn. if the changes prove themselves they'll be adopted surely. illy has even said forge may turn into a pull request one day

crude notch
#

you could do something like SLURM, but even then

dusk canopy
crude notch
#

only could work with batch training

dusk canopy
#

like bitcoin mining

crude notch
#

e.g. same model for all, pass diff data for each, do things after

split kestrel
#

SVD is too short 😦

#

And … kinda… bland

#

Cool

charred mesa
#

yeah its mostly pans

split kestrel
#

But bland

charred mesa
#

SVD_XT can go up to like 25 frames lol

astral goblet
karmic cedar
#

well, if social media continues down the tiktok rabbit hole short will feel like long and it’ll all blow over and be okay because culture!

astral goblet
#

when i want to show off image making to new people, i load up fooocus

split kestrel
#

From my understanding and all the bloody reading…. You can’t batch render SVD either

#

It’s just… 25 frames

karmic cedar
#

you can feed it the last frame and the same seed, but the fidelity takes a nice knock

low moon
astral goblet
split kestrel
#

Meh tho.

astral goblet
#

or code one giant megalith node

split kestrel
#

Cool! But I think it’s more work than it’s worth

astral goblet
split kestrel
#

I don’t know why I’m so hyper fixated on animating SDXL

astral goblet
#

vision

split kestrel
#

I’m not sure if that’s a statement or a node

trail lion
#

since several of you seem to be into video, is there anything else that does what ebsynth does? seems like not

split kestrel
#

What is that?

trail lion
#

like give it 10 keyframes and it fills it in with intermediate frames

astral goblet
#

yeah theres tons of frame interpolation things

trail lion
#

ffmpeg has an 'minterpolate' flag, I was excited when I found it, but it's terrible

astral goblet
karmic cedar
#

that’s impressive stuff, but are the keyframes it generates going to be enough to interpolate into smooth enough motions? It seems like there might still be a stilted motion effect from how spaced apart they are

astral goblet
#

FILM makes super smooth interpolations from animated diff generations

karmic cedar
#

that’s the one i usually do

astral goblet
#

the webui extension supports it if deforum is installed

karmic cedar
#

it’s really easy to over configure

#

and get suuuuper smooth video llol

#

It’s great for certain types of motion, but unlike dyna it can’t anticipate real world motions as well

teal pagoda
karmic cedar
teal pagoda
teal pagoda
#

But I really like the idea of torrents

split kestrel
#

Open sora

dusk canopy
split kestrel
#

Nutty

dusk canopy
#

just setup a discord bot

#

with stable diffusion api

teal pagoda
#

Did you ever think that we can make their own tools (especially the LLMs like chatgpt, claude3, copilot and gemini) to tell us how to leak these or to even leak inside information for us?

astral goblet
#

automatic111 vs forge though, they'll make identical images with identical settings

split kestrel
#

3 days…. For 2 seconds of video

timid island
#

haven't heard of forge things move so fast, is comfyui still the best ui?

astral goblet
#

you wont be making FHD videos with animatediff, not for a bit

split kestrel
#

Hmph

#

Lol

astral goblet
#

unless, you want those 2 day iterations

split kestrel
#

Well I need to figure it out first

#

I was upscaling a video 2x on a 4x upscale

high ruin
#

I am using Topaz Video AI with a RX 6600 XT fps is low if I upgrade to a 1080 TI would the fps in Topaz increase a decent amount??

charred mesa
split kestrel
#

I have lots to learn šŸ™‚

timid island
#

i am too impatient to make videos, even if i had the gpu for it

split kestrel
#

That’s why I want to build a workflow that kinda…. Steps through each process.

trail lion
#

about 4s is the max video I'm willing to make, it gets crazy with all the multi-pass img2img, but it can be fun if you get tired of looking at static photos

split kestrel
#

The end goal. Is a video - video workflow

timid island
#

i mean i wouldn't mind doing each step, it is just the total amount of time to render it

astral goblet
#

hard to do that with one graph. pre work is required pretty heavy on videos to get good results. just hucking prompts at a video without hand crafting the guidance to the specific situation, gonna be messy.

split kestrel
#

Source - control net is probably one flow

astral goblet
#

you'd want to turn the video into a few different remaps. depth, openpose, canny, segments, you name it. theres lots of approaches

trail lion
#

I do things by hand, I dont like any of the scripted solutions so far

astral goblet
#

turning that battletoad video into sonic was a fun experiment to do, but thats with minimal amount of effort and it shows

split kestrel
#

8 minutes of generated story line… is a lot that I bit off

astral goblet
trail lion
split kestrel
#

Not in one sitting lol

astral goblet
split kestrel
#

Have it

#

Just need to bring it to reality

#

But I need to work on some 5-10 second clips at a batch

astral goblet
split kestrel
#

Looking

astral goblet
#

if a 10second clip is a slow pan over a scene, consider generating one pic and panning over it

#

a lot of the high production value ai videos are a LOT of editing room efforts

split kestrel
#

The video I’m working on is essentially getting abducted by Alien space craft.

low moon
#

Old Westerns would be a good first AI feature film candidate to make. slow establishing shoots and pan shots, large vuistes minimal sets, closeups of intense faces. quick swift action, little talk...

astral goblet
low moon
#

hahaha

astral goblet
#

could do it to the wild wild west song

trail lion
#

hah, I blocked video for a while on reddit I was so sick of seeing smith

foggy halo
#

sd3 release when lol

timid island
#

what resolution is sd3 even trained on?

low moon
#

4096 4096

timid island
#

... many years until i'll be using sd3 then lol

astral goblet
#

lol i'm pretty sure it's a 1024x model too

karmic cedar
#

i’m asking sleepy questions

#

for some reason i was still thinking about sora

#

probably because of sora

low moon
#

For all its greatness sora is still slow motion

#

i have yet to see actual normal time videos

astral goblet
karmic cedar
#

those are probably part of the turbo model that they’re keeping for hollywood producers

split kestrel
#

Seems like render hell

astral goblet
#

i mean, slow motion is easily changed right? you'd just speed up the video at a higher frame rate, or drop some... no biggy

trail lion
split kestrel
#

Right

trail lion
#

50 photos, 50 frames, it's all the same

split kestrel
#

I can’t think of this in terms of video

#

I would want to render 60fps / at 10 seconds

#

Even tho 5 seconds of footage is almost too long in video edit world

timid island
#

why 60fps? 30 for video looks good

split kestrel
#

Post production workflow

timid island
#

i don't know why spend the time making it 60 though

split kestrel
#

I’m gonna drag all this stuff into Davinci when I’m done

timid island
#

all the movies you watch aren't above 30

split kestrel
#

Not true.

#

šŸ˜‰

#

All skydive footage I capture at a min of 120 - 240

trail lion
#

I certainly dont need above that, because the quality of the current tools doesnt justify it

astral goblet
#

the model only knows 8fps video clips. so you render at 8fps then interpolate to get to 60. i set mine to make the final file a 60fps setting and interpolate 8 -10 times.

animate diff just generates the frames. ffmpeg stitches them into a file with the fps setting.

timid island
#

movies, like the avengers, not you jumping out of a plane lol and skydiving makes sense because it is fast moving, can slow it down

split kestrel
#

Final render is usually 24fps / 30fps

#

But the initial capture you want more frames to scrub and drop.

astral goblet
split kestrel
#

Speed ramp

#

Those are 60

astral goblet
#

i often run interpolation algorithms on my movies at home too cause i love crispy smooth frames. i just got fast eyes and it looks better to me

teal pagoda
timid island
astral goblet
#

i think the rerelease of avatar1 was 45fps too

timid island
#

45 seems like an odd number

trail lion
#

wonder what dune2 was, that was very nice in imax

astral goblet
#

so is 24 really

astral goblet
split kestrel
#

24 / 48

#

Avatar was printed on imax film stock

#

Sony HDC-F950 to be fully spec

astral goblet
#

the only reason that movies haven't gone full high frame rate across the board is becauuse studios can still save a lot of money using lower frames and people seem to be fine with it

#

higher frame rates just look better though. objectively

split kestrel
#

That and the industry huffs at over 24fps

lavish lake
astral goblet
#

saving money is the only reason to

split kestrel
#

You always want to film at double the resolution of your final edit since you can’t always go back and re-shoot

#

I’m treating this…. Like film

#

And I don’t know that I should lol

trail lion
#

so from my perspective, if I come across frames that are bad, I just go back and re-img2img those

astral goblet
#

pretty soon cameras are going to have entirely electronic shutters too. solid state shutters are gonna change everything

trail lion
#

seems better than doing 2x or 3x the renders just to throw them away

split kestrel
#

Mine has no shutter

#

I think it’s a 16bit 4k, which would make stupid good video to drop in for models

astral goblet
#

sony i think, just released a photo boy, not a videoone , that has a crazy electronic shutter. no rolling. all the pixels come on and off at the same time

#

sexy

split kestrel
#

I may jump with redbull, but I don’t get redbull money lol

#

Anyways. Depth gen looks good

#

I imagine there is a specific size this video should be in order to pass correctly, or does that not matter.

astral goblet
#

input video should be the same size as the final render imo

split kestrel
#

Drifting over a YouTubers workflow and it looks like hell lol

static schooner
#

This week i am cursed with SDXL

split kestrel
#

In what way?

static schooner
#

Cursed images

split kestrel
#

Could be enjoying the nightmare that is video

#

lol

#

Can’t wait for my epic stuff to finally exist

astral goblet
karmic cedar
#

pretty sure Mr. SECourses guy asked me a question under a different username in the comments of that video

#

cloud question—should I stick with runpod or should I switch up to something else? I occasionally do video workflows, so having access to high RAM environments is nice

astral goblet
#

then remake a video and publish it as if he's the one providing all the value. yeah he's been called out on that a few times and is very likely using alts now

karmic cedar
#

lol

astral goblet
#

how did you imported lol oh i see what you did there

karmic cedar
#

mmmhmm

astral goblet
floral nimbus
#

i have a question, when it comes to loras is there any extension or plugin that allows sorting? it would be amazing if i have a character,pose,background etc for loras that i can move and keep orginized

still glacier
#

For exemple,I go :
|- SD15
|--- Artists
|--- Character_Anime
|--- Character_Realistic
|--- Facial_Expressions
|--- LCM
|--- Location_Background
|--- etc
|- SDXL
|---- etc

karmic cedar
#

we need a webui in the form of an unreal engine-powered supermall where makers have their own shops, etc.

#

because image diffusion is basically just a mall of options

floral nimbus
karmic cedar
#

ur butt’s a subfolder

#

…alas i do not know 😦

floral nimbus
#

😦

nova zodiac
#

the subfolders dont create the code to make new tabs sadly

floral nimbus
#

So it only helps downloading and sorting symlinks

astral goblet
#

skeu-spatiomorphism?

floral nimbus
karmic cedar
still glacier
astral goblet
karmic cedar
#

^ any comfy workflow that involves 30+ modules

floral nimbus
#

i just wish there was a extension that would allow folders inside the lora bar or just make new tabs in general

dusk canopy
#

bro my lora is pulling a gemini moment

#

every time i generate a couple the guy is black

still glacier
#

yeah swarm is a nice solution too to manage many models, loras, etc

dusk canopy
#

and the women is white

#

šŸ’€

#

wtf

#

nvm fixed it

floral nimbus
#

skorchekd reaction to that ^

#

bruh i did not know SDXXXL takes like 18 gb vram

dusk canopy
#

the lora is racist

#

against white peopke

#

šŸ’€

karmic cedar
#

or maybe it’s a reflection of the datasets it was trained on

#

ever think of that?

untold herald
#

Does anyone know the size of the dictionary the tokenizer SD3 use? Or what is its native prompt max_tokens without any chunk-breaking tricks?

floral nimbus
#

is sd3 even out?!?

untold herald
#

Some people who have access to it or people who have connections to StabilityAI could reply.

karmic cedar
#

I get the feeling those folks are all living in their own bubble right now.

sand flax
#

@floral nimbus Hey i have a question to ask you in private, also SD3 isn't for free use I believe until 3 months later

broken cave
#

i am doubtful the largest SD3 model is going to be released

#

i think it's kind of donezo

#

they may release the small and medium models for free use, and possibly without the better conditioning. i am not sure how the aesthetics will compare.

#

it doesn't take a genius to see that Emad departed over specifically the fully open release of this upcoming model. it is viewed as a valuable asset, but on the other hand, bing image creator is completely free, so i don't know how valuable it is in reality.

karmic cedar
#

that’s how on the line everything is.

split kestrel
#

I know this goes without saying… but I’ll say it. I appreciate the hell out of the community of artists who do this stuff. Those that keep it open source is an extra high five, I wanna buy you a cup of coffee. I know there needs to be some housekeeping and such for all the reasons…. But The tools that exist to allow us to bring nightmare fuel to life….. well…. That’s something.

Cary on.

hasty hornet
opal hedge
#

Let's save the doom and gloom for when there's a good reason

gusty oriole
opal hedge
pearl ocean
young wigeon
hasty hornet
#

other AI tts thingies are pretty far away from 11labs, unfortunately

versed night
#

hello

rotund remnant
#

any news for sd3, when?

shell tendon
pearl ocean
lapis dirge
#

good afternoon, how to generate?

low moon
#

Light a candle and clap your hand 5 times then inhale and do 2 carthweels.

young wigeon
#

what's this?

past marsh
#

how to ues?

heady steppe
#

guys, i just saw https://arcads.ai
it's a service to create AI video marketing.
do you have a recommendation opensource tool as alternative?

#

i have RTX 4080, currently running A1111

pale crow
#

Hi folks, Newbie here. Anybody use the Stability AI module in Make.com?

amber bloom
ancient marlin
#

Hello guys

#

I'm new here

frosty swallow
#

hi friends, can I pm anyone regarding image generation? I have png info but I can't recreate the photo for some reason. Plz help ty!

opal hedge
spiral mirage
#

Hi friends, I am new on this server.
I have been playing with stable diffusion 2.1 last year and I really liked it.
Now I am very excited to try out stable diffusion 3 and I just enrolled in the waiting list.
Did any of you already have access ? Or are people going to have access when the model is launched, if so, when would be the launch date in your opinion?

dry trellis
charred mesa
#

so in the BEST case, the end of April

opal hedge
charred mesa
#

yeah I was naive enough to think it'd be mid or early april

#

lmao

#

now its more like End April-Mid May

#

if not later

opal hedge
#

Yeah, I wish they'd just release SD3 and call it a day

#

It'll suck on release anyway lol

charred mesa
#

it seems they really have a lot of work left to do

#

controlnets, optimizations, final training pass with DPO and RLHF, etc

opal hedge
#

If they can get control nets to work that'd be awesome

charred mesa
#

and I thought it was really close cause the pics from Lykon and Comfy looked amazing for a base model

opal hedge
#

The thing that makes it amazing in my eyes is its token limit and prompt adherance

charred mesa
#

exactly

#

512 token limit sounds awesome

#

its plenty

opal hedge
#

The fingers and hands are still a bit messy though

charred mesa
#

I bet in the future we're gonna be crying about ONLY having 512 tokens thomas

potent spire
#

someone said it wil lwork with 8 gb VRAM

charred mesa
opal hedge
potent spire
#

i dont believe it tbh lol

charred mesa
#

I saw a stability staff write on reddit that Comfy is targetting 8GB

#

but im not so sure about that goal

opal hedge
charred mesa
#

yeah that sounds like the 2B

charred mesa
opal hedge
#

And then there'll be the gigachad version everyone will be training on that's probably 12-16GB vram

potent spire
#

the competition will be interesting in the future

#

DALL-E 3 is supposedly getting inpaint as well apparently

charred mesa
#

I wish that the 8B would work with 12GB + highresfix

#

but I just don't know about the 8B MMDiT weight being able to fit on only 12GB, even at fp16

opal hedge
charred mesa
charred mesa
#

the rest of us get a massive slowdown

charred mesa
#

SD3

potent spire
#

i meant DALL-E

charred mesa
#

I meant SD3

#

it's in the paper

potent spire
#

SD3 will very likely get those addons as well xD

charred mesa
#

yeah I hope so

#

I just hope it won't be separate models cause these 8B model will be MASSIVE in file size

potent spire
#

i wont be able to play with those anyway except its implemented in Alpaca plugin for Photoshop

#

i dont feel like paying for another software right now tho...at least not generative AI one

opal hedge
#

It'll probably just be dreamshaper, Pony, and maybe Zavychroma or another top-tier checkpoint that comes out

charred mesa
#

oooh I can only IMAGINE those models

#

I hope the massive finetunes like DreamShaper and etc will give us expressions and actions with detailed dataset captions

opal hedge
#

God I hope so too

#

One of the good points of 8B is it'll probably have most expressions and actions in-built already

#

Rather than fixing the model, hopefully fine-tunes will just be guiding it towards a certain direction

charred mesa
#

eh idk

#

I hope the same but I'm not sure about actions and expressions

fervent thunder
#

chili peppered tronisanator

fervent thunder
#

japanese egg water

#

cheddar cheese dogs

#

a muffin momma

still glacier
#

a turned off bot

balmy pecan
#

Hi everyone, i am trying to use the best resolution for controlnet, for my image2image. in A1111, the resolution is in multiples of 8, while in comfyui, it is in multiples of 64.

is there a node for me to use controlnet in multiples of 8? how does controlnet actually work?

I will prefer to use multiples of 8 as i can get a depthmap to match my original img. the shape of my original image is very important and i want to try not to have it off by even 1 pixel.

gilded wedge
#

Hi guys, there used to be a channel specifically for artists who use stable diffusion, is that still a channel? I can’t find it

potent spire
still glacier
#

it depends of what you want exactly

distant swift
#

though I wonder what use will T5 have when doing image conditioning, as T5 can only do conditioning on text

trail lion
#

dont ever leave a '>' off the lora tag in a prompt, caused all kinds of chaos before I found it

karmic cedar
#

how have the community generations been going for SD3? has anyone discussed trends in what they’re seeing?

oblique jay
frigid escarp
#

Anyone familiar with getting SD up and running inside something like GIMP with inpainting? I see several plugins that claim to do this, wondering if there’s a leader in that space

lean junco
#

Hi all, can you tell me who is using what? (Free) and not using PC power.

simple comet
#

Do you remember of Stable Diffusion ?

tranquil stump
#

There are also German users here

karmic cedar
#

it’s all using power

#

lol

frigid escarp
#

I tried to find a wind powered version but doesn’t look like it’s there yet

karmic cedar
#

i tried to find one powered by zero point energy but apparently i need moon helium for that soooooo i guess i have to build a space railway now

arctic sedge
charred mesa
#

comfy is targetting 8GB but I have my doubts

#

I'd be stoked if it runs on 12GB even

arctic sedge
charred mesa
#

I have trust in 8B, not much in 2B

karmic cedar
#

tiling magic>

#

longer processing times for sure

charred mesa
#

well tiling for VAE makes sense

#

but idk about tiling for generation

karmic cedar
#

for as much as possible from a data perspective

arctic sedge
karmic cedar
#

if you’re working with those limitations

charred mesa
#

it's either a massive 8B model, or something that's smaller than SDXL (and another one that's smaller than 1.X)

#

but yeah

distant swift
karmic cedar
#

AI is like a giant whale at the bottom of the ocean right now—the little fish (us) are sweeping in to have our share, and as much of it as we can glean. But pretty soon our shares will be shrink-wrapped and come from stores. šŸ™‚

charred mesa
#

I just want to know how much VRAM the 8B MMDiT model will take eventually

#

like alone on itself, cause we know that T5 will be loaded separately

karmic cedar
#

speed (processing time) versus quality (data heterogeneity) is going to become the driving balance for all this stuff, isn’t it

#

as it economically evolves more

distant swift
karmic cedar
#

but you’re not wrong!

cobalt crag
#

yo

#

I want to report someone from this server plz?

karmic cedar
#

whoa you’re dedicated

#

šŸ™‚

cobalt crag
#

Justice shall be done

trail lion
#

cant you just right-click their post and hit report?

karmic cedar
#

^

trail lion
#

I'm also fine with group therapy, lol

karmic cedar
#

it’s super effective, take it from me.

trail lion
#

"...and how did that make you feel?"

#

maybe lie down on the couch

karmic cedar
#

(they don’t invest in couches)

trail lion
#

kevin pollack has a great stand-up routine where he's talking about a football ref needing to unload, and just blowing his whistle and talking to the audience

grizzled zealot
#

Is it normal for sdxl lora training to take over 3 days to finish?

#

Got an rtx 3090 and it's 250 images between 1024x1024 and vertical or horizontal of 1024x1536

#

I recall training this on 1.5 taking only about 6 hours or so.

honest mica
young wigeon
oblique jay
trail lion
nova pilot
#

can anyone recommend a good place to download some better trained models? I know of huggingface, but... it's not the friendliest to use

trail lion
#

so if you're doing 100 repeats and 10 epochs, the number of steps goes up dramatically

#

like add a few zeros šŸ˜‰

#

repeats should be small though, control the steps with epochs (full batches) vs repeats, unless you are doing multiple concept training or reg images, in which case the repeats helps balance out the training

astral goblet
# loud solar Civit.ai

just, be careful of all the smut there. some of it is shocking if you don't know what you're getting yourself into.

loud solar
#

I only take clean stuff šŸ™‚

teal pagoda
#

anyone tried ways of monetizing the AI art? Just out of curiosity. You don't have to tell the methods. Only "yes" and if it worked.

#

I wasted my time with etsy

#

šŸ™‚

molten sage
#

Hii! Do you guys know any node in comfyUI that receives an image and, based on the size of that image, it outputs the closest width an height recommended for SD 1.5?

I found one that works for SDXL it's called "NearestSDXLResolution", part of the "ComfyMath" node's pack, but the author didn't include an SD 1.5 option and he seems to no longer work on the project 🄲

heavy lark
#

I'll paste a picture in the other general with images channel

crude notch
#

anything mod 8 works

trail lion
#

recomended size is 512x512

crude notch
#

512x512 is train size, most models are trained on 768x768 max

#

sd1.5 breaks when above 768x

molten sage
#

sorry but what mod 8 means?

crude notch
#

modulo 8, aka anything that can be divided by 8

#

this should be forced in comfy

molten sage
#

@trail lion @crude notch gotcha! Thanks for the info :Ā­D

crude notch
#

np ;3

wary belfry
# arctic sedge Xformers?

xformers are not entirely lossless. Even generating the same image with the same seed gives a tiny bit of variation. I had one case where the character sometimes had closed eyes and some time closed ones.

charred mesa
#

^

arctic sedge
wary belfry
arctic sedge
wary belfry
#

however, it also could be a janky implementation of it in Auto1111

trail lion
#

it's widely used because it's so good with memory, but only works on nvidia, so meh

astral goblet
#

gonna try running this "aniportraits" code today. letts seee what happpens

mortal delta
#

Is there a simple way to create game assets like objects, people, and such kinda like spritesheets with ai, i use comfui so there is that....

ive possibly asked a similar question?

astral goblet
#

simple? nope. reliable? not really.

mortal delta
astral goblet
#

I think you'd do well with these new img to 3d models coming out. Make a simple 3d character, take it into blender, fix it all up and rig it, generate a sprite sheet animation from that, then fix it again

#

push button game development isn't here. you still need so much passion for your project that the legwork is beautiful to you.

mortal delta
#

woudent that only work for 3d stuff like how could i also achieve 2d? im fine with 3d but ive been wondering manly about 2d.

astral goblet
#

3d models can render into 2d images. You can then paint over those 2d images to make them pixel art if you want

#

the 3d model in this case would be more like a rough scafolding for your final product. accelerating your content creation still

mortal delta
#

I see, well thank you wise person for letting me know this info.
bad thing is i run stable diffusion on cpu so im not sure what will happen or how longer things will take for 3d.

trail lion
#

yup, you'll eventually probably need better local hardware, or start leveraging cloud compute

mortal delta
#

I do have a gpu it just doenst run ai ive tried everything to get around this but nothing works, but ill just use cpu for now intill i can afford/get better hardware.

astral goblet
#

amd isn't it

#

its okay. you can tell us. this is a safe space

mortal delta
# astral goblet amd isn't it

yeah im that dude with the amd 480, 8vram, im surprised you all remembered, but it basically charshes my pc when running ai like suddley my screen just turns black and i have to reboot my pc. its so annoying.

astral goblet
#

think i encouraged you to try linux last time but i fully understand why you're not into that. it's a steep learning hike

mortal delta
#

oh i forgot to try linux, im so sorry i forgot.

astral goblet
#

my vega64 8gb would do diffusion images in a minute . that was in october 2022

#

sdp wasn't out yet then

mortal delta
#

is there a linux os you would suggest for duel booting? by chance.

astral goblet
#

manjaro and garuda are the two downstream arch distros i was riding for a year. i've heard good things about linux mint, downstream from ubuntu

#

couple years actually

distant swift
# arctic sedge Xformers?

no. there was an optimization NVIDIA made a while ago that wrote a model in an "engine" format, which has the same precision as the original PyTorch checkpoint, except it runs much faster than the original checkpoint

astral goblet
#

tensor RT models. you can only use them for one specific resolution iirc

mortal delta
astral goblet
#

the distros i used were easy to install. arch is a rolling release unlike others

mortal delta
astral goblet
#

i still use steam OS if you count that as arch

distant swift
astral goblet
mortal delta
#

so thanks for reminding me about linux and such.

astral goblet
distant swift
astral goblet
#

sometimes you just gotta flex your inner dunst you know?

charred mesa
#

God...
SD3 for Idegram/DALLE3 will be the SD1.4 for DALLE2

#

I get to experience a massive leap in AI once again

distant swift
#

well, nevertheless, it was still a way to optimize inference without having any impact on the model's outputs, so I guess if quantization won't be a thing with SD3, that's the direction we'll be heading

#

or just keep SD1.5 alive as we saw happen many times before

icy quest
#

im trying to upscale my pic, but it keeps tiling it

#

dam piddy got swatted

cosmic swallow
#

Hello, is it no longer possible to create images here?

broken cave
#

what kind of game are you trying to make?

distant swift
unreal marten
#

hello all! im having a hard time stomping a bug. im run stable video diffusion locally and for some reason it works with some images and not others. i get an unboundlocal error about the input image. I've checked everything i can find and can't figure out why this image won't work

small stream
#

Hey everyone.

trail lion
#

also if you're tossing external images in there, know SD likes certain resolutions

unreal marten
#

i tried cropping the images. ironically the images im having problems with are also being generated locally from stable diffusion xl

gaunt pulsar
#

the tech seems to still be far from that

trail lion
toxic elbow
#

Who has some good tips on prompting in Stable Diffusion 3? I feel like it's harder to get a specific style than previous versions.

trail lion
#

nobody has it yet, so you're not going to get much help

toxic elbow
#

ah okay. i got access to the Stable Assistant today, it's where i'm using it. thought more people have gotten it.

gaunt pulsar
tawny quail
#

Hello, this is a random question for any experts on training ai models. Do you know if it is possible to build a model using videos as training data (rather than stills). I’m not talking about video to video, but rather the training process. Thanks

opal hedge
coral mist
#

Wesh y a des gens connectƩ ?

rapid frost
#

does anyone here use hugging face for creating SD images?

#

I'm trying to figure out a way to create images using FOSS that doesn't use my old mac

tall musk
#

I use Diffusers some of the time, on Apple Silicon and Colab on occasion.

fervent thunder
#

Hi

rapid frost
fervent thunder
#

Just made a bot, trained my own model to work with nsfw, not using stability tho,

tall musk
#

Yes on my Mac and sometimes using Google's Colab system

fervent thunder
#

made a discord bot, that will generate images on prompt

rapid frost
rapid frost
tall musk
#

It can do but pytorch support for the non Apple GPU's is a bit spotty and deprecated, so make that it might do, you would have to try it

fervent thunder
#

just dm me,

rapid frost
rapid frost
arctic sedge
#

What model architecture do you think Ideogram is using?
It looks pretty close to what i would call a custom finetune of SD3. Some of the outputs with the same prompts from SD3 look really close to each other in composition.

charred mesa
#

Sometimes I think that's it performs like a ~2B, but other times it looks like as capable as ~8B

#

idk

arctic sedge
#

@charred mesa Ikr!? Wtf. It also appeared around the time after SD3 was announced. (I think. Correct me if i'm wrong)
It makes images so close to SD3. Idk what is going on there.

charred mesa
#

Idk about the architecture behind it, it could be DiT, or just UNet with a heavily captioned dataset, no idea...

rugged mirage
#

are there any live coding/working with SD streams which people can recommend?

keen rose
#

how should I go about trying to use 2 character loras in one image without them blending into each other? like is there a way to tell the program where to use which features?

rugged mirage
#

you can try using them one at a time via in-painting

grizzled zealot
trail lion
rustic sigil
#

Hi! šŸ‘‹ I just joined šŸ™‚

pearl ocean
#

S D 3

narrow kernel
#

Either that or you're doing training at 32 bit instead of 16 or bfloat16

#

Or your batch size is too high

tall aspen
#

so

crude notch
trail lion
#

not likely in txt2img. could try regional prompt, but regardless you'll be fixing it in img2img

regal glacier
#

Yooo
I need to convert an image to image with a line art work,
tutorial says to use control net
then I tried to configure the parameter as what tutorial told me, but the results definitely looks wrong
not even shapes as the pose at all

shell tendon
forest trout
#

I mean I'm chums with some of the guys who work at stability.ai and I never got an invite. Just sayin...

shell tendon
#

Yeah who knows

#

I signed up in hocus

#

Hours

#

Fully expect to not get an invite until June the way this has been going

regal glacier
shell tendon
#

Probably but I wouldn't know

#

I do pretty much everything in comfyui

#

Getting the output you mentioned is pretty trivial in there

#

I don't use a1111 for anything except lycoris-ia3

sand flax
regal glacier
sand flax
regal glacier
tall musk
# rapid frost Can you give me your set up? I might just get a used M1 mac and try to make it w...

Sure, I have a GitHub Repo for both the mac stuff, although targeted at a 8Gb M1 ( fine for SD1.5 and SD 2.x, not good for SDXL due to swap usage but works) and Colab.
Not I have a 24Gb M3 I use the Colab scripts with a find and replace of 'cuda' replaced with 'mps' you'll be better of starting there as the 8G M1 scripts have aged badly and need the fp16 madebyollin vae adding to replace the default vae.

https://github.com/Vargol/StableDiffusionColabs
https://github.com/Vargol/8GB_M1_Diffusers_Scripts

My Setup is basically , install python 3.10 or 3.11 from macports
create a venv somethere

cd my_directory
python3 -m venv Diffusers
cd Diffusers
. bin/activate
pip install diffusers accelerate transformers

That should get you good to go

If you close Terminal, you'll need to reactivate the venv

cd my_directory/Diffusers
. bin/activate
#

Starter SDXL diffusers script

import random
import sys
import torch
import gc
from diffusers import DiffusionPipeline, AutoencoderKL

prompt = "A film still of a close up of A red haired woman standing in a lush green jungle"
negative_prompt = "painting, drawing, illustration, glitch, deformed, mutated, cross-eyed, ugly, disfigured"


use_refiner = False

vae = AutoencoderKL.from_pretrained("madebyollin/sdxl-vae-fp16-fix",
                                    torch_dtype=torch.float16,
                                    force_upcast=False).to('mps')

pipe = DiffusionPipeline.from_pretrained(
      "stabilityai/stable-diffusion-xl-base-1.0",
      torch_dtype=torch.float16,
      use_safetensors=True,
      variant="fp16",
      vae=vae
      )
pipe.to('mps')

#pipe.enable_sequential_cpu_offload()

pipe.enable_vae_tiling()

seed = random.randint(0, sys.maxsize)

images = pipe(
  prompt = prompt,
  output_type = "latent" if use_refiner else "pil",
  generator = torch.Generator("mps").manual_seed(seed),
  num_inference_steps=30
  ).images

if use_refiner:
  pipe = None
  refiner = None
  gc.collect()
  torch.mps.empty_cache()

  refiner = DiffusionPipeline.from_pretrained(
      "stabilityai/stable-diffusion-xl-refiner-1.0",
      vae=vae,
      torch_dtype=torch.float16,
      use_safetensors=True,
      variant="fp16",
  ).to('mps')

  refiner.enable_vae_tiling()

  images = refiner(
      prompt = prompt,
      image = images,
      ).images

images[0].save('sdxl.png')

Hopefully your card supports fp16, if not change float16 to float32

opal hedge
#

Another day another no SD3 invite

zenith prawn
#

Does steam now allows stable diffusion art ? How can we prove it's not using any copyrighted content ?

sand flax
opal hedge
#

Does anyone know how to lower vram usage with ultimate SD upscale?

#

I'm using comfyui if that makes a difference

fallow wren
#

hi, I want to learn stable diffusion deeply , not how to use it but know the Principles and underlying怂Where to start and Advanced怂thank you.

sinful island
#

I want to build a UI where people can enter a prompt to generate images using the stability API. Do I need to apply content moderation to the prompts before sending them to the API? it says in the TOS that moderation is already applied to the prompt but do I need to implement an additional layer of moderation?

gaunt pulsar
#

I think the endgame of Stable Diffusion once it can always make great pictures without mistakes could be being incorporated into games for full customization. The dev of a VN could make it so you can make a prompt for the main character to be literally whatever you want for example.

vestal hatch
#

Is anyone able to help? I've installed stable cascade, however when running a prompt is gives me an error "AttributeError: module diffusers has no attribute StableCascadeUNet" I'm usuing A1111

loud solar
vestal hatch
loud solar
vestal hatch
loud solar
vestal hatch
loud solar
#

Sometimes a small update of one component just cranks up the system ...

buoyant marsh
#

hi

bleak matrix
#

Good morning, everyone!

#

How are we all today?

vale steeple
#

Hi

elder plaza
#

So, anyboy else as worried as I am about the rumors of Microsoft acquiring Stability?

trim magnet
#

its fake its gonna be tencent trust me (real)

elder plaza
#

Well, any major company acquiring Stability isn't good. But I know they need the cash. Real shame though.

amber bloom
#

I'm kinda taking Emad at his word that SD3 would be the company's last model. He clearly knew that the jig was up.

elder plaza
#

yup, think you're right.

amber bloom
#

I just hope the SD3 weights get released before Stability implodes

elder plaza
#

me too!

charred mesa
#

we will

#

the new CTO said 4-6 weeks ETA

#

Our plan is to soon release the API first to collect more human preference data and validate that our safety improvements don't cause the quality to suffer. Then we'll do some more fine-tuning (DPO/SFT) and release the weights and source code. Current ETA is 4-6 weeks.

#

CTO posted on March 25th

#

So it's gonna be fine

amber bloom
#

plans can change, especially if company is acquired and new leadership is installed with new strategies

#

it's not impossible that Stability is taken over by someone who plans to keep SD3 closed and insted run it as a midjourney-like service

charred mesa
#

ok I understand the pessimism elsewhere when it comes to SD3 but not this, they 100% will release the models šŸ™

#

They won't get bought out in the last second

#

maybe AFTER SD3, sure

amber bloom
#

I sure hope you're right! šŸ‘

#

I'm probably too pessimistic. Emad owns enough stock to block any takeover, at least for a while

heavy lark
#

Everyone has a price to sell out.

snow stone
#

I don't understand how i can generate picture, someone can help me please ?

twilit grove
#

heya, where can i download stable diffusion to generate some images?:)

twilit grove
#

tyty

waxen glen
#

i

spark pagoda
#

how do i use this?

#

i want to create images

amber bloom
spark pagoda
#

oh really? why not? did they close it?

honest spear
stable jacinth
#

Hello everyone I am looking for an application for TTS with French/English/Japanese AI voices, I have already looked for some applications but they are not perfect for example I would like a wide range of choices for the voice, I want good French voices. sorry I don't know if this is the right place to ask my question, if it is the wrong one I will post my message somewhere else.

north wigeon
stable jacinth
oblique jay
#

Is there a reason why SDXL Turbo is not available via API?

heavy lark
heavy lark
#

And it's fast. I get back 6 simultaneous requests in 15 seconds.

oblique jay
#

Interesting, that's good to know

#

Are there ETAs of the "Coming Soon" models? Not SD3 but other ones like Stable Audio? I don't utilize SD enough to do the monthly payment stuff.

heavy lark
#

Unknown. I've only looked at the image side of it

oblique jay
#

Gotcha.

#

Do they have announcements area as to when certain things will be released?

heavy lark
#

Not that I've seen, only ones that have been. Getting release dates out of them for anything seems impossible. They're not a big enough company for that.

fervent thunder
#

I dont know where the api is

#

Oh wait

#

SDXL

#

I read sd3 lol

heavy lark
fervent thunder
#

generate image of chewable thesis

#

why isn’t anything happening?

#

wtf

#

uuugh

oblique jay
#

Is there any substantial differences?

heavy lark
charred mesa
#

The more I look at Ideogram images the more they start to look like 2.1

#

there's something about them

oblique jay
#

I don't know if there needs to be something more specific than this šŸ˜…

#

I appreciate it. In your experience is there a strong difference between SDXL and Core?

heavy lark
oblique jay
#

Oh that's great!

#

What LLM prompt expander do you use?

heavy lark
#

Mixtral with a long instruction.

#

I've written various automation

oblique jay
heavy lark
#

The secret sauce is really the instruction.

#

Any of the models can handle it above a certain size. Mistral 7b will mostly do it, but it's usually too long for the 77 token context length that sdxl needs. Mixtral does it right and the big guys will easily do it

oblique jay
#

Do you use a quantized version of Mixtral?

#

Also do you mean that anything above 77 tokens would be too long, or do you need something that long or above to get a good response?

dusk canopy
#

Mixtral unquanted will take 80gb vramnto run

#

Just letting you know

#

Or 90

charred mesa
#

well there's gguf (plus quantization of course)

dusk canopy
#

That makes the data less

oblique jay
#

Yeah that's what I was curious about, cause unquantized was eighty six last time I saw with quantized 4 bit being 24 GB

dusk canopy
#

Quanted versions of mixtral can't make ascii

#

Whilst unquanted versions can

charred mesa
#

probably, yeah

#

no idea

dusk canopy
#

Yes

#

Do you know what quantisation is

#

It's basically removing some of the data to make it less

iron ingot
#

So how does this work?

dusk canopy
#

You can't Quant image models in the same sense

#

You can just lower the resolution of the images

oblique jay
#

But the fact about the ASCII art is interesting, I didn't know that

oblique jay
dusk canopy
#

It's like taking a sentence and removing a few words but making sure the rest of the sentence makes sense

#

Or taking. A detailed paragraph and shortening it

heavy lark
#

I'm using mixtral q8, so it's about 46 gigs.

oblique jay
#

I get what you're saying, I just don't think I view it in that fashion, since it doesn't automatically guarantee a loss of data. It just means that the parameters (weights and activations) just are less granular, which may or may not result in information lost.

heavy lark
#

Full size is 96 gigs

oblique jay
#

Or do you run something else

heavy lark
#

For llm, I'm using an m2 Mac so it's easy to load big models.

#

When the m3 comes out I'll get a really big one to run full size models

#

But you can run mistral 7b q8 on any 10gig nvidia card.

#

It's "good enough" just not ideal. You'll lose details if the prompts are too long.

#

Want my prompt instruction?

oblique jay
#

Sure I'd love to see them

oblique jay
heavy lark
#

I'll give you the short version that doesn't give it training examples.

#

Or I'll trim some out.

oblique jay
#

I'd like to run a lot of these things locally, but I unfortunately bought a 5700XT a few years back.

#

You're welcome to DM me it

#

In parts if it's too long

heavy lark
# oblique jay Sure I'd love to see them

Limiting your response to 50 words, act as a creative agent who generates a very terse but highly creative image prompt derived from the prompt I send you. Include descriptive visual elements of the subject, lighting and surroundings. Specify an artistic style or camera settings at the beginning of the sentence, using descriptive elements that pertain to this artistic style. Include no more than 10 elements presented as discrete descriptors in one long sentence without story. Put the most important descriptive elements at the beginning of the sentence. Here are 6 example prompts that should serve as a template for text to image prompts that I ask you to create.

Surrealist painting: Adorable puppies frolicking in a tempestuous sea of mewing kittens, surrounded by gargantuan, glistening ice cubes. Soft, warm lighting illuminates the fantastical scene, emphasizing the contrasting textures of fur and frost. Vivid colors swirl in a dreamlike atmosphere, capturing the playful energy of the impossible scenario.

Vibrant 3D Pixar style render, neon-lit forest, adorable squinting animals, oversized gummy sword, water balloon gun, exaggerated mock duel, hilarious facial expressions, dynamic action poses, volumetric lighting, depth of field.

Vibrant digital art, dynamic lighting: Elderly grandmother with mischievous grin piloting unique mecha suit made of large, colorful speakers, blasting blue sound waves at unsuspecting people, bustling cityscape background with mix of modern and vintage buildings, lively atmosphere.

Neon-lit microscopic view: Colorful anthropomorphic bacteria, viruses, and microbes dancing wildly on a glowing Petri dish dance floor, surrounded by pulsating organelles, with a DJ microbe spinning records on a DNA turntable, while microscope lasers create a dazzling light show overhead.

Please create an image prompt for:

crude notch
dusk canopy
#

Hm

oblique jay
heavy lark
#

I left some examples in. Those examples are created with Claude 3, so they're examples of perfect prompt instruction following for it to know how to make them

dusk canopy
#

Transformers architecture is not optimal at all

#

It's not efficient

#

Ai is constrained by this architecture I believe and other models

heavy lark
oblique jay
split kestrel
#

Vivid real life, 8k, 4k, a wolf, a wolf is alone in the woods surrounded by trees and moonlit rocks, the rocks have angry faces and are mad at the wolf, the wolf unfortunately lost at poker and did not have enough money to pay the rock people, the rock people are dystopian society of secret magicians who dont like wolves. It was a great night at the bar

heavy lark
oblique jay
#

What do you mean when you refer to single subject images, such as an image of a single person?

#

Or where there's a direct focus on a particular topic/subject?

split kestrel
#

One girl / one human / one thing

#

A wolf

#

A car

#

A train

#

Vs…. A pack of wolves, a crowd, lots of subjects

charred mesa
#

how

oblique jay
#

Oh one last question, do API credits expire after a certain amount of time?

heavy lark
heavy lark
autumn egret
#

Another day of radio silence about SD3 🤐

#

Sounds like a plan ...

crude notch
#

maybe even 4-5 bit

split kestrel
#

What if apple buys SD3 and includes it in ios18 under lock n key?

stone latch
oblique jay
#

Appreciate you answering!

split kestrel
#

Is anyone having great success with SDXL and animate?

#

Cuz I really…. Wanna use the SDXL stuff