#๐Ÿ†•๏ฝœsd3

1 messages ยท Page 81 of 1

bitter hearth
#

and cat walks

bitter hearth
#

they are working on next version of sd3

#

not sure if that can compete with flux

hallow lion
#

Luckily you can usually get away with a bit less than the minimum requirements... Especially with Comfyu..

bitter hearth
#

lot of sd3 team went to create flux

sacred jewel
bitter hearth
#

Me in the morning sadcat

bitter hearth
bitter hearth
hallow lion
#

SD3.5 will kick huge butt. It's always every other model that SD makes that is legendary.

sacred jewel
#

Whatever it is it's scary LOL ...

You will NOT believe the prompt: ๐Ÿ˜›

tony hawk using trains as skateboards

bitter hearth
#

Flux just seems better at anatomy idk

torn wharf
#

do you guys ever think about the drake equation and how our observations don't match the predicted expectations, giving rise to the fermi paradox? they say there is a great filter that is causing the fermi paradox but we don't know what that filter is. somethings going on cause the math that math says so

bitter hearth
torn wharf
#

you guys ever freak out about the math?

bitter hearth
#

I didn't like it in school agony

hallow lion
#

SD 1 meh SD 15 amazing SD2 ugh SDxl wow SD3 ugh and the next one will be outstanding

sacred jewel
#

Now look at this seed...

Literally

A Hawk
A Train
A Skateboard

Can't fault it... it is good ๐Ÿ˜„

torn wharf
#

i programmed my graphing calculator to play snake in school. maybe shoulda paid attention

bitter hearth
#

You won't believe it

#

Click now

frigid saffron
#

Hi, everyone, I have problems on flux

#

I find Flux is bad at multi-views for background
When I generate a landscape or city or background, I failed to control any viewpoint using like"from distant view/ bird's view/ from blew/top view/", Is this happening to all of you?

bitter hearth
#

algebra is fun but trigonometry gives me headaches

frigid saffron
#

For example, Prompt like: "It's a vibrant Japanese town with traditional Edo period architecture, with narrow streets lined with wooden buildings and tiled roofs. Blooming cherry blossoms against a blue sky add a touch of pink to the scene. At the center of the image is a tall, multi-story pagoda that looms large in the background, evoking a sense of historical grandeur. The image captures the serene atmosphere of spring, with falling flower petals scattered along the streets and large red lanterns hanging from the facades of buildings, making for a charming and picturesque cityscape. The whole town can be seen from a high altitude"

bitter hearth
#

I have a theory for why only the apple was weird
the steps and schedule you had
was giving steps that were too large during a range of the sigmas where the ODE was likely to have a large local discretization error
and the apple had less straight trajectories throughout that sigma range
because the relationship between trajectory straightness and sigmas range can vary based on the composition
notably the apple is a much simpler composition than the others
you could test my theory by running the workflow with ten times as many steps

frigid saffron
#

flux can't give me a high altitude

sacred jewel
bitter hearth
#

if my theory is right then running the workflow with ten times as many steps will fix the apple

#

because it was a case of local discretization error

frigid saffron
#

How you control the viewpoint using flux?

sacred jewel
craggy crest
bitter hearth
#

when it comes to anatomy flux has it's flaws too, i think the most aesthetic you can get with anatomy is with pony models

#

Cx

torn wharf
#

true nature of reality

sacred jewel
bitter hearth
#

nice big balls

bitter hearth
bitter hearth
craggy crest
bitter hearth
frigid saffron
craggy crest
sacred jewel
bitter hearth
#

not looking too, i dont like cat fights

craggy crest
#

@bitter hearth

bitter hearth
#

Now THIS is how you skateboard

frigid saffron
bitter hearth
sacred jewel
bitter hearth
#

you talked the same thing about sd3 as long as you could even tho everyone got deformed junk with sd3 and anatomy

brittle nexus
bitter hearth
#

flux however is much better but has its flaws

craggy crest
frigid saffron
bitter hearth
#

are you democrat, you talk like one

craggy crest
bitter hearth
#

missing logic most of the time, i wonder what kind of math you actually do

#

probably the kinda math that goes into sd3

craggy crest
bitter hearth
#

flux isn't as bad

#

flux can mess up some of the anatomy but not horribly bad

frigid saffron
bitter hearth
#

you atleat would have to cherry pick good ones with flux but with sd3 is a surge of mess one after the other

#

I tried a few times on flux with that prompt with the view from above at the start of the problem and it doesn't do anything

frigid saffron
#

Midjoruny is good at view control

#

but flux seems no concept

errant dust
sacred jewel
#

I broke it...

prompt: motion blur

errant dust
#

You are the one who resorted to this

frigid saffron
bitter hearth
#

that's the reason im not bothering to respond to him

#

not me you missed the line of convo then

errant dust
errant dust
bitter hearth
#

scroll up then and im not interested to talk about it, shut it pls

errant dust
#

I rest my case

bitter hearth
#

dont insinuate like a fool

errant dust
#

and you just keep burying yourself don't you

torn wharf
#

all these balls and you gonna tell me math isn't real? i unno man. i just dunno

bitter hearth
#

if you keep your mouth shut that wil help

#

I feel like in the long run the captioning for image model training data needs to be done by a VLM that was made specifically with that task in mind
cos a lot of image models are using something general like CogVLM for the captioning and that's not really ideal

#

or talk about things that are not nonsense

sacred jewel
errant dust
bitter hearth
#

ok time to block you two

craggy crest
#

he's a 12 year old child that's really not old enough to be on discord

errant dust
#

you do know that blocking anyone on Discord is an utter waste, right? You will instead get drowned with messages saying 'Message Blocked'

brittle nexus
torn wharf
bitter hearth
#

gees hounds

torn wharf
#

no no . deez balls

errant dust
#

@sterile pendant SOmeone posted the combined file Dev+Schnell in Civit, and I must say it is a total winner

sterile pendant
#

as you can see, you don't need the second model to load for the sigmas

bitter hearth
craggy crest
torn wharf
#

so i love that all these models can create really destructive scenes now a days. but i've noticed people and crowds are just business as usual. my proposal, put a traffic control person in the image then it makes sense

frigid saffron
craggy crest
bitter hearth
#

what that would do is specifically test the hypothesis that the issue is local error from going from continuous to discrete

frigid saffron
#

Why flux can't do this

bitter hearth
#

running a very high step count simulates having a perfect schedule at a lower step count
because it a very high step count gives you small steps everywhere

#

a perfect schedule at a lower step count would have small steps only when needed, and large otherwise

craggy crest
bitter hearth
#

I removed everything but from above and the temple lmao

craggy crest
frigid saffron
bitter hearth
#

when the step count goes high enough all schedulers trend towards the same output
as discretisation errors trend to zero

sterile pendant
bitter hearth
sacred jewel
craggy crest
bitter hearth
#

0 steps ๐Ÿค”

craggy crest
bitter hearth
#

haha yeah they would

#

if it's ever possible to use the flux algorithm / or some lora to fix hands/fingers with other sd models that would be fantastic

bitter hearth
sterile pendant
#

you need a LOT of data for hands

bitter hearth
#

Millions!

#

I did spend a while this week using Realvis Lightning
which is one of the strongest out of the low step models
its actually kind of okay

sterile pendant
#

probably in the tens of millions range, honestly

frigid saffron
bitter hearth
#

Then with flux... It's just a bit dumber than sd3

craggy crest
bitter hearth
#

Realvis Lightning at 4 steps is good enough for maybe 99% of AI images I have seen

torn wharf
craggy crest
bitter hearth
#

Even Godzilla stopped

#

That man is powerful

sterile pendant
#

think about all the locations on the screen hands can be, now think of all the various distances they can be from the camera, all the different ways they can be obscured by things like being in a pocket, then think of all the infinite combinations of positions/angles they can be in, all the potential combinations of individual finger positions, then all the possible textures and skin colors, the list goes on and on

torn wharf
#

i prompted Joe Biden

frigid saffron
bitter hearth
#

UniPC 10 steps with beta scheduler is also not bad most of the time

#

or maybe 12-15 steps for safety

craggy crest
craggy crest
bitter hearth
#

Boobawaow

craggy crest
#

12 year old kid

bitter hearth
#

at low steps PAG goes a long way

#

PAG is the most impressive node to me

#

a tiny drop of PAG has fixed images perfectly sometimes

sterile pendant
#

this should have the workflow to show it all

bitter hearth
#

makes sense cos discord normally does preserve the comfy data

craggy crest
sterile pendant
#

main takeaway is that you dont need to load the second model to get the sigmas when they are just a simple y=mx equation that ranges from 1.0 to 0

sterile pendant
bitter hearth
sterile pendant
#

it's basically tangental to self attention guidance

bitter hearth
#

perturbed attention guidance
you can just set it at 0.3-1 on every image TBH

errant dust
bitter hearth
#

or up it to 1-3 if needed

craggy crest
#

on the snarky subject of Flux not able to do hands

bitter hearth
#

SAG is also great, again at low strength like 0.2

brittle nexus
sterile pendant
#

i saw they got PAG working with SD3 on diffusers. doesn't quite work the same way, but they get some interesting results from it

bitter hearth
#

there was a very very recent comfy change
to make PAG and SAG work below 0.1
so it can be used with CFG++

craggy crest
bitter hearth
#

yeah diffusers gets a lot of stuff first

#

if you just put the PAG node in before ksampler
and set it to 1
that will do it

craggy crest
bitter hearth
#

no I haven't seen it

torn wharf
#

like no shit the crowd is calm. look at all those traffic control devices out there. just another day in nyc. they got it handled.

just a quick hack if anyone wants to use this for their images.

craggy crest
bitter hearth
#

ah I think I saw it
a ksampler

sterile pendant
# craggy crest okay, how do you use it?

if you're using it with sdxl, you just set pag to something like 3.0 and lower whatever CFG you were using by 3. basically, if you were using a cfg of 7, you'd lower it to 4. 4+3=7. you can also use the advanced version of the node and set rescale to like 0.5-0.7(personally, i do it and seem to get better results). there are some pros and cons to pag, but try it out and experiment

craggy crest
bitter hearth
#

lol

craggy crest
bitter hearth
#

I hate the blursadcat

#

Who invented blur on cameras agony

#

3 PAG is strong
that's not always bad
I mostly use more like 0.3

#

high PAG has a certain "look" to it, which you may or may not like

craggy crest
bitter hearth
#

this one was very high PAG for example

#

it can start to get pretty cooked

torn wharf
#

do you see how it immediately changes the narrative of the image from "why is everyone chill?" to "oh i see why they're all chill" it's just a little hack to remove distraction from your action images and make them more believeable.

bitter hearth
#

if I remember rightly that image had something else weird in it
either high CADS noise or FreeU with low B and high S

sterile pendant
bitter hearth
#

FreeU with high S was a major source of funny proportions

#

you could probably tonemap these a fair bit to remove the PAG+CFG burn

sterile pendant
#

oh they aren't burned, it's just hte prompt

#

point is, target cfg=pag+cfg. you can also throw in automaticcfg into the mix as well

torn wharf
#

you could totally pair this pag stuff with the TCP hack i described for ULTRA believeable images

sterile pendant
#

same settings, pag on the left, normal on the right

bitter hearth
#

I'm just obsessed with the low CFG look its a personal thing

#

a bit of ComfyUI_experiments/sampler_tonemap_rescalecfg.py can help

sterile pendant
#

if i want to make adjustments, i just do it in another DCC

bitter hearth
#

its in a node though

#

you just hook it up

sterile pendant
bitter hearth
#

ok but in that case
you would still want a rescale cfg node
just not the tonemapped version

#

rescale cfg does its operation each step

#

so it can't be replicated in photoshop

#

its just this:

2 # Apply regular classifier-free guidance.
3 cfg = neg + weight * (pos - neg)
4 # Calculate standard deviations.
5 std_pos = pos.std([1,2,3], keepdim=True)
6 std_cfg = cfg.std([1,2,3], keepdim=True)
7 # Apply guidance rescale with fused operations.
8 factor = std_pos / std_cfg
9 factor = rescale * factor + (1 - rescale)
10 return cfg * factor
#

so step 3 here is just normal cfg

sterile pendant
#

the middle one is pag+automaticcfg. it's my goto setup for sdxl

bitter hearth
#

photoshop can replicate changes made at later sigmas
like colours or textures or contrast level
but rescale cfg will also affect early sigmas that decide the layout

#

this lora doesn't seem to make any big difference from what the flux model can already deliver

sterile pendant
#

i usually use the advanced version of the node though and tweak some shit

bitter hearth
#

automatic cfg has some rescaling in anyway if I remember rightly

sterile pendant
#

it scales everything per setp

#

to ensure good ranges

#

but your cfg is no longer your cfg basically

bitter hearth
#

the reason I started using thresholding/rescaling is that for some reason the DPM++ paper mentions it a lot
and says it really helps

sterile pendant
#

the dynamic threshold node is a good alternative to automatic cfg as well, but they basically do similar things

bitter hearth
#

yeah its a similar idea

sterile pendant
bitter hearth
#

he wrote a new one BTW
this week

#

skimmed_CFG

sterile pendant
#

ill have to check it out

#

oh right

#

i told you about it lol

bitter hearth
#

lol ah ok thanks

#

I get quite a lot of things from you like the sigma graphs

sterile pendant
#

havent tried it out yet though, looks cool for stylized stuff

bitter hearth
#

and I got FreeU from crystalwizard

#

I haven't checked if skimmed_cfg works with CFG++

#

yeah its a shame cos I love FreeU

sterile pendant
#

well i was just saying how they added PAG to SD3 on the diffusers repo, so im sure someone can make a comfy node for it

bitter hearth
#

I can get FreeU effects a bit using CADS
for example this was SD3 with high CADS
and it looks a bit like FreeU with high S1 S2

sterile pendant
#

dont know how well SAG would work with sd3 though

bitter hearth
#

if I remember rightly SAG is basically blurring part of the image and then sending that to the unconditional
should be possible in any model

#

not flux though

sterile pendant
#

oh i know, but with transformers, you can also do some similar shit

#

they are layered and blocked as well

bitter hearth
#

there's a new paper also that is meant to be an upgrade from PAG

sterile pendant
#

ah yeah, i saw that one the other day but didnt have time to read it

bitter hearth
#

and this one is meant to be an upgrade from CADS
https://arxiv.org/abs/2407.11555

torn wharf
#

if thats what they want i mean who are we. who are we really?

sterile pendant
#

flux is also really good for img2img like if you denoise at like 80-90%. you can sometimes correct f-ups from sd3 with it

#

no, wouldnt do that

#

not losing the 16 ch vae

#

you could do a really convoluted sd3->flux->sdxl refine->resample at like 30-40% with flux again though

bitter hearth
#

on reddit someone did this order:
flux -> sd3

sterile pendant
#

to gain back the benefits of the 16ch vae

#

that would work as well

#

sd3 is hella good at details

fleet meteor
#

Do anyone have a workflow for flux + a lora loader?

bitter hearth
#

if you VAE encode and decode a photo
SD3 VAE is better than flux VAE

sterile pendant
#

flux or sd3 depending on what youre doing

#

if it's humans, flux

#

if it's art, sd3

bitter hearth
#

best VAE last

sterile pendant
#

both flux and sd3 have amazing vaes

bitter hearth
#

yeah

#

its probably situational

sterile pendant
#

but the last pass would be a 30-40% resample to fix whatever got broken in an sdxl refinement stage dropping down to 4ch vae

fleet meteor
bitter hearth
#

also if you have too much noise injections then SD3 3B can't handle it
so for noise injection workflow it has to be flux

sterile pendant
#

i was messing around with a 4step schnell->sd3 workflow earlier and getting some really good results, but then again, i don't make people that often

bitter hearth
#

SD3 is still my favourite TBH I have gone off flux
this is SD3 2B

bitter hearth
#

SD3 2B (not even the current 8B) is the closest to a photo

#

8B will win eventually though after fine tuning

narrow river
sterile pendant
#

I know I do, but I always add that shit in post if I want it

craggy crest
#

wrong color hoodie

fleet meteor
bitter hearth
#

don't push your narrative on others, everyone can try things the way they like .. you narcisstic freak

torn wharf
#

You dont want to use any of these models for text??

viral grove
#

/Tide play, Green theme, Doll personality traits and accessories according to MBTI personality, Make an INFJtrendy toy plush toy, Cartoon face made of silicone, The body of a plush toy, Short white hair, White goat huzi, Round eyes, Dense black silhouette, Simple nose, Simple mouth, Minimalist facial features, The body is the plush body of the toy koala, Wearing a robe, Lovely, Holds a staff in the shape of lightning, Quiet and mysterious, An inspiring and tireless idealist at the same time, Indicates the MBTI personality type infj, diplomat, Advocates, creating a simple yet captivating style. This sticker perfectly captures personality using vibrant colors and adorable elements, High definition, oc, c4d, 3d, blind box toy style, rococo style, Bauhaus, god rays, High quality --s 400 --niji 6

tough oriole
#

Anyone try the Control net for flux is it safe? How much VRAM added?

bitter hearth
torn wharf
torn wharf
#

as for vram, i'm not sure. gonna wager a guess. its' 1.5gb and probably in fp16 mode so i'm thinking comfyui might take either another 1.5gb or 750gb.

bitter hearth
#

its the colours mostly
for example look at the red on this hand

#

SD3 looks more natural to me

sage burrow
sage burrow
sage burrow
# narrow river

Not nearly enough for this server!

PS what was your prompt? ๐Ÿ˜„

torn wharf
#

oh if you wanna use that you need to get that specific pull request merged to your side. it's not merged yet.

sage burrow
torn wharf
#

i haven't tried using it yet

#

xlabs_flux_controlnet would be the branch it's on. i'm unsure how to swap git branches in swarm environment

sage burrow
#

they put out a model sans workflow?! So mean! lol

torn wharf
#

a company called xlabs put it out with inference code and the diffusers team has code supporting it now. comfy is testing their code. thats what i think is going on. there is a workflow for it. you just need the right branch

#

they said tommorrow is the 1024 x 1024 version

bitter hearth
#

yeah its out in diffusers properly already

#

generally if you want the latest stuff you need to be able to use both comfy and diffusers

#

hi-diffusion came to diffusers first for example

alpine summit
torn wharf
#

Guys I just realized something just now and I have to tell people

#

Barclay from star trek is Murdoch from ateam

#

That's all thanks for listening

bitter hearth
#

wow I had no idea

bitter hearth
#

his face looks different lol

torn wharf
#

i saw him in another movie about oppenheimer and i was like .. you know? he kinda looks like murdochhhhooh wait one second

#

i got it working. used the jump man logo. not much quality above 512 but it works great at this res

#

just as i thought. the claims i seen about the model not even guiding well are just wrong. as usual. cool.

#

"it's 12 billion parameters it'll never work it's outlandish!"

rotund vine
#

/style prompt a cat

torn wharf
#

source for the canny guidance. i run it through the filter for the outline but thats about all it is

#

tommorrow it'll be the 1024 pixel model and they have others on the way. neat

strange grotto
#

does anyone know if cascade has 16ch vae?

#

or 4ch vae?

hollow swift
torn wharf
#

cascade's vae is stage a and b as it decompresses the latents through a diffusion network instead. i don't fully get it. stage a i guess is the final leg of that model

strange grotto
#

So it's special? Can't compare it to sdxl or flux?

torn wharf
hollow swift
torn wharf
#

with just the one guidance net theres a ton of potential . it does poses well

#

if even just with a sillohuette of the shape for the first 15% of steps

#

@sage burrow all i had to do was open a terminal in the swarmui's backend comfyui folder and type git checkout xlabs_flux_controlnet and i was set. THOUGH, i am going to monitor this and when it merges i'll run git checkout master in the same folder. for some reason i thought it would be harder with the project in a project situation.

hollow swift
torn wharf
#

theres just the one i saw and i didn't see a point to it.

strange grotto
#

'NoneType' object has no attribute 'keys'

#

and i found this

#

did u convert the canny cnet?

spice spear
#

how to install the canny cnet?

spice spear
bitter hearth
#

That's really funny

torn wharf
#

lol i'm off to sleep soon but i could do a starfish on jumpman logo soon

#

thats a good one

torn wharf
dry wave
dry wave
#

to be precise: it's a weird hybrid of VQGAN and vae, but it's technically very similar to a vae. It's a convolutional-only decoder (effnet), though, so it might less powerful than the SD VAE

strange grotto
#

got it

alpine summit
sullen moss
#

If someone has already tested the Realistic LoRA and Kenny, please share the workflow.

alpine summit
sullen moss
sacred jewel
hollow swift
sacred jewel
sullen moss
#

Not stable but...

alpine summit
hallow lion
#

Ah control nets are here.

errant dust
#

So on a slightly different note, has anyone tried Auraform? They released v0.2 just recently

#

Support for fp16 is out for lesser HW and can run in Comfy

odd basalt
dry wave
#

never found it good

errant dust
#

Yes, they are using the Dall-E 3 technique, per OPenAi's paper, of massive labeling

dry wave
#

soo many parameters for quality worse than even pixart

errant dust
#

Well, they just released 0.2

#

as in... 10 days ago

errant dust
#

SD used a different method

dry wave
#

no

#

SD3 also used synthetic captions

errant dust
#

Yes, they even published a paper on it

#

Yes, but not DE3's method

dry wave
#

what's Dall-E 3 method?

#

if I remember their paper correctly they just run an vllm on 90% of the data and keep 10% intact

#

which is what everyone is more or less doing

errant dust
#

SD3 used their own proprietary method as published in their paper

#

regardless, Aura Form is in beta and actively being developed. 2nd release in a month

dry wave
#

if you cannot be more precise....

#

there is no conceptional difference here

errant dust
#

and since it has only been out a month, the talk about 'never finding it any good' seems a bit precipitated

dry wave
#

all modern diffusion models train on synthetic captions. Pixart was I think the first one which did this

errant dust
dry wave
#

right, because results are quite bad for Auraflow xD

errant dust
#

oh boy

dry wave
#

yeah, maybe it will at some point get better. Currently, it's just bad

#

technically it's not much different to Flux and SD3.

errant dust
odd basalt
hollow swift
#

different workflow, create different image

bitter hearth
odd basalt
# bitter hearth Give prompt <:waow:1017853838516035725>

Of course , A majestic star destroyer dominates the darkening skies, its imposing form silhouetted against a fiery sunset casting deep crimson and purple shadows across its angular hull. Turbulent clouds swirl around it, illuminated by flashes of artillery fire leaving wispy trails in their wake. Distant explosions cast flickering glows on the ship's weathered surfaces, as if foretelling an impending reckoning. The composition is cinematic, emphasizing the star destroyer's colossal scale and menacing demeanor.

bitter hearth
#

I see @odd basalt in chat I must gen one

#

haha

odd basalt
#

Great concept art anon ghost ship

errant dust
dull star
odd basalt
#

And I used Flaux and took it on aura โ€ฆ when I get on pc I can def use comfy and share if it does not crash

dull star
errant dust
odd basalt
mortal mesa
brittle nexus
limpid thunderBOT
#

Last 7 days <Aug 01 2024> โ†’ <Aug 07 2024>

  • Member counts
  • 344652 โ†— 344702 โ†— 344792 โ†— 344840 โ†— 344910 โ†— 344965 โ†— 345012
  • Action members
  • 0 โ†’ 0 โ†’ 0 โ†’ 0 โ†’ 0 โ†’ 0 โ†— 116
  • Message members
  • 0 โ†’ 0 โ†’ 0 โ†’ 0 โ†’ 0 โ†’ 0 โ†— 66
  • Reaction members
  • 0 โ†’ 0 โ†’ 0 โ†’ 0 โ†’ 0 โ†’ 0 โ†— 68
    More details
Summary | comcom Analytics

comcom analytics ใฏใ€Discord ใพใŸใฏ Slack ไธŠใง้‹ๅ–ถใ•ใ‚Œใฆใ„ใ‚‹ใ‚ณใƒŸใƒฅใƒ‹ใƒ†ใ‚ฃใ‚’ๅˆ†ๆžใƒปใƒขใƒ‹ใ‚ฟใƒชใƒณใ‚ฐใงใใ‚‹ๅฎŒๅ…จ็„กๆ–™ใฎใƒ€ใƒƒใ‚ทใƒฅใƒœใƒผใƒ‰ใงใ™ใ€‚็พๅœจใ€ใƒ‘ใƒ–ใƒชใƒƒใ‚ฏใซฮฒ็‰ˆใ‚’ๆไพ›ใ—ใฆใ„ใพใ™ใ€‚

acoustic kite
#

thought this was kinda funny

sage burrow
mortal mesa
#

too much grass for NYC

torn wharf
#

i keep see people using sdxl as a refiner workflows. don't they know that they should do a 3rd pass through a superior vae model and end the refining chain with the king of the game?

#

do people just have bad eyes and not even notice that sdxl washes out details in their images?

gusty trail
#

Most people don't care

torn wharf
#

seems to be that way. they pretend to care about things like quality, but that turns out to just all be theater and dramatic hyperbole.

torn wharf
#

this one is only 512 resolution too. i wont bother testing it. i had great results from the smaller one

sage burrow
#

@bitter hearth

#

er, not sure if catw4gbgpu is who asked for it, but I just realized, no way you can run it, except maybe glif?

wispy spade
#

is it okay to create sub folder on comfyui like checkpoint> sd1.5, sd3 or Lora>style subfolder, posing etc so i can organizing my model in a better way? it wont create any error right?

green shoal
wispy spade
#

thanks

sage burrow
#

hmmm, that long list of 100 checkpoints is getting out of hand, think I should try this ๐Ÿ˜„

green shoal
#

at 100 checkpoint files, it will eventually get out of hand even with subfolders

#

maybe we need a new UX for the loader node

torn wharf
bitter hearth
#

you just want SD3 to refine the fine details

#

it works great

sage burrow
#

Does anyone have a workflow I can borrow that brings SD3 back in at the end? ๐Ÿ˜„

odd basalt
#

Flux + AUra + camera raw

odd basalt
torn wharf
odd basalt
mortal mesa
#

should call that one how to complicate a basic workflow

odd basalt
mortal mesa
#

i see that has a negative prompt not sure why, the first thing i do is undo that to save time, so many workflows i see are encoding a negative just to plug the hole in ksampler

#

like so, to not have to process a negative that i wont use, time saver on my old rig

bitter hearth
#

conditioningzeroout and an empty prompt are not neccesarily the same

craggy crest
mortal mesa
torn wharf
#

i thought negative prompting just piggy backs the cfg process and it's cfg which is causing the extra generation time

mortal mesa
#

basically

#

so i dont care if zero out or encoding a blank prompt are different, i get the same result faster at 1 CFG

craggy crest
mortal mesa
torn wharf
bitter hearth
#

looks for balls

sacred jewel
errant dust
#

I read something quite curious and wondered whether others had seen this:
Flux comes in two variants:

Timestep-distilled (black-forest-labs/FLUX.1-schnell)
Guidance-distilled (black-forest-labs/FLUX.1-dev)
Both checkpoints have slightly difference usage which we detail below.

Timestep-distilled
max_sequence_length cannot be more than 256.
guidance_scale needs to be 0.

#

in other words Schnell should be run with a guidance of zero

torn wharf
errant dust
#

I use Dev-Schnell

odd basalt
#

I use Dev and pro

errant dust
#

I run from my PC

odd basalt
#

Pro on iPad and dev on pc

errant dust
#

well, Dev-Schnell is exceptionally good and fast

odd basalt
#

From the iPad

odd basalt
#

As long as I can do my horror I love it ๐Ÿ˜ inspired by true events

errant dust
#

Blind date?

mortal mesa
torn wharf
#

hehehe its a full moon out

odd basalt
# errant dust Blind date?

Actually I never thought of that when I made it ๐Ÿ˜‚ I was mashing up scream with pretty girl and piglet from Pooh turning hott girls into killers

bitter hearth
sage burrow
#

Flux Canny, but apparently I haven't gotten the settings right. I haven't looked into it yet, I just chanbed the checkpoint in Crystalwizardd's apple workflow

mortal mesa
mortal mesa
icy drift
#

I'm been having a very hard time getting 4K renders out of FluxDev. ๐Ÿ˜ฆ

#

I want to replace my desktop background, because it's a great new model.
But with multiple upscale steps and a low denoise on the final render, I get all sorts of artifacts.

#

The model's physics are amazing, for sure. Those reflections are flawless as far as I can see.
For comparison, here's my current desktop background, which is a Cascade render. The reflections are obviously wrong, because Cascade isn't very smart, and the details aren't perfect. The flowers are all just impressionist mushy messes when you zoom in.

#

But it's still overall better than FluxDev. Anyone know how to get it to render well at 4K? ๐Ÿ˜•

rain current
#

It works very differently from usual. If there are artifacts, you need to increase the denoise, and if it generates an HDR effect, you need to increase the max_shift. It is complicated to reach a good point for all types of images...

icy drift
rain current
#

Well, if that happens, increase the max_shift even more, even to 2..

acoustic kite
#

why are my images noisy? could it be a wrong setting somewhere?

#

i'm using sdxl_lightning so the steps aren't too low

rain current
#

But I have given up... when it seems like I have achieved a good breakpoint with another type of image, it is a disaster.... I am done with dev and the upscale with sdxl...

mortal mesa
#

what sizes people generate at, i was doing what i was used to doing 1MP resolutions, but that was a 2MP, i forget this can do 2MP out of the box

bitter hearth
#

your best bet would be waiting to see if this model gets support by the Hi-diffusion team
as that is currently the top method for high res diffusion outside of Wรผrstchen

#

otherwise, until then some form of block by block scheduled latent downscale could help, i.e. a manual version of deep shrink

mortal mesa
dull star
#

man I can't wait for boring reality lora for flux, but the realism lora is nice for now

bitter hearth
#

oh there's a lora?

dull star
#

before lora, after lora

#

no more strong (+ overexposed) plastic skin

bitter hearth
#

its a lot better yeah

#

its still too saturated and plastic for me but not as bad

dull star
#

boring reality will take things next level

bitter hearth
#

sadly not neccesarily

#

in order for a neural network to be really good for finetuning it has to be a little bit under trained

#

in Flux's case it seems to be overtrained a little to bake in an "aesthetic" or midjourney look

#

this can't necessarily be removed by fine tuning that easily

errant dust
#

mno โ€” Today at 4:24 PM
Hi, let's skip the community chat this week, and let us continue to work hard on shipping the best text to image model ever existed, called Ideogram 2.0. It'll be in beta testing very soon.

#

So new Ideogram is on the way

bitter hearth
#

I kinda just ignore the closed source models TBH

mortal mesa
brittle nexus
#

2x upsscale flux

odd basalt
#

Flux + aura

#

Well least we all get mj for free through SD โ€ฆ making up for the SD3 XL model that they want us all to forget โ€ฆ there back up model that was suppose to be sd3 makes up for it allโ€ฆ the boat switch worked sd3 sunk like the titanic

#

Sarcasm^

mortal mesa
sacred jewel
#

How are folks plugging in the LoRA? I just tried the reality LoRA from Comfy but not sure if I have it connected properly...

uncut river
#

damn you flux-schnell ! got it right in about 2 or 3 images - 1 prompt tune

errant dust
#

Same schpeed, better results

fleet meteor
torn wharf
bitter hearth
#

nah its still midjourney

#

if midjourney was open source I would barely use anything else

brittle nexus
errant dust
# bitter hearth nah its still midjourney

No, it really varies. MJ is amazing at many things, but not all. For example with text it is just crap, and some styles of art too. Its understanding of complex prompts is also not the best, even if it can often produce the most amazing visuals. However, if you want uncensored productions of artist styles or known celebrities, it has no peers. Ex: give me an illustration in the style of Frank Frazetta. It will, like nothing you ever saw. Ideogram was king of text and has fantastic art too. I say 'was' because I haven't tested Flux enough to really compare. Flux has serious limitations so far in terms of pure creative illustration-style art. A lot of it seems by the letter, with few extra details and touches. For a lot of things it is amazing so don't get me wrong. None of them are universally king though. Not even Flux.

#

Here is an example of Midjourney asked to make a fantasy image in the style of Frank Frazetta. For this type of request, it is peerless, it is true. just don't ask it to spell anything:

#

Here is the sort of artistic text and variety in fonts and presentation in which Ideogram until now has had no rivals:

#

Dall-E 3 is king IMHO in terms of general illustration style art. Others can get it hit or miss, but it is the most consistent. And it will imitate the style of known artists too. Censored you say? Only the version in ChatGPT/OpenAI. Microsoft's Copilot has no such issues:

#

Assuming it recognizes them of course.

#

The last one I can't seem to get Flux to do anything similar no matter how hard I try. Ideogram is okish, and MJ can too, but Flux seems to resist any attempt. Might simply need some major prompt tweaks and adjustments on my part, but it hasn't been for lack of trying

#

And dont think I am dissing Flux. It is absolutely amazing, but it is clearly better at some things than others. It has a huge focus on photorealism, but illustration type art? Not so much so far.

errant dust
#

Yes

#

Through Copilot

fleet meteor
#

Iยดll try achieving something like that on flux rn

errant dust
#

You can use it 15 prompts a day for free, so 60 images. More if you don't mind waiting a bit for more. Ideogram gives 10 a day.

#

I have subscribed to them all, and used SD as well since 1.4

#

TO show Dall-E 3 can do artist styles, here is the same image but requested in the style of Keith Haring.

#

This is a picture by Keith Haring to illustrate the style:

torn wharf
#

if a company made an ai accelerator that looked like this, woudl it sell?

errant dust
#

Would be nice if the AI did not insist on blurring that as if depth of field meant anything here

torn wharf
#

its dramatic that way. makes you think about how deep the symbolism is

errant dust
#

lol

fleet meteor
#

Do you have the prompt? (i just wrote something )

errant dust
#

Or reassures me that AI is still dumb

fleet meteor
bitter hearth
#

can anyone get skim cfg node to work

errant dust
#

"A vibrant and whimsical comic of a tree with numerous cats of various colors, sizes, and expressions perched on its branches. The cats exhibit a range of emotions and designs, from smiling faces to curious expressions in a caricature style. Some cats are perched alone, while others are grouped together. The overall mood of the image is playful and enchanting, and accentuated by the dark outlines in the drawing."

#

For the record, Ideogram was not great at this either.

fleet meteor
errant dust
#

You don't want to know how slow it is on my laptop

fleet meteor
#

Better but not that good

errant dust
#

Yeah, Ideogram was like that, though sometimes a bit better.

#

Like I said, they all have their strengths and weaknesses

young blade
errant dust
#

I mean some things they all do similarly well, like those godawful boring Japanese cartoon styles. But others are very hit and miss and some are better than others.

fleet meteor
torn wharf
#

just put blur in the negative. ez

young blade
torn wharf
#

hooked up cfg guider and fed it a negative prompt

#

doom guy comes out better when i negative doctor victor and halo

young blade
#

got it, the default 8 cfg is obviously too high lol, you find a good average? 1-3 seems ok

torn wharf
#

i hover in that range. seems to do something. more data still needed kinda though

covert galleon
alpine summit
torn wharf
#

its still off but its better

alpine summit
errant dust
# fleet meteor

Well, may be a matter of prompting to some degree as I am getting closer. But you really need to spell out everything. For example, if you don't tell it the tree has a lot of leaves, it gives you:

#

So I added "dense foliage" and "LOTS of cats":

#

Not quite as creative or interesting as DE3's images, but still waaaaay better than the initial ones.

#

Will try Flux Dev, and after playing more with the prompts. See what I can learn from it. (BTW, I always leave prompt and workflow in images I share here)

bitter hearth
#

how long did it take on what hardware?

fleet meteor
#

yaaay!

torn wharf
#

if you use an LLM to really elaborate your prompt concept, it'll go hard too. that's often what the walled services are doing.

sterile pendant
#

I see why, it's actually sometimes using all 100% of my CPU alongside it. Normally, it will only use the e-cores (usage sits around 40%). Was noticing that sometimes it would be 8 or 9 sec/it, then other times, like 12 sec/it.

errant dust
#

It will show you what the LLM version was in each case as well.

alpine summit
torn wharf
alpine summit
alpine summit
alpine summit
rain current
dry wave
# bitter hearth how long did it take on what hardware?

I have a 3090 with 24gb. With batch size 1 I usually have 5 seconds per step. So definitely slower than SDXL, but still doable. Problem is rather to find good parameter settings first. My validation images always look as if the method blows up, but then it always turns out it is still undertrained

dusky thistle
icy drift
#

Testing negative CFG now. Thanks @torn wharf for hint.
FluxDev FP8, 1024x1024, euler/beta, 16 steps, fixed seed
same prompt to clip_l / t5: `mountain landscape, grass, lake, sky, birds flying, soft white clouds, mountains reflected in the rippling water, wildflowers in the grass,

cinematic masterpiece, dynamic lighting and shadows, extremely highly detailed, landscape photography, desktop background wallpaper`
Baseline result

#

With negative prompt red, crimson, carmine, ruby (Holding all other parameters constant throughout tests. Will only change negative prompt.
Obvious absence of red, so far negative CFG appears strongly effective. Reason for blurriness unknown, will experiment after CFG tests.

#

With negative prompt blue, azure, cobalt, marine, navy, ultramarine Definite effect, less obvious accuracy.

#

With negative prompt green, jade, emerald. Again less obvious than red, but definite absence of green from original on fixed seed. No observed blur means probably side effect of seed, not of negative.

#

With negative prompt color, rainbow, saturated, vivid, lush, colorful, multicolor, intense, rich. Obvious reduction of color from baseline.

restive notch
#

laugh girl walking to the moon

icy drift
#

Positive guidance 3.5, negative guidance 9.0, CFG 3.0

#

Positive 3.5, negative 9.0, CFG 9.0

#

Positive 3.5, Negative 9.0, CFG 1.0 (holding CFG 1.0 from now on)

dry wave
#

maybe run cfg only for the first 90% timesteps?

icy drift
dry wave
#

and I would use same guidance for both

dry wave
icy drift
#

Positive 9.0. Negative 9.0. Desaturation maybe less pronounced.

#

Positive 9.0. Negative 1.0. No change.

#

Positive 1.0. Negative 1.0.

#

Again, Positive 3.5, Negative 3.5. (Repeat baseline.)

#

Upweighted negative prompt (color, rainbow, saturated, vivid, lush, colorful, multicolor, intense, rich:5.0). No change.

#

Removed negative prompt. No change.

#

Restored negative prompt color, rainbow, saturated, vivid, lush, colorful, multicolor, intense, rich (not upweighted), CFG 2.0. Definite desaturation effect has returned.

#

All else constant, raised to CFG 3.0. Desaturation effect intensified.

#

Upweighted negative prompt to 5.0. Desaturation diminished (opposite expected effect).

#

Restored negative prompt to unweighted state. Raised negative guidance to 9.0.
Desaturation effect increased again.

#

Tests complete! ๐Ÿ˜Š ๐ŸŽ‰

#

Negative prompting for FluxDev is now 100% usable.

dusky thistle
#

used random attention guidance for this

#

no upscale

icy drift
#

Prompt painting of a woman warrior by Frank Frazetta, moon maiden, oil painting, concept art illustration

#

Same seed, with negative NSFW, nude, naked, bare skin, exposed skin

#

๐Ÿ˜Š

dry wave
#

they always look too photorealistig and not vintage enough

hallow lion
hallow lion
#

holy molly!

alpine summit
sterile pendant
bitter hearth
#

He's the guy who gives powers to the main character and leaves without a trace

signal shuttle
hallow lion
#

The 5th element.

stoic turtle
#

Shows up again at the end of the story smiling

alpine summit
stoic turtle
#

I think yer camera ainโ€™t straight

alpine summit
sacred jewel
signal shuttle
sage burrow
#

Flux has expanded a bit

rain current
hot dawn
acoustic kite
#

TIL the max resolution recommended is 2MP, double what is recommended for sdxl which is 1MP

#

this is what 2mp looks like. the guidance scale is set to 4, i might lower it to 2 and try again

alpine summit
errant dust
#

Note, it SAYS Pro, but I am in doubt as to whether this is Pro or only their Pro subscription plan.

#

EDIT: turns out only subscribers can use Pro. The free images are Schnell.

acoustic kite
#

wait was it pro that was supposed to be slightly better than dev?

alpine summit
errant dust
#

I edited my messages above to avoid any misdirection

#

I'll probably plunk some small money down on one of these guys to test Pro and compare. Likely one that allows me maximum settings control

rain current
bitter hearth
bitter hearth
hallow lion
bitter hearth
muted dove
#

"Spaced" more like

errant dust
#

Power to Fal.AI. I had credits left from when I plunked some money down for SD3 8B but apparently did not use it all. Can use for Flux Pro.

acoustic kite
#

oops sorry for posting images back to back. didn't mean to :)

errant dust
#

So it turns out you DO get 5 free Flux Pro images with Nightcafe

#

Their site is hopelessly confusing in some things, but this is now certain.

acoustic kite
#

i'll def check it out if that's the case then

#

also i found out that you just have to copy paste your prompt several times if it doesn't give you what you want

#

a 5x copy/pasted prompt will give you the result all of the time lol

errant dust
#

Be warned their default settings, even for a free account is to send you email notifications about EVERYTHING. So be sure to enter the setting to turn that crap off. That said, free Flux Pro is free Flux Pro, and the settings change is a one-time nuisance

#

They have an option to fix the seed, but it is entirely unclear to me if the seed matches Comfy's in any way. I suspect not.

#

When comparing models or prompts I like to fix the seed to be sure I am really seeing an apples to apples comparison, and not lucky RNG vs unlucky

alpine summit
torn wharf
#

free flux pro? more like.. flux SLOW. amirite?

silver sluice
#

hey @jolly swan, hope you're doing well. I look forward to your progress on v6.9 if you have any. Me and my friends we're wondering now that the dust has settled on SD3 per-se, on a few things

  • Are you still considering making an SD3 Pony model after 6.9's release?
  • Have you considered switching to Flux instead for the next generation Pony model?
  • (Sorry I have to ask) Do you have a timeline for when you'll expect to see some results for 6.9?
mortal mesa
#

nah dont fall for it

icy drift
#

Have whoever you're arguing with blocked so your posts are all I see. ๐Ÿ˜› I did a bunch of tests on negative prompts after you mentioned the node you were using, and it definitely works! I posted a workflow for it earlier in the thread with instructions on what works / doesn't. ๐Ÿ™‚

mortal mesa
#

yikes, you attack and fight small children

torn wharf
#

never thought a lame joke like "more like flux SLOW" would set someone off that hard

foggy cloak
torn wharf
#

there's an experimental branch right now

#

its not working when i checked

foggy cloak
#

From panchovix? Might have to switch to experimental branch to try it out

torn wharf
#

and lly has been adding stuff to upstream too

#

i haven't had success getting it to work but they're doing stuff

mortal mesa
#

people have broken psyche, alot of people

mortal mesa
#

TBF there are a ton of reasons

torn wharf
#

whenever i hear things like "nobody likes you!" from one person, that just makes me think that all the people who don't like me are afraid to tell me so. so i intimidate them and that makes me feel good about myself.

#

logic

foggy cloak
#

Never seen you before, but I just remember crystalwizard going crazy glazing SD3 when it came out ๐Ÿ˜‚

bitter hearth
#

balls

torn wharf
#

i think that was the earliest impression of him i can remember

bitter hearth
#

Nothing is imballsible

torn wharf
#

i know i dont know everything. i think i'm dumb in this field. i also recognize that many self proclaimed experts are serial bullshitters. Thats why i like to just get down and dirty and start throwing balls at the wall to see what happens

#

can only get so far with book smarts. like dr jones used to say, field work is important

#

bad doxxing skills

mortal mesa
#

yell him he watches FOX like you di dthat other time, thatll get em

icy drift
# torn wharf just crystal wizard saying everybody is thinking about me and laughing fondly of...

I experimented to figure it out, and the results are in my workflow as notes.
CFG 1 = negative prompt does nothing.
Higher CFG = more powerful negative prompt
CFG > 3 = artifacts and buggy output

Guidance on negative prompt: increase for slightly more powerful negative prompt. High guidance will cause artifacts.

Best way to boost negative prompt: Add more words, synonyms, and related terms.

I locked the seed and all other gen parameters, and for different sets of prompts with / without negative I proved that negative was working perfectly.

torn wharf
mortal mesa
#

i did some feelings based tests and went back to FP16 T5xxl

torn wharf
mortal mesa
#

i dont know how else to say it lol, it does indeed feel better haha

#

and im not going to test

torn wharf
#

i'm sure that fp8 t5 diminishes the quality. i've felt it. but i don't have the data to demonstrate it to others.

sage burrow
#

The comic in question (tbh I'm only reposting the actual image it's really awesome)

I sent to icycold (and crystalwizard also but accidentally bc i mixed my windiws up at first)

I think too many people look down on those with any psychological stuff going on.

I wasn't making fun of icycold.

I just woke up, so apologies if I'm not making sense yet; I also didn't scroll back very far for context.

mortal mesa
torn wharf
#

lol it's the usual situation. I expected it to be honest, just not from crystal wizard. mention last week in passing that i had some mental health issues i've been working on , focus issues mostly, and now it's blowing up in my face. everyone laughs about me and shares mental health memes about me. smh. what a catastrophe

bitter hearth
bitter hearth
#

Does flux not have negative prompts

mortal mesa
#

technically, no, just Pro

bitter hearth
mortal mesa
#

also technically, people added them anyways

sage burrow
mortal mesa
#

i just do the slow

#

i mean you do like art business stuff maybe i think, i just play, so experiences may vary

sage burrow
# mortal mesa i just do the slow

If you mean the on my own computer slow, for schnell it's 5 mins compared with 30 seconds. My lack of patience satanic $30 p/m isn't bad ๐Ÿ˜„

sage burrow
mortal mesa
#

yup i dont blame you, im just not a buyer of AI yet..

torn wharf
#

I like the idea of them releasing a special distill that's not good for images, but we can use to train Loras that work on the distilled models.

Would that work?

torn wharf
sage burrow
sage burrow
bitter hearth
torn wharf
#

I just need to remind people that Barclay from TNG is also Murdoch from the a-team.

Like, what else haven't we discovered?

sage burrow
bitter hearth
#

Im dumb

#

Lalita is the name of an extremely acidic soda thing over here, few like it thomas

#

I like it

#

Your name always makes that come to mind

sage burrow
bitter hearth
sage burrow
#

I forgot to title that mental health cartoon, supposed to he titled "if we treated physical ailments how ee treat mental health ailments" or something like that.

torn wharf
#

I was watching a-team AND TNG while they were both broadcasting new episodes and I never connected those two characters.

Just goes to show how we can get stuck in this domains.

bitter hearth
sage burrow
torn wharf
#

I also like the idea of them releasing an undistilled 5b version . Ideas are fun

bitter hearth
mortal mesa
#

chicago?

errant dust
torn wharf
#

This lame joke has really set some off. It was a lame joke

bitter hearth
mortal mesa
errant dust
torn wharf
#

Wait until people figure out they give free flux gens too. Right now it's 5s.

#

Also, it's not a hill I'm gonna die on. Read it aloud for instance.

errant dust
#

I have credits in Fal anyhow. The real interest was/is to see how Pro compares in quality and output.

bitter hearth
#

thomas idk what I'm doing

torn wharf
#

when the prompt was for something entirely different it seems to make something cool anyways?

bitter hearth
#

No I did prompt for an overgrown ruins with portraits...

torn wharf
#

oh it got it!

bitter hearth
sage burrow
mortal mesa
#

ide like to say i re-remembered if you select a bunch of nodes you can combine them to a mega node

bitter hearth
mortal mesa
#

so not the worst place by far

bitter hearth
torn wharf
#

first thing i noticed is it tries to make the magic kingdom castle accurately tall instead of perspective trix like in real life

sage burrow
bitter hearth
#

Shrek is the way

sage burrow
errant dust
sage burrow
dull star
#

dev schnell combo suggested by comfy?

#

is that the merge?

errant dust
#

yes

dull star
#

and comfy recommends it? damn

bitter hearth
#

Comfy secretly put fox girls in the merge

mortal mesa
#

i think the recomendation was in the method of combining

bitter hearth
mortal mesa
#

i use that one with 6 steps

#

and fp16 t5

torn wharf
#

sd3 adn flux and all these SOTA models. I can't wait until this tech converges with the meme spheres and gets things weird af online. i feel like we're too slowly getting to the next gen of online content and things need to ramp up. we canmake relevant new memes for any situation now. it's a memevolution.

errant dust
#

"Flux.1-Dev double_blocks (MM-DiT) onto Flux.1-Schnell, then quantized to fp8 giving you a higher quality model that still runs in just 4 steps"

mortal mesa
dull star
torn wharf
mortal mesa
#

true

dull star
#

like if I want something super complex that might only work with dev at >25 steps

torn wharf
#

only a matter of time before a smart person figures out how to run fast inference on cpus with clever instruction set tricks and we have access to the main system memory

#

or intel is almost certainly designing instruction sets specifcally for inference

errant dust
bitter hearth
torn wharf
#

I know jim keller, the guy who architected the current amd64 ecosystem of instruction sets, he's working on dedicated hardware for this stuff. his chips are kind of old gen tech now though and i haven't seen new stuff from greyskull labs for a bit

errant dust
#

And in the cause of image AI aficionados, it is a match made in heaven

torn wharf
#

i dont know him. i know that hes' working on stuff

mortal mesa
torn wharf
mortal mesa
#

TPU for all

torn wharf
#

i don't think cloud inference services like chatgpt will ever work well for games outside of chatbots in games

dull star
#

there is already one for stable diffusion 1.5, 2.1 and sdxl I think

#

oh even sd3 daaamn

torn wharf
#

live services for gaming is an ongoing problem. they're often not needed and are essentialy just DRM

torn wharf
dull star
#

but this repo also allows you to partially use your gpu like with cuda, vulkan and rocm

#

like llama.cpp and whisper.cpp

torn wharf
#

only a matter of time till a smart developer puts diffusion upscaling into a game engine, so that all the high fidelity textures are generated at run time using lower quality scafolding assets

bitter hearth
mortal mesa
#

i think Microsoft and NVidia have that developed already

#

gonna get many game remakes down the line

torn wharf
#

theres a ton of controlled art direction that can occur with model. ip-adapter style transfers and very elaborate prompts would get enough consistency for game textures i have no doubt. just no one's made a product showcasing that yet. usually takes about 5 years for a game to go from initial ground breaking to a released product.

torn wharf
bitter hearth
torn wharf
#

the GTA trilogy remake was done with "remastering" tools too, though that was a poorly applied effort imo

#

remember when we first started seeing GAN upscalers? people were remastering final fantasy 7 prerendered backgrounds with them. like 2002ish?

errant dust
torn wharf
#

nvidia is still a gaming tech company at their core. they didn't design these new chips for openAI 's level of business at first. it was primarily to make video games go harder and they knew that chasing that would put them in a very strategic position for the future.

#

the one AI i've been watching them train that's so cool is a player agent. it can learn to play any games with any control schemes. they're doing a ton of practical research in that field. so instead of dippy dumb AI team mates, you can have one that teaches you some shit

errant dust
#

The ray tracing you see in games such as Cybrpunk 2077 and others has nothing to do with neural network inference.

bitter hearth
#

How can you guys type so much

errant dust
#

Nor the massive polygon output in Unreal 5

bitter hearth
#

agony I'm lazy to even read

mortal mesa
torn wharf
#

https://youtu.be/5LL6z1Ganbw](https://youtu.be/5LL6z1Ganbw?t=120 this is just one aspect of that research. they taught it to play minecraft by watching streams. it can do text to minecraft! "build a castle" and then it does!

โค๏ธ If you wish to support us and watch these videos in early access, check this out:

๐Ÿ“ The paper "MineDojo - Building Open-Ended Embodied Agents with Internet-Scale Knowledge" is available here:
https://minedojo.org/

๐Ÿ™ We would like to thank our generous Patreon supporters who make Two Minute Papers po...

โ–ถ Play video
errant dust
#

There are papers out to use neural networks to improve ray tracing, but they have not made it into games yet

torn wharf
#

while i don't believe nvidia has published a paper on how RTX is achieving real time ray tracing, i'm fairly certain they use a machine learned model

errant dust
#

But as I said, NN inference does exist in games, as shown with AlphaGo first, but is now well embeded in other mind games such as chess and shogi

torn wharf
#

the cyberpunk raytracing doesn't use nvidia's tech at all

#

imo crytek's crysis remake does the most accurate real time ray traicng

errant dust
#

I have trained many chess-based neural networks from scratch

torn wharf
#

DLSS "Deep Learned Super Sampling"

errant dust
#

using established tools mind you, but data and structures my own

torn wharf
#

with programming i got as far as remaking tetris from scratch in a console text engine. also i made a template language for html forms that was over engineered and then jquery came out and i said fuckit.

never got much further with code

errant dust
#

I'm a chess player at my core, not a programmer, so my interest was in leading the training of the NN through specific steps and methods from the perspective of a chess player

torn wharf
#

hmm. at my core.. what am i? break dancer with a knee injury. that'll do

errant dust
#

I had help with the code from a friend who is a coder for Amazon

bitter hearth
bitter hearth
errant dust
#

I take it this frustrated your Olympic gold dreams

torn wharf
#

oooh yeah tragic memory just came iin. thoguht i was clever and named my library "phorms" because it was a php situation and i thought "wow thats gotta be the best library name anyone has ever come up with" and i was really into it. i remember feeling so proud of myself. but everyone who did forms in php had the same idea

sacred jewel
torn wharf
mortal mesa
bitter hearth
sacred jewel
torn wharf
#

you dealt with ascii in the 80s. remember an app called "The Draw"

sage burrow
#

My level of coding is: built websites (in html in notepad) before dreamweaver etc. was even out. Make tons of glifs
ROFL

torn wharf
#

olympic ball dancing event

bitter hearth
#

10/10

#

People just standing there looking at them not do anything for 10 minutes

torn wharf
#

i put "resting still" in the negative

torn wharf
acoustic kite
#

where is this on the realism scale

#

here's another one :)

rain current
torn wharf
# acoustic kite where is this on the realism scale

the height chart is a dead give away. the measure lines are very none standard and dont persist properly behind her. her left ear the one hidden behind hair, wacky postioning for her head tilt. the glasses are sitting way higher than they should be. goofy hair line. weird red smudging under the frames but otherwise perfect skin everywhere else. obviously the text is weird but that could just be an in joke that no one gets. but the one thats super uncanny that most people wouldn't notice i bet. light seems to be coming from the top left and top right at the same time. the shadows are all over the place.

8/10 realism because it would probably fool 80% of people.

vast condor
#

are those real words?

torn wharf
# acoustic kite

this one i would only suspect because of the power button position on the bottom of the device

acoustic kite
torn wharf
rain current
torn wharf
#

it really is getting so hard to tell tho

icy drift
# acoustic kite where is this on the realism scale

Flux is the closest to perfect any open model has ever been, but it still took a TON of manual work to get my monster cards ready to print. Every image has mistakes only an AI would make, as soon as you start looking close.

torn wharf
#

you could sometimes get soul out of older models if you used an open pose controlnet on a photo with soul. it's in the latent space it's just hard to evoke out

torn wharf
icy drift
icy drift
torn wharf
# icy drift Is this "soul" like your "aesthetic" thing? ๐Ÿ˜› Doubt I'll get it. I like data.

to understand soul there is but one bible of a film to ingest. May i be so inclined to even provide a link. There's probably more that people would argue about, but i think they get it here. https://archive.org/details/the-blues-brothers-1980-4k

Internet Archive

Jake Blues rejoins with his brother Elwood after being released from prison, but the duo has just days to reunite their old R&B band and save the Catholic...

โ–ถ Play video
#

if you're on the fence, might i suggest that it stars Carrie Fisher

torn wharf
#

They get the band back together

icy drift
sage burrow
icy drift
#

(I also am a philosopher and believe in the metaphysical soul, so alternatively explain in a way a philospher / logician / linguist would understand it.)

torn wharf
#

i believe that if there is a soul, we can engineer it.

#

metaphysics is just physics that doesn't have an applied science to it yet

icy drift
mortal mesa
#

believable actions/poses/expressions/body language

icy drift
torn wharf
#

also maybe i might show the computer some alex grey paintings or a tool music video

sage burrow
#

you know that famous painting of Venus in the clam shell? That's one that google images chose. I can't post it on discord tho

icy drift
sage burrow
icy drift
sage burrow
#

I was hoping for one of those fancy ones where there is a translucent version of the person half out of their body....

sage burrow
icy drift
torn wharf
#

if the birth of venus isn't SFW, i don't wanna work there

#

boss is like "JENKINS?! YOU GETTING OFF TO RENAISSANCE ART AGAIN?!" its like.. no dude thats a weird accusation

icy drift
torn wharf
#

just rip the band aid off and post it

mortal mesa
sage burrow
icy drift
mortal mesa
#

its a 6th sense

icy drift
mortal mesa
#

cant say it but can see it

torn wharf
#

i dont think it was a portrait of someone posing. it has all the elements of like a constructed person. like they planned it out her pose and face and expression and eye lines and all that. like how we have a style like anime or pixar style. it's a design language.

icy drift
torn wharf
#

knowing the italians probably the position of her toes have extreme symbolism too

icy drift
uncut river
#

I believe in soul as in spirit, with a certain focus and mindset, in the style of, alive, purposeful

torn wharf
#

i gave all those blues brothers pics to represent soul. and also alex grey

icy drift
uncut river
#

not only people have souls

torn wharf
#

whatever LSD idoes to neurons, thats what we need to encode into these networks. if computers are going to find soul, that'll either do it or just make things interesting. either way, fun.

#

disclaimer: i have no idea what lsd does i've only seen it in movies

icy drift
uncut river
#

shrooms work better for the visual, I heard someone tell to some other

sage burrow
#

Alex Grey created a lot of paintings along those themes, in the 80's

mortal mesa
torn wharf
icy drift
uncut river
#

If you need a scan device to exaclty measure the amount of soul in something, you'll probably dont get it

torn wharf
#

contextualization could easily give that image some soul. but it does parodoxially feel stiff to me. and that's interesting. it compels me to examine it closer

mortal mesa
#

the scientologists got that thing

sage burrow
#

dips my gpu in lsd

torn wharf
uncut river
#

careful, dont touch with your fingertips

torn wharf
mortal mesa
#

like its a Michael Jordan rookie card

torn wharf
#

my heart aches so hard when i see people planting their sweaty sebum bum thumbs all over circuit boards

icy drift
torn wharf
#

seeing it out of the case is actually making me really nervous

mortal mesa
#

anything could happen a sudden gust

#

Heres a horror story for you, i pulled out like a half a cup of matted down dust out of my video card when i started with AI, its much quieter now

torn wharf
#

yikes

#

soilent dust is ||people||

mortal mesa
#

freaked me out, was well hidden behind the casing

icy drift
#

Guy says Differential Diffusion inpainting works with Flux? ๐Ÿค” Really great results. I'm at work. Anyone want to test it?
https://youtu.be/Qiltbxvhr_A?si=UpIN8In_7df77WiT

In this video, we demonstrate how you can perform high-quality and precise inpainting with the help of FLUX models.

FLUX is an advanced image generation model, available in three variants:

  • FLUX.1 [pro] for top-tier performance,
  • FLUX.1 [dev] for efficient non-commercial use, - - FLUX.1 [schnell] for fast local development. These models exc...
โ–ถ Play video
torn wharf
#

lol invoke was telling poeple that flux inpainting was impossible

icy drift
#

I'll test it later. Might be tomorrow though.

uncut river
#

I think sd15 has some soul

torn wharf
#

prompted it by name with a description of it. i think it knows giant cedars but not that one in particular. its close! it vibes.

mortal mesa
bitter hearth
#

i wana train a lora on flux on my 4090

#

kinda waiting for kohya_ss script

dry wave
#

yes

#

SimpleTuner is fine. Not more complicated than kohya_ss

#

it supports quantisation and all kind of loras out of the box as it's using diffusers

#

I could imagine it takes some time until kohya_ss supports stuff like quantisation, dunno

torn wharf
#

yeh its not getting that one specific tree but it does get creative with trees. hehheheheh hole hehee

mortal mesa
rain current
#

six fingers... welcome back

mortal mesa
jolly swan
# silver sluice hey <@670866147373219841>, hope you're doing well. I look forward to your progre...

SD3 is not competitive with other options like AuraFlow or Flux so there is not much reason to build anything on top of it (or even use it, honestly). Flux is good but overall not as exciting as AuraFlow which lacks in terms of aesthetics but I think I may be able to fill that gap. tldr: I am getting ready to train the first smaller finetunes first to see how it goes but I think the main training run is getting close.