#🆕｜sd3 | Stable Diffusion | Page 128

craggy crest Dec 13, 2024, 12:02 AM

#

sd 3.5 large

#

#

lavish sparrow Dec 13, 2024, 12:05 AM

#

"Kingslayer"

#

bitter hearth Dec 13, 2024, 2:20 AM

#

#🆕｜sd3 panorama view of a carnival, lit at night, beautiful night background with Ferris wheel, stall, roller coaster

craggy crest Dec 13, 2024, 2:40 AM

#

craggy crest Dec 13, 2024, 3:41 AM

#

icy drift Dec 13, 2024, 9:56 AM

#

Woah. Works amazing.

icy drift Dec 13, 2024, 10:21 AM

#

So that's an anime "drone shot", is it?

#

Why do the drone shots have drones in them though. Also these buildings are pretty wonky.
Probably because of the low resolution on my PC. 😕

dull star Dec 13, 2024, 11:06 AM

#

with stg or no?

#

or is this not ltxv?

errant dust Dec 13, 2024, 1:33 PM

#

I don't have that issue at all. Quite the contrary. I get contrasty punchy images

#

Sounds like an issue in your setup. Either nodes or Python.

brittle nexus Dec 13, 2024, 1:47 PM

#

bitter hearth Dec 13, 2024, 1:58 PM

#

I agree, SD 3.5 has more vibrant colours than flux in my opinion

muted dove Dec 13, 2024, 3:42 PM

#

#

#

muted dove Dec 13, 2024, 3:53 PM

#

brittle nexus

Send her to rehab, now!

buoyant mesa Dec 13, 2024, 6:11 PM

#

Kohya_ss does not support Lora training for SD3.5 yet, right?

cunning lintel Dec 13, 2024, 7:30 PM

#

SD 3.5L def is still nice, it's like flux's uneducated but creative cousin, bit dumber but really tries to make up for it 😉

mortal mesa Dec 13, 2024, 7:56 PM

#

bitter hearth I agree, SD 3.5 has more vibrant colours than flux in my opinion

i havent tried to much with 3.5 but the first SD3 you could pop out some super vibrant stuff

#

now with audio

#

devout schooner Dec 13, 2024, 8:52 PM

#

What were your prompts here?

devout schooner Dec 13, 2024, 9:03 PM

#

errant dust I don't have that issue at all. Quite the contrary. I get contrasty punchy image...

It may be only for certain types of prompts I guess
But it's for sure something I've noticed, again just versus the original SD3 on the same prompt with completely identical settings in Comfy

errant dust Dec 13, 2024, 9:04 PM

#

Show me a prompt

pseudo owl Dec 13, 2024, 9:04 PM

#

mortal mesa now with audio

nice, ltxv with mmaudio?

devout schooner Dec 13, 2024, 9:05 PM

#

I have some outputs saved at home that show this IIRC, I'll find them later

errant dust Dec 13, 2024, 9:05 PM

#

Any prompt. Even the seed number. I will run it and see if they match.

#

Easiest and best way to troubleshoot

mortal mesa Dec 13, 2024, 9:05 PM

#

pseudo owl nice, ltxv with mmaudio?

ya

devout schooner Dec 13, 2024, 9:06 PM

#

It's not a bug, it's definitely just a stylistic difference lol
I'm quite sure my Comfy setup is totally fine

errant dust Dec 13, 2024, 9:07 PM

#

The thing is, I use a very wide variety of prompts, art, design, photorealism, and so on, and have yet to see a single instance that remotely matches your description

lavish sparrow Dec 13, 2024, 9:12 PM

#

devout schooner What were your prompts here?

basically -> an LLM interpreting some random shit XD

#

i have no idea what the actual prompts were

#

i just press generate at this point

errant dust Dec 13, 2024, 9:27 PM

#

Speaking of LLMs, I downloaded and tested one of the quantized versions of Llama 3.3 (30GB file) since my machine has a total of 32GB RAM, and was shocked at how good it was. Truly shocked. I fed it a favorite, asking to describe a scene as if for a role-playing game, and it was absurdly good.

#

Since talk is cheap, here is my query and its reply.

#

Write me a description for a role-playing game in which the player has just entered a small cave. In it are two goblins arguing over a small chest of treasure in the center.

#

As you step into the small cave, the cool darkness envelops you, a stark contrast to the warmth and brightness of
the outside world. The air inside is stale, heavy with the scent of damp earth and mold. Your eyes adjust slowly
to the dim light that filters through the entrance, illuminating the rough stone walls and the uneven floor. Ahead
of you, in the center of the cave, lies a small chest adorned with rusted locks and bands, its wood surface worn
smooth by time and handling.

Two goblins, their wiry frames tense with agitation, stand on either side of the chest, their argument filling the
space with high-pitched voices and animated gestures. They seem oblivious to your presence, too engrossed in their
dispute over the treasure that supposedly lies within the chest. One goblin, slightly taller than the other,
brandishes a crude dagger, its point glinting faintly in the dim light. The shorter goblin, undaunted by the
threat, sneers and spits back, baring yellowed teeth.

The cave around them is sparse, with jagged rocks jutting out from the walls and a scattering of pebbles and dust
covering the ground. A small, dying torch, its flame flickering wildly, casts eerie shadows on the walls, making
it seem as though the very darkness itself moves and twists in response to the goblins' quarrel.

Your entrance has gone unnoticed by the arguing pair, offering you a moment of surprise and the opportunity to
decide how to proceed.

#

This was quite literally my first query, not some result after multiple tries or the like.

mortal mesa Dec 13, 2024, 9:36 PM

#

errant dust Speaking of LLMs, I downloaded and tested one of the quantized versions of Llama...

q3? i tried q4

errant dust Dec 13, 2024, 9:36 PM

#

Q3_K_S

#

it was all I could run. lol

#

but in light of the output, there are no complaints either

mortal mesa Dec 13, 2024, 9:37 PM

#

ya im still in "testing" mode heh

errant dust Dec 13, 2024, 9:42 PM

#

Out of curiosity I reran the exact prompt, and while some things are the same (a few phrase bits), it made multiple and quality changes too.

#

I know these models have a very wide range of uses, even programming is supposed to be top notch, but I like to test the basics too. It must live up to the ANLP after all.

lavish sparrow Dec 13, 2024, 10:12 PM

#

errant dust Q3_K_S

on ram?

#

i'm trying the exaone by lg, i think it's pretty damn good too, to be honest

#


Scene:  You cautiously step into the narrow, dimly lit entrance of a hidden cave, the musty air thick with the scent of damp earth and ancient moss. The flickering light from your torch casts eerie shadows across the rough stone walls, revealing intricate, gnarled formations that hint at eons of silent passage. Ahead, the cave opens slightly, revealing a small, circular chamber where the flickering torchlight illuminates a scene of unexpected chaos. 

The Encounter: 
In the heart of this chamber, two goblins are locked in a heated argument, their small, green forms towering over a modest chest nestled in the center. One goblin, clad in mismatched armor with a crude sword clutched tightly, is red-faced and gesturing wildly, clearly agitated. Its counterpart, perhaps slightly more cunning, is dressed in ragged cloaks, clutching a crude dagger and trying to reason with its compatriot through frantic gestures and shrill shouts. 

Treasure Chest:  The chest in question, though unassuming, gleams faintly with gold and jewels peeking through its worn leather bindings. The treasure within could provide a significant boost to your adventurer's coffers, but navigating the situation requires finesse and perhaps a bit of cunning. 

Choices Ahead:  

    Intervene:  Approach the arguing goblins and attempt to mediate the dispute, perhaps offering a bribe or a clever negotiation to claim the chest without bloodshed.
    Stealth Approach:  Move silently around the chamber, observing the goblins’ behavior for an opportune moment to snatch the chest when their attention is diverted.
    Confrontation:  Charge into the fray, using your combat skills to overpower both goblins and claim the treasure by force.
     

The path you choose will set the tone for your journey ahead, potentially unlocking new alliances, enemies, or hidden paths within this labyrinthine network of caves. What will you do? ```

errant dust Dec 13, 2024, 10:14 PM

#

https://ollama.com/library/llama3.3:70b-instruct-q3_K_S

llama3.3:70b-instruct-q3_K_S

New state of the art 70B model. Llama 3.3 70B offers similar performance compared to Llama 3.1 405B model.

lavish sparrow Dec 13, 2024, 10:15 PM

#

what are you running it on?

errant dust Dec 13, 2024, 10:15 PM

#

In this case, I simply run it in CMD

#

i.e.

lavish sparrow Dec 13, 2024, 10:15 PM

#

what hardware 🙂

errant dust Dec 13, 2024, 10:15 PM

#

ollama run llama3.3:70b-instruct-q3_K_S

#

Ohhhhh.... lol. New laptop I got for BF on a deal ($1070 after tax): i13900-HX (32 threads), 32GB, RTX 4070, Cherry KB

lavish sparrow Dec 13, 2024, 10:17 PM

#

that looks like mostly run in ram then -> must've taken a while ot get that answer

errant dust Dec 13, 2024, 10:17 PM

#

it did, but wasn't back breaking either

#

Since unlike images where you wait until the end, here I got the text output in order, as it was rendered

#

word by word, line by line

lavish sparrow Dec 13, 2024, 10:32 PM

#

marco-o1 is REALLY good for prompt enhancing, if you add a reasoning block (it's actually a reasoning model)

#

#

#

lavish sparrow Dec 13, 2024, 10:59 PM

#

lavish sparrow Dec 13, 2024, 11:23 PM

#

#

#

#

#

#

#

#

#

and time to say goodnight. sd3.5L one of the best models to play around with ❤️

craggy crest Dec 14, 2024, 3:12 AM

#

craggy crest Dec 14, 2024, 5:43 AM

#

lavish sparrow Dec 14, 2024, 6:59 AM

#

lavish sparrow Dec 14, 2024, 7:22 AM

#

the coffee went bad

lavish sparrow Dec 14, 2024, 9:04 AM

#

#

buoyant lark Dec 14, 2024, 9:36 AM

#

prompt:A beautiful woman, black hair, modern dress, sitting on a sofa

lavish sparrow Dec 14, 2024, 9:49 AM

#

buoyant lark prompt:A beautiful woman, black hair, modern dress, sitting on a sofa

unfortunately, this is not an image generation channel

buoyant lark Dec 14, 2024, 9:52 AM

#

Ah, lol, which group is the right one?

lavish sparrow Dec 14, 2024, 9:53 AM

#

i actually have no idea

buoyant lark Dec 14, 2024, 10:16 AM

#

OK, thanks all the same

uneven storm Dec 14, 2024, 1:50 PM

#

buoyant lark Ah, lol, which group is the right one?

there is no free image gen bots here, only local is free

errant dust Dec 14, 2024, 2:07 PM

#

craggy crest

Ah yes, the feared ArmEGGadon

toxic jewel Dec 14, 2024, 2:07 PM

#

prompt = (
"A beautiful Asian woman, wearing stylish and slightly sexy clothes, "
"sitting in a cozy cafe, holding a cup of coffee. "
"Photorealistic, highly detailed, natural lighting."
)

errant dust Dec 14, 2024, 2:13 PM

#

prompt = (
"A user who doesn't waste space asking for free images, knows how to read, wearing stylish and slightly sexy clothes, "
)

remote holly Dec 14, 2024, 2:55 PM

#

lavish sparrow

this is a realy great use of reasoning model

fleet meteor Dec 14, 2024, 3:50 PM

#

Nice lighting! 👏

#

Textures in sd 3.5 are also very good

craggy crest Dec 14, 2024, 4:29 PM

#

bitter hearth Dec 14, 2024, 4:56 PM

#

can GPT O1 make a nice prompt or would it be terrible

devout schooner Dec 14, 2024, 5:30 PM

#

errant dust The thing is, I use a very wide variety of prompts, art, design, photorealism, a...

here's a good example of what I meant
here neither the original SD 3.0 or SD 3.5 Medium really came out particularly good in terms of overall composition and coherency (both would need at least probably upscaling and a second denoise pass), but the stylistic difference is quite obvious
first pic is the original SD 3.0, second one is SD 3.5 Medium, seed was the same
prompt was
a still from a movie, featuring an elderly man dressed in a traditional wizard's attire. He is sitting in a dimly lit, ornate library filled with rows of wooden bookshelves that stretch into the background. The man has a long, flowing white beard and is wearing a large, pointed, black hat that casts a shadow over his face. His robe is also black, made of a heavy, textured fabric that adds to the mystical appearance. He holds an open, large book in his lap, and his hands are resting gently on its pages, suggesting he is deeply engrossed in reading or studying. The room is illuminated by a warm, golden light, which enhances the rich, dark tones of the wood and the man's attire. Several candles in brass holders are placed on wooden tables around the room, their flames flickering softly. Behind the wizard, a large, intricate astronomical globe is mounted on the wall, adding a sense of ancient knowledge and mysticism to the scene. The overall atmosphere is one of quiet reverence and intellectual curiosity, typical of a scholar's sanctuary. The image captures a moment of quiet contemplation and magical study.
Again this definitely isn't a "bug", this is just what SD 3.5 thinks certain prompts should look like for whatever reaon, in contrast to the original SD 3.0
but I'm not sure what actual part of the prompt it is exactly making it think the whole room should appear foggy / hazy

#

my only guess is maybe 3.5 had more training on like classical oil paintings and similar traditional art than the original 3.0
and that's unintentionally bleeding into the image sometimes when certain keywords are hit

fleet meteor Dec 14, 2024, 5:36 PM

#

devout schooner here's a good example of what I meant here neither the original SD 3.0 or SD 3.5...

Just to compare, this is the same prompt on flux schnell (in 4 steps)

devout schooner Dec 14, 2024, 5:47 PM

#

fleet meteor Just to compare, this is the same prompt on flux schnell (in 4 steps)

yeah that's what I'd expect-ish
main reason I know the prompt itself is the cause in any case is that I can also get insanely good, photographically perfect, vibrant images from SD 3.5 Medium in many cases
for not-long prompts, at that
this one was literally just beach, water, rocks, cliff, raw, photo, realistic for example

errant dust Dec 14, 2024, 5:47 PM

#

well, here is what I get with SD3.5

#

fleet meteor Dec 14, 2024, 5:49 PM

#

😁

fleet meteor Dec 14, 2024, 5:50 PM

#

devout schooner yeah that's what I'd expect-ish main reason I know the prompt itself is the caus...

It looks amazing

lavish sparrow Dec 14, 2024, 5:53 PM

#

i dont'care about realism that much... xD

devout schooner Dec 14, 2024, 5:56 PM

#

fleet meteor It looks amazing

yeah, that sort of thing is the main reason I like SD3.5
Flux can't generate images that look like that unless you use Pro Ultra 1.1 in "Raw" mode (which seems to be I guess just their actual full model without any distillation, like even the original "Pro" seems to be distilled in comparison)

lavish sparrow Dec 14, 2024, 5:59 PM

#

fleet meteor 😁

is that the spaghetti universe taking ours over?

#

.

devout schooner Dec 14, 2024, 6:00 PM

#

devout schooner here's a good example of what I meant here neither the original SD 3.0 or SD 3.5...

oh yeah I forgot, this one is with the RealVis guy's WIP finetune of 3.5 Medium, on the same seed and everything, a noticeable improvement overall in comparison to both
so something to keep an eye on IMO if you like 3.5 Medium

fleet meteor Dec 14, 2024, 6:00 PM

#

devout schooner yeah, that sort of thing is the main reason I like SD3.5 Flux can't generate ima...

I love flux but I agree, textures even in the original sd 3.0 are much sharper and detailed

fleet meteor Dec 14, 2024, 6:01 PM

#

devout schooner here's a good example of what I meant here neither the original SD 3.0 or SD 3.5...

sd 3,5 with iphone photo lora (idk if the lora worked in this case)

devout schooner Dec 14, 2024, 6:03 PM

#

fleet meteor sd 3,5 with iphone photo lora (idk if the lora worked in this case)

it's doing the "Large when you img2img" thing around the edges lol

lavish sparrow Dec 14, 2024, 6:03 PM

#

"spaghetti universe taking over reality"

fleet meteor Dec 14, 2024, 6:03 PM

#

Yeah i noticed xd, but its text2img

devout schooner Dec 14, 2024, 6:03 PM

#

devout schooner it's doing the "Large when you img2img" thing around the edges lol

actually 3.0 Medium also had that problem
3.5 Medium is the only one that solves it / doesn't have it

fleet meteor Dec 14, 2024, 6:03 PM

#

It seems it doesn´t support wide resolutions

devout schooner Dec 14, 2024, 6:05 PM

#

fleet meteor It seems it doesn´t support wide resolutions

high res / multires support is again another reason I prefer 3.5 Medium
1440x1440, no problem

fleet meteor Dec 14, 2024, 6:08 PM

#

well time to try

devout schooner Dec 14, 2024, 6:08 PM

#

fleet meteor Just to compare, this is the same prompt on flux schnell (in 4 steps)

Large Turbo is pretty comparable to your Schnell gen, if I take it to square format on the same seed, also 4 steps

devout schooner Dec 14, 2024, 6:08 PM

#

fleet meteor well time to try

i wouldn't use the FP8
GGUF Q8 will be better quality if you can't run the FP16 for some reason
but even that is only like 5GB

#

FP16 T5 is the issue for most people really
just using the GGUF Q8 or FP8 T5 encoder can be enough to get you going in a lot of cases

#

while still running the FP16 version of the actual transformer model

errant dust Dec 14, 2024, 6:12 PM

#

devout schooner yeah, that sort of thing is the main reason I like SD3.5 Flux can't generate ima...

YMMV. Here is what I got on first try:

devout schooner Dec 14, 2024, 6:12 PM

#

errant dust YMMV. Here is what I got on first try:

from Flux?

errant dust Dec 14, 2024, 6:13 PM

#

Yes

fleet meteor Dec 14, 2024, 6:13 PM

#

💀 💀 💀

errant dust Dec 14, 2024, 6:14 PM

#

I usd no LoRA

devout schooner Dec 14, 2024, 6:16 PM

#

errant dust Yes

well again I did say you "have to use Pro 1.1 Ultra in Raw mode" to get similar results to my 3.5 Gen
like if I actually try it on Ultra Raw (only gonna do one cause it costs like a million dollars per gen lol), I get this:

errant dust Dec 14, 2024, 6:16 PM

#

I did not use Flux Pro.

fleet meteor Dec 14, 2024, 6:16 PM

#

devout schooner i wouldn't use the FP8 GGUF Q8 will be better quality if you can't run the FP16 ...

Any idea why comfyui doesn´t recognize the .GGUFs? only the safetensors files are working

devout schooner Dec 14, 2024, 6:16 PM

#

fleet meteor Any idea why comfyui doesn´t recognize the .GGUFs? only the safetensors files ar...

you need uh, the City96 GGUF loader nodes

#

they're in Manager if you don't have them

fleet meteor Dec 14, 2024, 6:17 PM

#

devout schooner you need uh, the City96 GGUF loader nodes

Aaaa thanks, i found it

devout schooner Dec 14, 2024, 6:17 PM

#

errant dust I did not use Flux Pro.

right, yeah, I was saying their most expensive API only model is the only one that can reliably do photographic stuff without any hint of distillation

#

getting random cartoon gens from Flux Dev and such on the other hand is definitely a thing though yeah

errant dust Dec 14, 2024, 6:22 PM

#

#

A random cartoon by Flux Dev

devout schooner Dec 14, 2024, 6:33 PM

#

errant dust

yeah that's about the best Dev can do
it's got that noticeable CGI-like crispiness everywhere though
particularly on the moss and stuff

fleet meteor Dec 14, 2024, 6:34 PM

#

I wonder if dev has that problem only because it is a distilled model , and also if it could be "fixed" with finetunes

errant dust Dec 14, 2024, 6:34 PM

#

It is much lower resolution than the image you posted

#

which is 1800 x 2600

devout schooner Dec 14, 2024, 6:35 PM

#

fleet meteor I wonder if dev has that problem only because it is a distilled model , and also...

it does

#

that's my whole point lol, Dev looks like all distilled models do, overall

toxic bone Dec 14, 2024, 6:36 PM

#

you throw a rank 16 lora trained for 100 steps at flux and it'll blast cartoons out fine

#

trained with 512 x 512 images in fp8 mode

devout schooner Dec 14, 2024, 6:37 PM

#

errant dust It is much lower resolution than the image you posted

yeah I know
all I've been trying to say is that Dev looks like what it is, a distilled model
Ultra Raw 1.1 isn't distilled seemingly and so can do the unfiltered photography stuff properly

#

best example of this is, what people jokingly refer to as "Flux Girl" should really just be called "Distillation Girl"
cause if you generate literally any image of a woman with SD 3.5 Large Turbo specifically, you get results that could easily be mistaken for a Flux Dev gen without Loras

#

the sameface is clearly a result of distillation in general, basically

toxic bone Dec 14, 2024, 6:44 PM

#

you guys using comfyui? it doesn't make the knob fiddling very easy. on forge i dialed in a couple settings, prompted for a cartoon dog, and got a cartoon dog

#

flux isn't that hard to crack through the distillation

#

#

i get it. it's difficult to load an entirely new worksheet for every situation, but maybe it's just a problem with your node graph and not the model

devout schooner Dec 14, 2024, 6:46 PM

#

toxic bone you guys using comfyui? it doesn't make the knob fiddling very easy. on forge ...

I think you've missed my point
which was just "Dev is sometimes stylistically inconsistent for terse prompts, and won't always respect single keywords like photo"

#

like I've released numerous Flux Dev loras myself lol, I'm well versed in it

toxic bone Dec 14, 2024, 6:47 PM

#

devout schooner yeah that's about the best Dev can do it's got that noticeable CGI-like crispine...

i must've missed that point when you said "Yup that's about all you can get outta it"

devout schooner Dec 14, 2024, 6:48 PM

#

toxic bone i must've missed that point when you said "Yup that's about all you can get outt...

I was saying the actual visual appearance of his photographic beach Dev gen was the most photographic Dev can look without Loras that brute force realism back in, basically

#

so still I think misinterpreting what I said

toxic bone Dec 14, 2024, 6:49 PM

#

the picture was labelled "a random cartoon" and posted in context of flux making cartoons.. so i guess there was hidden context i wasn't aware of

devout schooner Dec 14, 2024, 6:49 PM

#

toxic bone the picture was labelled "a random cartoon" and posted in context of flux making...

he was being sarcastic as far as I can tell
possibly also not quite getting what i meant

#

again my entire point boils down to "Dev looks like exactly what it is" lol, nothing else

errant dust Dec 14, 2024, 6:50 PM

#

toxic bone the picture was labelled "a random cartoon" and posted in context of flux making...

Correct. He was saying the images cannot attain photorealism and random cartoons are the norm

toxic bone Dec 14, 2024, 6:50 PM

#

i just wanted to clear the misconception up that flux is incapable of cutting through distillation aesthetic

devout schooner Dec 14, 2024, 6:50 PM

#

errant dust Correct. He was saying the images cannot attain photorealism and random cartoons...

no lol

devout schooner Dec 14, 2024, 6:50 PM

#

devout schooner no lol

see here
#🆕｜sd3 message

toxic bone Dec 14, 2024, 6:50 PM

#

devout schooner no lol

dude is a troll. i'll ignore for future reference. seems he's an AP that wants to derail discussions

devout schooner Dec 14, 2024, 6:50 PM

#

maybe this is some kind of language barrier thing

#

not sure

devout schooner Dec 14, 2024, 6:51 PM

#

toxic bone dude is a troll. i'll ignore for future reference. seems he's an AP that wants ...

?????

errant dust Dec 14, 2024, 6:51 PM

#

??

toxic bone Dec 14, 2024, 6:51 PM

#

not you. you know who i mean

errant dust Dec 14, 2024, 6:51 PM

#

Anyhow, here is your reply Zoot:

#

ZootAllures91 — Today at 3:17 PM
right, yeah, I was saying their most expensive API only model is the only one that can reliably do photographic stuff without any hint of distillation
getting random cartoon gens from Flux Dev and such on the other hand is definitely a thing though yeah

devout schooner Dec 14, 2024, 6:51 PM

#

yeah

#

the last part of that what I meant was

#

it can be more random

#

than Ultra Raw ever will be

#

it doesn't know what "photo" means as a single keyword as reliably

#

basically

#

a separate point from the appearance of the images themselves

errant dust Dec 14, 2024, 6:53 PM

#

YMMV, but I had zero issues with the beach image it produced.

#

As to SD3.5 Medium (I was using SD Large ass I did not have it installed), here is the output on your prompt:

devout schooner Dec 14, 2024, 6:55 PM

#

errant dust YMMV, but I had zero issues with the beach image it produced.

yeah it's not "bad", it's just, distilled
I like Flux for a lot of things but "hard realism" just isn't a strong suit
i even made a Lora to try and address it for people myself lol
https://civitai.com/models/790722/zoots-human-photo-realmaxxer-for-flux

errant dust Dec 14, 2024, 6:55 PM

#

devout schooner Dec 14, 2024, 6:56 PM

#

errant dust

yeah this one is pretty good, or better at least
could just be a seed thing
there's still definitely some varying keyword behaviour between 3.0 and 3.5 though

errant dust Dec 14, 2024, 6:57 PM

#

I would hope so since the original had so many weird issues it was bound to have an effect on non human anatomy images

#

But I only use 3.5L as a rule anyhow

devout schooner Dec 14, 2024, 6:57 PM

#

Also here's a good example of Flux Dev's sometimes odd prompt behavior I just did real quick on CivitAI
1girl is MASSIVELY weighted towards Anime, in Flux Dev
so much so that it almost always overpowers everything else

errant dust Dec 14, 2024, 6:58 PM

#

what is '1girl

#

?

#

some cartoon character?

bitter hearth Dec 14, 2024, 6:58 PM

#

its a booru tag

devout schooner Dec 14, 2024, 6:58 PM

#

errant dust what is '1girl

it's a general tag meaning "one female person / character / etc" basically
often used in the SD 1.5 days for tagging stuff

errant dust Dec 14, 2024, 6:59 PM

#

the whole point of T5 is NLP, not the ancient single word prompting of the past

#

Complaining the AI cannot understand some weird name such as 1girl is not really much of a complaint to my eyes

devout schooner Dec 14, 2024, 7:01 PM

#

errant dust the whole point of T5 is NLP, not the ancient single word prompting of the past

I'm just saying any appearance of 1girl with Flux Dev even within a complete sentence that should arguably produce a photo, as I did, almost always results in an anime gen no matter what
as far as I can also tell Flux actually IS trained on Booru tags as well as natural language
like when you actually want anime from it, prompting only with tags generally works very well

bitter hearth Dec 14, 2024, 7:01 PM

#

the Pony crowd like to prompt that way

errant dust Dec 14, 2024, 7:01 PM

#

bitter hearth its a booru tag

A boohoo tag?

bitter hearth Dec 14, 2024, 7:01 PM

#

its a thing from Japan

#

it means image board in Japanese

#

and they post tagged anime images there

devout schooner Dec 14, 2024, 7:02 PM

#

devout schooner I'm just saying any appearance of `1girl` with Flux Dev even within a complete s...

like if Flux Dev wasn't trained on Booru tags pretty aggressively this wouln't be a thing, it wouldn't go "anime time" in this context, I don't think

errant dust Dec 14, 2024, 7:04 PM

#

ok, a fair point but if 1girl is used almost exclusively in an anime setting and you way you want a photo of an anime girl.....

#

As to the Pony crowd..... cry me a river

devout schooner Dec 14, 2024, 7:05 PM

#

bitter hearth the Pony crowd like to prompt that way

I train all my Loras concatenating Florence 2 Large "more detailed" mode natural language captions with Booru tags from wd-eva02-large-tagger-v3 right after them, in the same file
best approach I've found
cause each captioner tends to get stuff the other doesn't
so the combination is better than eiteher alone

eternal steppe Dec 14, 2024, 7:05 PM

#

flux with dynamic thresholding is still my go-to

#

🤷‍♂️

errant dust Dec 14, 2024, 7:06 PM

#

Well, as far as I'm concerned it does a nice job in photorealism, though to be fair I quite literally never do any photorealism with them

#

Here is a higher res image with that beach prompt

#

eternal steppe Dec 14, 2024, 7:07 PM

#

ComfyUI-dpmpp_2m-3.5-30-2024-12-14_13-20-41-0050.webp

#

#

#

ComfyUI-dpmpp_2m-3.5-30-2024-12-14_13-29-41-0052.webp

#

ComfyUI-dpmpp_2m-3.5-30-2024-12-14_13-38-42-0054.webp

eternal steppe Dec 14, 2024, 7:11 PM

#

devout schooner best example of this is, what people jokingly refer to as "Flux Girl" should rea...

I always find it so funny when people argue the realism of any model

#

It's so pointless to argue anything if you can just generate an image with one model then do a second pass with any other model of any style

#

Flux Redux helps with this magnificently combined with loras

bitter hearth Dec 14, 2024, 7:12 PM

#

you lose the flux VAE though

#

if you do that

eternal steppe Dec 14, 2024, 7:12 PM

#

bitter hearth you lose the flux VAE though

What are you talking about

bitter hearth Dec 14, 2024, 7:12 PM

#

if you do a workflow that first pass fluv dev and second pass SDXL
your image exits via the SDXL VAE

eternal steppe Dec 14, 2024, 7:12 PM

#

Then you can just re-encode it. Again.

#

This changes nothing lmao

#

If vae was an issue, you could literally just make a vae encodedecode loop

#

and loop hundreds of ksamplers

bitter hearth Dec 14, 2024, 7:15 PM

#

you can't undo the quality loss from the SDXL vae though

eternal steppe Dec 14, 2024, 7:16 PM

#

bitter hearth you can't undo the quality loss from the SDXL vae though

Quality loss how? How much quality is actually lost per-decode when you are quite literally re-inferecing the image every time in the ksampler

#

The main issue at hand with diffusers-based transformers is temporal consistency

#

The changing and/or loss of data based on denoise value and steps

#

I don't think vae decode is relevant in terms of loss quality

#

bitter hearth Dec 14, 2024, 7:19 PM

#

if you try img-to-img with SDXL, lots of k-samplers at 0.5 denoise
the image quite quickly gets destroyed

#

it isn't able to fix the damage

eternal steppe Dec 14, 2024, 7:19 PM

#

Well yeah, because you're denoising half of the image for the model fo inference upon.

#

Have you tried preventing it via other methods?

#

Canny, Depth controlnets

#

IPadapters

bitter hearth Dec 14, 2024, 7:20 PM

#

was still the same issue

eternal steppe Dec 14, 2024, 7:20 PM

#

I use a very low denoise combined with flux redux to maintain quality

#

ComfyUI-dpm_2-3.5-30-2024-12-12_03-46-14-0046.webp

#

#

original

#

ComfyUI-dpmpp_2m-3.5-30-2024-12-14_15-15-04-0068.webp

#

brocolli in the clouds

#

its raining legumes

bitter hearth Dec 14, 2024, 7:22 PM

#

if you try 50 ksamplers in a row you will see what I mean
the image is so damaged its not gonna be saved

eternal steppe Dec 14, 2024, 7:23 PM

#

redux itself is a style model

#

it applies the image into conditioning

toxic bone Dec 14, 2024, 7:23 PM

#

anime image boards have been ready for datasets with captions for years. those people have had taxonomical tagging down to a science for the entire 21st century. nobody tags image collections like anime enjoyers

eternal steppe Dec 14, 2024, 7:24 PM

#

That's what I use in combination with lower denoise

eternal steppe Dec 14, 2024, 7:24 PM

#

toxic bone anime image boards have been ready for datasets with captions for years. those ...

WD-14 tagger can give decently accurate image tagging results in danbooru tags for any image

#

But ik what you guys are talkin bout

#

and i aint going down that rabbit hole

toxic bone Dec 14, 2024, 7:25 PM

#

danbooru is one i avoid. so much extreme smut there

eternal steppe Dec 14, 2024, 7:25 PM

#

I do not judge people for their interests, just actions.

toxic bone Dec 14, 2024, 7:25 PM

#

there's many sites that have used image tagging and it's not specifically a smut thing

eternal steppe Dec 14, 2024, 7:25 PM

#

Rocket Fish.

ComfyUI-dpmpp_2m-3.5-30-2024-12-14_15-25-45-0071.webp

toxic bone Dec 14, 2024, 7:25 PM

#

and it was done long before any ai classifiers existed. WD tagger was trained on those datasets

eternal steppe Dec 14, 2024, 7:26 PM

#

I'm literally just letting flux make its own images.

#

Using an LLM.

#

I gave an LLM a sysprompt, and a constantly changing user prompt with a seed value.

bitter hearth Dec 14, 2024, 7:28 PM

#

you can throw WD tagger into the captioning mix if you want yeah
what Sana does is use four captioning models
and then it selects the caption with the highest clip score for the image
and uses that caption

eternal steppe Dec 14, 2024, 7:29 PM

#

Sadly for me I just can't use SANA properly rn.

#

I can inference it but it's vae outputs on intel arc aren't good

bitter hearth Dec 14, 2024, 7:29 PM

#

ah I don't know intel arc too well

eternal steppe Dec 14, 2024, 7:29 PM

#

A lot better supported than it used to be

#

I'm glad I bought the A770 at the time

#

16gb of vram and 4060-level speeds

#

good enough for me

bitter hearth Dec 14, 2024, 7:31 PM

#

intel CPU can be okay as well

#

with Onnx or OpenVino

eternal steppe Dec 14, 2024, 7:31 PM

#

or IPEX*

#

As that also exists for them

#

#

Nice leaves.

toxic bone Dec 14, 2024, 7:32 PM

#

for gamers it truly is a golden age of hyper graphics

eternal steppe Dec 14, 2024, 7:32 PM

#

The universe has decided to play an elaborate game of hide and seek with the moon, and as it's now in a never-ending cycle of falling off its axis due to an infinite loop, time itself has begun to unravel, creating ripples through the fabric of existence. The sky is painted in hues of violet and silver, with occasional patches of neon green swirling about like psychedelic storms. Meanwhile, giant mutant lobsters have taken over New York City, transforming it into a sprawling underwater metropolis teeming with bioluminescent flora

#

lol one of the prompts

toxic bone Dec 14, 2024, 7:32 PM

#

3 manufacturers all putting out budget level cards that can smash FHD resolutions on any game on the market. its a good time to be a gamer

eternal steppe Dec 14, 2024, 7:33 PM

#

I should be using gemini flash 2.0 from openrouter, they got it as a free model

#

https://openrouter.ai/google/gemini-2.0-flash-exp:free

Gemini 2.0 Flash Experimental (free) - API, Providers, Stats

Gemini 2.0 Flash offers a significantly faster time to first token (TTFT) compared to [Gemini 1. Run Gemini 2.0 Flash Experimental (free) with API

#

probably get better prompts

toxic bone Dec 14, 2024, 7:34 PM

#

phi-4 is out now too

eternal steppe Dec 14, 2024, 7:34 PM

#

Is it? I wonder how good it is.

#

phi 4 14b

#

ok

#

it competes with 70b models

#

ok

toxic bone Dec 14, 2024, 7:34 PM

#

https://huggingface.co/matteogeniaccio/phi-4/tree/main quants

eternal steppe Dec 14, 2024, 7:35 PM

#

I'd been a good frontend that supports it then

#

IPEX OLLAMA defo wont

#

https://ollama.com/vanilj/Phi-4:Q4_K_M

vanilj/Phi-4:Q4_K_M

Microsoft's Phi 4 model

#

hmm

toxic bone Dec 14, 2024, 7:36 PM

#

i use koboldcpp for llms but i dont know if it has phi4 support or not

eternal steppe Dec 14, 2024, 7:36 PM

#

wait

#

it says the arch it uses is phi3

#

phi3 itself is supported

bitter hearth Dec 14, 2024, 7:37 PM

#

ah thanks I didnt know about IPEX

eternal steppe Dec 14, 2024, 7:37 PM

#

IPEX is how I'm running ComfyUI and LLMs locally

#

And now with Xess2 and XeLL

#

Intel's really starting to pick up the pace on competition

bitter hearth Dec 14, 2024, 7:38 PM

#

I've started doing JAX stuff

#

TPUs scale a bit better for total TFLOPs/$ than Nvidia at the moment

toxic bone Dec 14, 2024, 7:39 PM

#

i trust that intel is going to create some ML instruction sets directly on their future CPUS. i was going to boost to a rocket lake or whatever. but then those power issues showed up. going to wait for the next big generation of their CPUs with some serious improvements. my alderlake works good nuff for now

eternal steppe Dec 14, 2024, 7:39 PM

#

come to danny

#

toxic bone Dec 14, 2024, 7:40 PM

#

would ipex use my i7's integrated gpu very well you figure?

bitter hearth Dec 14, 2024, 7:40 PM

#

all the cloud servers I use have been AMD

eternal steppe Dec 14, 2024, 7:40 PM

#

Ok.

#

Phi-4 works on ollama 0.3.6

#

So

#

it works on intel

bitter hearth Dec 14, 2024, 7:40 PM

#

toxic bone would ipex use my i7's integrated gpu very well you figure?

I think you want openVINO for CPU

eternal steppe Dec 14, 2024, 7:41 PM

#

#

Welp no need to use anything else

bitter hearth Dec 14, 2024, 7:41 PM

#

https://github.com/rupeshs/fastsdcputhis is a good openVINO SD implementation

toxic bone Dec 14, 2024, 7:41 PM

#

the integrated gpu is a gpu. separate from the cpu cores

eternal steppe Dec 14, 2024, 7:41 PM

#

For individuals on

#

nvidia

#

use svdquant

fleet meteor Dec 14, 2024, 7:42 PM

#

eternal steppe

😁 nice gen!

eternal steppe Dec 14, 2024, 7:42 PM

#

fleet meteor 😁 nice gen!

Thank you.

#

https://github.com/mit-han-lab/nunchaku/tree/main/comfyui

GitHub

nunchaku/comfyui at main · mit-han-lab/nunchaku

SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models - mit-han-lab/nunchaku

#

same guys who worked on the dc-ae used for sana

#

SVDQuant is a super-performant quantization

toxic bone Dec 14, 2024, 7:42 PM

#

i just want to load an llm on something other than my nvidia, so that i can use it in tandem with image gen

#

i got a 4080 for images

eternal steppe Dec 14, 2024, 7:42 PM

#

toxic bone i just want to load an llm on something other than my nvidia, so that i can use ...

Oh

#

Then use llama cpp

#

lmao

#

or

#

ollama cp

#

u

#

and they would both work

toxic bone Dec 14, 2024, 7:43 PM

#

yeah so no benefit to using integrated gpu then. just go with cpu

eternal steppe Dec 14, 2024, 7:43 PM

#

Not really no. You'll get more FLOPs out of the cpu alone

#

Otherwise using your GPU would be more sensical

eternal steppe Dec 14, 2024, 7:43 PM

#

toxic bone i just want to load an llm on something other than my nvidia, so that i can use ...

Also I do this already

#

You can use ollama with OLLAMA_KEEP_ALIVE="0"

#

in order to load a model, generate and unload after

#

phi-4 is nuts

toxic bone Dec 14, 2024, 7:50 PM

#

good to know xD

bitter hearth Dec 14, 2024, 7:50 PM

#

svdquant is what I currently use yeah

eternal steppe Dec 14, 2024, 7:51 PM

#

I can't use it myself

bitter hearth Dec 14, 2024, 7:51 PM

#

its on the tensorRT model optimiser roadmap
so it will improve soon

#

I don't have a GPU so I just use cloud

eternal steppe Dec 14, 2024, 7:52 PM

#

openrouter

#

free models

#

🤷‍♂️

bitter hearth Dec 14, 2024, 7:53 PM

#

there's always gonna be limits on free stuff

#

rate limits etc

#

and lower speeds

eternal steppe Dec 14, 2024, 7:58 PM

#

#

In the mystical city of Quagmire, where skyscrapers were made entirely of licorice and pigeons sported top hats adorned with tiny monocles, the citizens lived in an uproar of chaos and calm. One day, a peculiar event unfolded: Mr. Puddingpot, the local wizard with a penchant for mismatched socks and rainbow-colored waffles, decided to invent time-traveling croquet mallets. The invention was powered by the synchronized chirping of rare golden geese—each bird had their own distinct jingle tune that resonated only when they saw a full moon through a kaleidoscope lens. Meanwhile, in the corner of Puddingpot's laboratory, Lady Zinnia—a dragonfly with an extraordinary collection of porcelain teacups and a slight allergy to rhinestones—was knitting scarves out of spider silk laced with fairy dust for her upcoming tea party with the whispering willow trees. As fate would have it, Sir Whifflebottom, a knight whose armor was perpetually sticky from jellybean juice spills, had just stumbled upon a magical map that only appeared in rainbows on Thursdays when painted by left-handed penguins. This extraordinary day promised to bring even more strange and

#

Phi-4 is good at prompting.

bitter hearth Dec 14, 2024, 7:58 PM

#

thats good yeah

eternal steppe Dec 14, 2024, 7:58 PM

#

I asked for a "random things" prompt.

bitter hearth Dec 14, 2024, 7:59 PM

#

I like to use florence 2 first

#

then second LLM improves

eternal steppe Dec 14, 2024, 8:00 PM

#

I actually use wd-14 with underscores removed

#

lmao

#

but since im currently using flux

#

i actually don't need to make an image2image prompt

#

redux with clip vision does that for me

bitter hearth Dec 14, 2024, 8:01 PM

#

I haven't tried redux yet it seems good

eternal steppe Dec 14, 2024, 8:01 PM

#

I can test an image for you if wanted

bitter hearth Dec 14, 2024, 8:02 PM

#

its ok I've seen a lot of examples

eternal steppe Dec 14, 2024, 8:02 PM

#

👍

lavish sparrow Dec 14, 2024, 8:23 PM

#

#

#

devout schooner Dec 14, 2024, 8:44 PM

#

eternal steppe Quality loss how? How much quality is actually lost per-decode when you are quit...

if you take any image, and do a straight encode-then-decode in Comfy
with nothing in between
it's pretty noticeable
for fine details
left is original, right is passed through SDXL VAE
look at her eyes at full resolution, in particular

#

(this is an actual photograph of Madison Beer to be clear, not an AI generated image to begin with)

#

so you can see how eyes that aren't necessarily even as large in the overall image would eventually become kinda weird
when training a whole model with the XL VAE
as is the case in base SDXL

eternal steppe Dec 14, 2024, 8:53 PM

#

devout schooner if you take any image, and do a straight encode-then-decode in Comfy with nothin...

There are things like latent interpolation nodes to mitigate this issue isn't there?

#

nope nvm

#

even that has differences between each gen

bitter hearth Dec 14, 2024, 8:55 PM

#

most people its a small issue cos they don't have many k-samplers

#

but I like weird long complex workflows
and I get limited by it

#

spamming dozens of k-samplers is what I'd like to do, will be possible one day

eternal steppe Dec 14, 2024, 8:56 PM

#

Well crap. Ipex-llm was updated and now it broke my xpu support

#

lol

bitter hearth Dec 14, 2024, 8:58 PM

#

lol yeah stuff breaks all the time

devout schooner Dec 14, 2024, 8:59 PM

#

bitter hearth most people its a small issue cos they don't have many k-samplers

yeah one pass isn't a huge deal
but more than one quickly degrades on XL
SD 3.5 and Flux you have a lot more leeway
cause the degradation per pass is far less

bitter hearth Dec 14, 2024, 9:05 PM

#

yes with flux, using a lot of colour match nodes in between, I was able to use like 20 VAE cycles once

lavish sparrow Dec 14, 2024, 9:07 PM

#

#

fleet meteor Dec 14, 2024, 9:55 PM

#

Do anyone knows if last sd forge update broke something? Im getting 31.33 seconds per iteration in flux and I was getting 4-6s/i

toxic bone Dec 14, 2024, 11:10 PM

#

forge doesn't get many updates too often. the issue might be elsewhere.

I went to update my nvidia drivers last week, and it installed this new thing to replace geforce experience. "nviida app" .. it's a complete hog and i hate it. i've had so many issues since. Overlay is turned off but it's still attaching some container process to everything. It's crap and is not ready for them to deploy. But i had to get it to get the new driver so whatever. fuck.

fleet meteor Dec 14, 2024, 11:23 PM

#

toxic bone forge doesn't get many updates too often. the issue might be elsewhere. I wen...

Damn I have not idea what it could be then, and on google only a few people had the same problem, i´ll reinstall the gpu driver and if it doesnt work i´ll have to reinstall cuda again (I hope not windows 💀 )

fleet meteor Dec 14, 2024, 11:39 PM

#

I think I´ve downloaded the wrong fp8 version, im downloading the 17gb one this time

fleet meteor Dec 15, 2024, 12:17 AM

#

Its fixed!

craggy crest Dec 15, 2024, 3:09 AM

#

remote holly Dec 15, 2024, 12:45 PM

#

turbid grotto Dec 15, 2024, 4:55 PM

#

why bro brought midjourney bot here

remote holly Dec 15, 2024, 5:32 PM

#

hyper realistic cinematic photo, candid, black and white, professional photo, fashion, a woman with dark hair standing leaning against a wall blowing a light transparent smoke, she is wearing a white t-shirt with a long black coat, long exposure, in a dark setting, low light, dynamic shadows, volumetric light, 8k, detailed, masterpiece

lavish sparrow Dec 15, 2024, 6:54 PM

#

remote holly Dec 15, 2024, 7:14 PM

#

lavish sparrow

i love it

lavish sparrow Dec 15, 2024, 7:15 PM

#

remote holly i love it

the only human hands involved in this image is an idiot creating a workflow so he doesn't have to actually prompt ...

#

my workflow also does well with actual prompts...

#

gleaming wigeon Dec 15, 2024, 8:08 PM

#

The

vestal wadi Dec 15, 2024, 9:28 PM

#

top

rapid pivot Dec 15, 2024, 9:39 PM

#

remote holly `hyper realistic cinematic photo, candid, black and white, professional photo, f...

Smoking is bad

#

bonk

rapid pivot Dec 15, 2024, 9:40 PM

#

lavish sparrow

What in the vaal is this thomas

remote holly Dec 15, 2024, 10:18 PM

#

rapid pivot Smoking is bad

Yeah

silver bridge Dec 15, 2024, 11:30 PM

#

So how is SD 3.5 looking? And u guys use large turbo, large or medium? I’m considering using it

pseudo owl Dec 15, 2024, 11:36 PM

#

silver bridge So how is SD 3.5 looking? And u guys use large turbo, large or medium? I’m consi...

Sd3.5 large is pretty great, a good alternative to flux dev.

Turbo is good too but requires 4+ steps and I think shuttle3(flux schnell finetune) surpasses it since it can do same if not better quality at 2steps and can do 2k images.

Medium is probably the sota base model for its size but I think the low-step models are better choices since you can just use quantization with them for speed-ups and lower vram usage. Shuttle3 is probably faster then medium.

errant dust Dec 15, 2024, 11:42 PM

#

https://www.reddit.com/r/LocalLLaMA/comments/1heokci/pixtral_qwen2vl_are_coming_to_ollama/

From the LocalLLaMA community on Reddit: Pixtral & Qwen2VL are comi...

Explore this post and more from the LocalLLaMA community

#

These are large powerful models. Qwen is 72B and might take the throne of best image describer (for local installs)

#

Their Qwen 2.5 LLM is comparable to Llama 3.3 in quality

#

Heated race between Meta and Alibaba on that front

silver bridge Dec 15, 2024, 11:56 PM

#

pseudo owl Sd3.5 large is pretty great, a good alternative to flux dev. Turbo is good too ...

So any tips whenever using 3.5 large

pseudo owl Dec 15, 2024, 11:58 PM

#

errant dust https://www.reddit.com/r/LocalLLaMA/comments/1heokci/pixtral_qwen2vl_are_coming_...

Exllamav2 already supports both and is almost 2x faster then llama.cpp/ollama while using less vram.

silver bridge Dec 16, 2024, 12:06 AM

#

pseudo owl Exllamav2 already supports both and is almost 2x faster then llama.cpp/ollama wh...

Is promoting with 3.5 large sim to flux or the same?

pseudo owl Dec 16, 2024, 12:29 AM

#

silver bridge Is promoting with 3.5 large sim to flux or the same?

Yep prompting is similar

errant dust Dec 16, 2024, 12:35 AM

#

I assume it accepts the various quantized models?

#

GGUF, IQxXXS, etc

pseudo owl Dec 16, 2024, 12:39 AM

#

errant dust I assume it accepts the various quantized models?

It has its own quant format, not gguf but yes, it can also do decimals like 4.5bpw or 2.4bpw.

Qwen models(2.3bpw-6.0bpw): https://huggingface.co/turboderp/Qwen2-VL-72B-Instruct-exl2

errant dust Dec 16, 2024, 12:44 AM

#

The interface will download and install the models?

sturdy pine Dec 16, 2024, 1:46 AM

#

is comfy UI completely separate from 3.5 medium, large, and large turbo? I just installed it and it never gave me a chance to choose

lucid swift Dec 16, 2024, 2:26 AM

#

sturdy pine is comfy UI completely separate from 3.5 medium, large, and large turbo? I just ...

here is a tutorial for flux and comfy https://comfyanonymous.github.io/ComfyUI_examples/flux/

ComfyUI_examples

Flux Examples

Examples of ComfyUI workflows

sturdy pine Dec 16, 2024, 2:27 AM

#

thx

lucid swift Dec 16, 2024, 2:27 AM

#

sturdy pine thx

allso the images on that page are workflows. so you can downdle them and drag and drop them into comfyui to get the workflwos that are deskibed

#

and you shuld probably use the fp8 checkpoint it uses less vram

sturdy pine Dec 16, 2024, 2:29 AM

#

Thx. I think I'm finally getting it now. Glad that link actually says where to put everything

lucid swift Dec 16, 2024, 2:30 AM

#

yes its very helpful i also used it xD

craggy crest Dec 16, 2024, 3:20 AM

#

https://huggingface.co/crystalwizard/Surreal-abstractions sd 3.5 large lora

crystalwizard/Surreal-abstractions · Hugging Face

craggy crest Dec 16, 2024, 8:07 AM

#

cunning kraken Dec 16, 2024, 9:26 AM

#

Amara, an anthropomorphic owl with a serene expression, prepares matcha in a traditional Japanese tea house. She wears a simple kimono with floral patterns and carefully whisks the bright green tea in a ceramic bowl. The room is minimalistic, with tatami mats, sliding shoji doors, and a small zen garden visible outside. Delicate steam rises from the bowl, and the atmosphere is tranquil, with soft sunlight filtering through bamboo trees. The scene captures a moment of calm precision and elegance.

lavish sparrow Dec 16, 2024, 9:30 AM

#

#

remote holly Dec 16, 2024, 9:53 AM

#

https://lineargen.github.io/

LinGen: Towards High-Resolution Minute-Length Text-to-Video Generat...

We propose a Linear-complexity text-to-Video Generation (LinGen) framework that scales linearly in the number of pixels. For the first time, LinGen enables high-resolution minute-length video generation on a single GPU without compromising quality.

#

Linear complexity 🤯

#

I not will be suprised if one day we reach log(n) complexity

lavish sparrow Dec 16, 2024, 9:57 AM

#

@remote holly that's what one of the path of exile dev's figured how to do with screen space reflections

#

doing ray-casting (path tracing) with log(n) complexity

#

so for each sample point the next point will be cheaper. he held a talk about it. when i was watching my mind was like "poooof"

#

raytracing becoming CHEAPER for each increase in resoluation instead of more expensive.

lavish sparrow Dec 16, 2024, 10:18 AM

#

#

#

#

#

#

getting pretty reliable text out here 😮

#

remote holly Dec 16, 2024, 1:54 PM

#

lavish sparrow raytracing becoming CHEAPER for each increase in resoluation instead of more exp...

yeah i see they used a manba for the attention , that's a good optimization

bitter hearth Dec 16, 2024, 2:05 PM

#

non-quadratic attention is what Sana does also

bitter hearth Dec 16, 2024, 3:10 PM

#

is there an optimal size for a flux image?

#

yeah 384x384 in my experience

lavish sparrow Dec 16, 2024, 4:13 PM

#

#

#

#

#

remote holly Dec 16, 2024, 4:14 PM

#

bitter hearth non-quadratic attention is what Sana does also

I didn't know this fact , they removed the DiT ?

bitter hearth Dec 16, 2024, 4:16 PM

#

remote holly I didn't know this fact , they removed the DiT ?

no they made a research breakthrough
and managed to use linear attention with DiT

remote holly Dec 16, 2024, 4:26 PM

#

bitter hearth no they made a research breakthrough and managed to use linear attention with Di...

Ha , it keeps a good quality ?

bitter hearth Dec 16, 2024, 4:27 PM

#

well its fast

#

quality I am not sure

#

depends how well it trains

dry wave Dec 16, 2024, 5:29 PM

#

bitter hearth no they made a research breakthrough and managed to use linear attention with Di...

breakthrough?
I'm sceptical. So far there are plenty of papers about howto get rid of quadratic attention and none of them works as good as quadratic attention.
All images from Sana I have seen so far look relatively bad

bitter hearth Dec 16, 2024, 5:33 PM

#

yeah I guess breakthrough was too strong of a term when its possible that it is worse LOL

#

the research on VAE speedups might be more valuable

#

I struggle with VAE encode/decode times when doing something like 4k SDXL with 4 steps

#

feels weird when VAE encode/decode time is longer than the Unet time

#

for non-linear attention the Mamba stuff might be more promising, the Dimba, Dim or ZigMa models

dry wave Dec 16, 2024, 5:41 PM

#

bitter hearth the research on VAE speedups might be more valuable

I don't like this either xD

#

its like Stable Cascade. Sounds good in the beginning, but comes with a lot of flaws

#

if you train your own custom finetune you usually don't have to finetune the vae. I bet, however, you would have to do that for Sana

#

their vae has a much higher compression factor that probably won't work well with custom finetunes

#

(same happened for StableCascade with their stage b model)

bitter hearth Dec 16, 2024, 5:43 PM

#

I feel like there is a middle ground between the SD 1.5, SDXL and Flux VAEs and their TAESD versions
even going a bit faster for a bit of quality loss would be welcome sometimes

#

the TAESD versions are just too bad I can't get them to work a lot of the time

#

I wasn't aware that you had to fine tune the VAE for Cascade fine tunes, that does sound pretty bad

#

I skipped stable Cascade cos I didn't like the aesthetics

#

if we could get just one photography/realism checkpoint for Sana that would be good maybe
my bigger issue with Sana is that it is not that much faster than Schnell

#

particularly the Schnell finetune Shuttle Diffusion

#

which happily does 2560x2560 in 2 steps

pseudo owl Dec 16, 2024, 6:25 PM

#

Yeah I think vaes can definitely be improved. This is reducio vae, sota quality and compresses 4096x times:https://huggingface.co/microsoft/Reducio-VAE
For reference, mochi-1 compresses 384x and Hunyuan does 512x. Reducio vae is similar if not better quality and does 4096x. Limited to 1sec sadly.

bitter hearth Dec 16, 2024, 6:27 PM

#

we're gonna need it for images if diffusion is going to be 1 step

dry wave Dec 16, 2024, 6:28 PM

#

that's video.

#

of course you can compress videos stronger than images

pseudo owl Dec 16, 2024, 6:29 PM

#

dry wave of course you can compress videos stronger than images

Yeah it still does 1024x for images, in real 128x I believe. Which is still massive, stable cascade/sana are only 32x and that was the highest.

dry wave Dec 16, 2024, 6:29 PM

#

you can also compress images arbitrarily strong, but is it worth it? At some point if you compress your image too strong you end up with a GAN like model which lacks any creativity and cannot be finetuned anymore

pseudo owl Dec 16, 2024, 6:30 PM

#

For fast real-time generations, probably a good idea. Vae needs to be probably lightweight too though.

dry wave Dec 16, 2024, 6:32 PM

#

I don't know. You need a strong compression cause attention is quadratic. Maybe we have to overcome the transformer architecture instead

bitter hearth Dec 16, 2024, 6:51 PM

#

in the LLM world I think they got further with some of the non-quadratic attention stuff
they got at least a 7B or something

#

I'm in the "focus on scaling" camp mostly, I think in the short term we just need to keep on scaling transformers
can distill from the larger ones

pseudo owl Dec 16, 2024, 7:13 PM

#

bitter hearth in the LLM world I think they got further with some of the non-quadratic attenti...

There are actually hybrid models large as this: https://huggingface.co/ai21labs/AI21-Jamba-1.5-Large

ai21labs/AI21-Jamba-1.5-Large · Hugging Face

craggy crest Dec 16, 2024, 7:13 PM

#

dry wave Dec 16, 2024, 8:55 PM

#

bitter hearth in the LLM world I think they got further with some of the non-quadratic attenti...

not really. Yes, there is mamba and xlstm and stuff but it just does not work as good as attention

#

they have nice benchmarks showing they are as good as attention. But just in their benchmark. As soon as you leave their narrow benchmarks attention outperforms this stuff clearly

#

what works, however, are hybrid architectures where you mix global and local approaches. They show better performance while dropping only very slightly in quality. However, this approach don't solve the scaling issue, it just makes it more efficient but quadratic runtime is still quadratic runtime

bitter hearth Dec 17, 2024, 1:04 AM

#

hybrid architectures don't get rid of the quadratic scaling yeah

#

its really bad scaling if we want sequence lengths to get several orders of magnitude higher

craggy crest Dec 17, 2024, 4:00 AM

#

#

#

limpid thunderBOT Dec 17, 2024, 7:46 AM

#

Last 7 days <Dec 10 2024> → <Dec 16 2024>

Member counts
346039 ↘ 346018 ↘ 346017 ↘ 346003 ↗ 346018 ↘ 346000 ↘ 345993
Action members
0 → 0 → 0 → 0 → 0 → 0 ↗ 70
Message members
0 → 0 → 0 → 0 → 0 → 0 ↗ 58
Reaction members
0 → 0 → 0 → 0 → 0 → 0 ↗ 25
More details

Summary | comcom Analytics

comcom analytics は、Discord または Slack 上で運営されているコミュニティを分析・モニタリングできる完全無料のダッシュボードです。現在、パブリックにβ版を提供しています。

lavish sparrow Dec 17, 2024, 9:18 AM

#

ä

#

I'm never going to outprompt my LLM again 🥲

#

lavish sparrow Dec 17, 2024, 10:12 AM

#

lavish sparrow Dec 17, 2024, 11:22 AM

#

"i think i'm lost"

worn zealotBOT Dec 17, 2024, 12:25 PM

#

Author: @pine loom
MotionPrompt: say hello
Command: </animate:1181475143340539995>
Background: from video
Model: V3-beta
Explore more features at viggle ai

Viggle AI

Viggle AI | Controllable AI Video Generator

Bring your characters to life with AI. From professional motion capture to viral memes, discover endless ways to create with Viggle.

lavish sparrow Dec 17, 2024, 3:08 PM

#

#

lavish sparrow Dec 17, 2024, 3:42 PM

#

glossy bloom Dec 17, 2024, 3:43 PM

#

lavish sparrow

Beautiful picture.

unkempt mango Dec 17, 2024, 4:23 PM

#

remote holly Dec 17, 2024, 6:00 PM

#

hunyuan video gguf q8 with rtx 3060

remote holly Dec 17, 2024, 6:38 PM

#

i can't wait hunyuan svd quant

turbid grotto Dec 17, 2024, 6:39 PM

#

remote holly hunyuan video gguf q8 with rtx 3060

is it faster than fp8?

remote holly Dec 17, 2024, 6:40 PM

#

turbid grotto is it faster than fp8?

i never tried fp8 but i have but i generated this in 400s

#

20 steps

#

is like generating 2 images with flux

#

2s video in 408s

turbid grotto Dec 17, 2024, 6:42 PM

#

remote holly i never tried fp8 but i have but i generated this in 400s

i don't remember but I feel like it wasn't slower for me

#

also, flux takes only ~100s for me on 3060

remote holly Dec 17, 2024, 6:43 PM

#

q8 or fp8 ?

#

i think i could do with more frames , maybe 3s

#

i have not enough memory for decode

#

i love the consistency

turbid grotto Dec 17, 2024, 6:46 PM

#

remote holly q8 or fp8 ?

fp16 for flux and fp8 for hunyan

civic trail Dec 17, 2024, 7:03 PM

#

errant dust Dec 17, 2024, 7:30 PM

#

lavish sparrow

WHat was the prompt? Was this by LLM? If so which one and what was asked?

#

https://blog.google/technology/google-labs/video-image-generation-update-december-2024/

Google

State-of-the-art video and image generation with Veo 2 and Imagen 3

We’re rolling out a new, state-of-the-art video model, Veo 2, and updates to Imagen 3. Plus, check out our new experiment, Whisk.

#

Imagen 3 by DeepMind is updated

lavish sparrow Dec 17, 2024, 7:50 PM

#

errant dust WHat was the prompt? Was this by LLM? If so which one and what was asked?

It's an LLM enhanced prompt. However, there's a twist to it. The prompt i did: non-euclidean dreams

#

the output:

"T5": "A surreal dreamscape where gravity and space warp in defiance of the ordinary. Jagged, floating islands with archways that lead nowhere and staircases spiraling infinitely upwards or downwards dominate the scene. Colors are vivid and unnatural, bleeding into one another, creating a mesmerizing yet disorienting effect. Figures float effortlessly, their forms merging and separating as if woven from light and shadow, embodying the fluidity of non-Euclidean space.",
"CLIPG": "non-Euclidean, surreal, floating islands, infinite stairs, vivid colors, ethereal figures",
"CLIPL": "Surreal floating landscapes with warped geometry and ethereal figures in vivid, merging colors.",
"ARTSTYLE": "Surrealism, digital art"}```

#

i noticed SD3.5L and M really like it when the clipg and clipl aren't as long. so i have a LLM output various fields ->
reasoning: to get its bearings. It also helps getting rid of stuff that should not go into a prompt
T5: obvious
clipg: shortest prompt, just keywords
clipl: short stentences, basically a summary
artstyle: i add these in front of clipg and clipl, so there is an actual art style used

#

the LLM model i use is ExaOne, i'm really impressed with it

errant dust Dec 17, 2024, 8:01 PM

#

what size? I was not terribly impressed with its text, but admit it had some curious cultural habits with its creative output

dry wave Dec 17, 2024, 8:18 PM

#

why do people always use different text for clipl and clipg ;_;

remote holly Dec 17, 2024, 8:30 PM

#

lavish sparrow the output: ```{"reasoning": "To interpret 'non-euclidean dreams,' I envision a...

i use a vlm to make a specific kind of prompts to create found footages videos with ltxv , do you think reasonning can enhance the generated prompts ?

lavish sparrow Dec 17, 2024, 8:30 PM

#

errant dust what size? I was not terribly impressed with its text, but admit it had some cur...

there was a bug in the first release, the repeat penalty made it really bad

#

if you still have that parameter in there, it's TERRIBLE for real

remote holly Dec 17, 2024, 8:31 PM

#

is possible to ask a reasonning for describe an urbex video ?

lavish sparrow Dec 17, 2024, 8:31 PM

#

i'm not sure if vlm models are really suited to reasoning

#

and exaone isn't a reasoning model per se -> it's smart enough to do it when instructed, which helps

remote holly Dec 17, 2024, 8:32 PM

#

i mean use vlm output as an input for reasonning model

buoyant hedge Dec 17, 2024, 8:34 PM

#

@lavish sparrow that's really cool!

lavish sparrow Dec 17, 2024, 8:46 PM

#

remote holly i mean use vlm output as an input for reasonning model

as long as it's text, of course you can 🙂

remote holly Dec 17, 2024, 8:46 PM

#

yeah , i will try with marco-o1 in ollama

lavish sparrow Dec 17, 2024, 9:08 PM

#

marco o1 is surprisingly capable

#

if you want to use it in comfyui, you have to so some extra stuff

remote holly Dec 17, 2024, 9:09 PM

#

i will use for sure, can you split reasoning part and take only the final ?

lavish sparrow Dec 17, 2024, 9:09 PM

#

yes, that's the thing i've done, actually!

remote holly Dec 17, 2024, 9:09 PM

#

json format ?

lavish sparrow Dec 17, 2024, 9:10 PM

#

remote holly json format ?

#

set the format to json, and add in the system prompt what should go in what fields

pseudo owl Dec 17, 2024, 9:11 PM

#

remote holly i mean use vlm output as an input for reasonning model

probably wouldn't recommend that, just using something like qwen2vl/minicpmv2.6/pixtral should work well for describing images in a specific format.

No need for reasoning honestly like marco o1, llama 3 8b might be even better if used properly.

All format issues can be super easily solved by few-shot, just give it examples and you will get identical formats.

lavish sparrow Dec 17, 2024, 9:11 PM

#

so i let it reason in the reason tag, but i can discard it afterwards, but the intelligence of those models comes from having reasoning available to them in the first place.

pseudo owl Dec 17, 2024, 9:14 PM

#

I kinda dislike marco o1, its just a cot like finetune which even llama 3.2 1b/3b can do zero shot. The only open source model that actually seems to improve performance with reasoning is QwQ 32b(qwen) imo but thats more for coding.

remote holly Dec 17, 2024, 9:14 PM

#

pseudo owl probably wouldn't recommend that, just using something like qwen2vl/minicpmv2.6/...

is use minicpm-v2.6 and qwen 2 vl with this system prompt : You are an AI assistant expert in writing very precise and quality urbex video descriptions from images of places and spaces. When the user gives you an image representing a place or a space you write a description of a video following this template:

A urbex video filmed in the first person in [insert a short description of the place], the operator moves forward, [insert your description with details], the lighting is [insert a description on the lighting of the place], hyper realistic, fluid, masterpiece, amateur video, tilted camera

Instructions for generating video descriptions:

Do not contradict the image of the first frame.
Start with a concise description of the scene in a single paragraph, focusing on the key actions in order.
Include detailed character and object movements, focusing on specific, observable actions.
Provide essential background details to set the context, highlighting elements that enhance the mood without overloading the description.
Mention camera angles and movements that define the visual style of the scene, keeping it concise.
Specify lighting and colors to establish tone, ensuring they complement the action and setting.
Make sure the description reflects the type of source, such as live action or animation, clearly and naturally.

#

it works very great but in think i can do better results

pseudo owl Dec 17, 2024, 9:17 PM

#

remote holly it works very great but in think i can do better results

Any flaws with the normal output? The best way to solve it usually is by giving it a real example that you think is very good, try that maybe?

remote holly Dec 17, 2024, 9:18 PM

#

i get videos like this

#

i will try give an example

errant dust Dec 17, 2024, 10:27 PM

#

pseudo owl I kinda dislike marco o1, its just a cot like finetune which even llama 3.2 1b/3...

It does well on math logic puzzles too, but its excessive neurotic behavior is a bit annoying

bitter hearth Dec 17, 2024, 10:31 PM

#

they're not the same as what O1 did cos it wasn't cot

#

it was reinforcement learning apparently

limpid thunderBOT Dec 18, 2024, 1:43 AM

#

Thank you for using comcom analytics.
"comcom analytics" supports all community managers (moderators and server owners) by stats, visualization, and analytics.

If you have any questions, feel free to ask us!
Your dashboard
Help
Support server

Other languages
en: help
ja: help Japanese

granite pasture Dec 18, 2024, 1:43 AM

#

help Chinese

limpid thunderBOT Dec 18, 2024, 1:44 AM

#

Thank you for using comcom analytics.
"comcom analytics" supports all community managers (moderators and server owners) by stats, visualization, and analytics.

If you have any questions, feel free to ask us!
Your dashboard
Help
Support server

Other languages
en: help
ja: help Japanese

uneven storm Dec 18, 2024, 4:22 AM

#

remote holly hunyuan video gguf q8 with rtx 3060

very nice, i saw that the distelled versions of hunyan lost was heabily degraded but this seems reeally nice, how long does it take for a vid to gen, at what fps and for how many seconds can you get it, also which ui are you using? i do allot of image stuff but never video so curious

#

i also have 3060 so would be good info 🙂

remote holly Dec 18, 2024, 8:26 AM

#

@uneven storm i have an rtx3060 12gb. You can generate 2s(50 frames) of video with classic resolution and 20 steps , it takes me ~10mn to generate one video.

I am using hunyuan-q8 with the new native nodes in comfyui

uneven storm Dec 18, 2024, 8:31 AM

#

remote holly <@289981989480562688> i have an rtx3060 12gb. You can generate 2s(50 frames) of ...

can you send the comfy workflow pls?

remote holly Dec 18, 2024, 8:33 AM

#

https://comfyanonymous.github.io/ComfyUI_examples/hunyuan_video/

https://huggingface.co/city96/HunyuanVideo-gguf/tree/main

Update your comfyui , and your comfyu-gguf node pack

ComfyUI_examples

Hunyuan Video Model

Examples of ComfyUI workflows

city96/HunyuanVideo-gguf at main

uneven storm Dec 18, 2024, 8:33 AM

#

tyvm

unkempt compass Dec 18, 2024, 10:51 AM

#

remote holly https://comfyanonymous.github.io/ComfyUI_examples/hunyuan_video/ https://huggin...

Do you mean ComfyUI implemented a GGUF loader for Hunyuan? And what about FastHunyuan? Did you try it?

remote holly Dec 18, 2024, 10:52 AM

#

yeah the comfyui-gguf nodes have a support for hunyaun video , adn fastHunyuan works only with h100 gpu i think

unkempt compass Dec 18, 2024, 10:53 AM

#

remote holly yeah the comfyui-gguf nodes have a support for hunyaun video , adn fastHunyuan w...

But why Fast so? 🙂
Do you have a GGUF working workflow to give me?

remote holly Dec 18, 2024, 10:54 AM

#

i dont know, researchers test always they work on h100 gpu

unkempt compass Dec 18, 2024, 10:55 AM

#

And about a working workflow for GGUF Hunyuan?

remote holly Dec 18, 2024, 10:56 AM

#

just use comfyanonymous huyuan workflow and replace model loader to unet gguf loader

#

this one

unkempt compass Dec 18, 2024, 10:57 AM

#

remote holly this one

Ok 🙂

muted dove Dec 18, 2024, 3:35 PM

#

#

#

remote holly Dec 18, 2024, 8:30 PM

#

In a surreal, neon-lit dreamscape, a young woman stands surrounded by thick, swirling clouds of electric blue mist. Her sharp silhouette contrasts with the glowing haze, creating a sense of otherworldly wonder. She wears an oversized black coat adorned with glowing neon stars, which pulse with an ethereal, blue-white light, casting faint, shimmering reflections on her surroundings. The stars appear to hover slightly above the coat’s fabric, as if suspended by magic, radiating a celestial glow. Her face is illuminated softly by the ambient light, and her expression is calm yet enigmatic, as she gazes directly into the distance. The atmosphere is immersive and futuristic, as the dynamic fog continues to drift and curl through the scene, giving the impression of a cosmic, dreamlike environment. The contrast of the deep shadows and vibrant neon hues evokes a moody, cyberpunk-inspired aesthetic.

civic trail Dec 18, 2024, 9:56 PM

#

civic trail Dec 18, 2024, 10:12 PM

#

turbid grotto Dec 18, 2024, 10:16 PM

#

anyone tried this? Looks cool
https://github.com/lehduong/OneDiffusion?tab=readme-ov-file

GitHub

GitHub - lehduong/OneDiffusion: Official implementation of OneDiffu...

Official implementation of OneDiffusion paper. Contribute to lehduong/OneDiffusion development by creating an account on GitHub.

civic trail Dec 18, 2024, 10:25 PM

#

remote holly `In a surreal, neon-lit dreamscape, a young woman stands surrounded by thick, sw...

craggy crest Dec 18, 2024, 11:25 PM

#

bitter hearth Dec 19, 2024, 12:41 AM

#

turbid grotto anyone tried this? Looks cool https://github.com/lehduong/OneDiffusion?tab=readm...

its good yeah

#

lots of conditioning that is like control net or pullid

#

and they included seg control net which is one of the better but rarer ones

craggy crest Dec 19, 2024, 3:44 AM

#

#

#

nocturne flint Dec 19, 2024, 4:42 AM

#

生成带雪山、情侣、结婚证的元素的漫画风图片

remote holly Dec 19, 2024, 7:28 AM

#

civic trail

Flux ?

night oar Dec 19, 2024, 8:05 AM

#

Hello

civic trail Dec 19, 2024, 8:33 AM

#

remote holly Flux ?

Yes. Demoncore

remote holly Dec 19, 2024, 8:34 AM

#

i have the prompt for image version , i will send you

#

i dont find the prompt sorry

#

but is used these keywords :


a pretty woman with dark black hair , standing with a stylish pose in front of a red background with snow on the ground and smoke ,

she wears a large blue wool coat with long blue wool pants and a large blue hat ,

reflections , detailed shadows , bright and dynamic light ,

8k, detailed, masterpiece```

lavish sparrow Dec 19, 2024, 9:04 AM

#

#

remote holly Dec 19, 2024, 9:33 AM

#

hyper realistic cinematic photo, artistic photo, artistic fashion style, long exposure, bokeh effect, cinematic view, fashion, close angle, bright colors, professional photo, a pretty woman with dark black bob hair, floating in the air in a photo room with a red background, she is wearing a large orange wool coat with long orange wool pants and a large orange hat, reflections, detailed shadows, bright and dynamic light, 8k, detailed, masterpiece

#

with sd3.5l

lavish sparrow Dec 19, 2024, 10:26 AM

#

??? XD

bitter hearth Dec 19, 2024, 1:00 PM

#

remote holly ``` hyper realistic cinematic photo, artistic photo, artistic fashion style, lon...

3.5L colours are so nice

remote holly Dec 19, 2024, 1:14 PM

#

bitter hearth 3.5L colours are so nice

yes , this is why i love this model

#

bitter hearth Dec 19, 2024, 1:18 PM

#

you can send your Flux, Cascade, SD 1.5 or SDXL latents directly to SD 3.5 with this https://github.com/city96/SD-Latent-Interposer

#

particularly for SD 1.5 it lets you avoid using the small VAE

errant dust Dec 19, 2024, 2:02 PM

#

Midjourney is having a special for this month, with UNLIMITED IMAGES using their smallest subscription. They said the reason is they have too many servers that are running idle, so made this announcement. They literally state that (IMHO it means the competition from free purveyors like Flux, and the sheer strength of said models, is the cause)

#

They call it the Relaxathon, meaning set it from Fast image generation to Relaxed, but due to huge server availability there is little to no wait time using this theoretically slower mode

#

Again, this is in their very own announcement

bitter hearth Dec 19, 2024, 2:05 PM

#

its quite slow in relaxed mode as far as I know

errant dust Dec 19, 2024, 2:07 PM

#

So you are trying it in this Relaxathon and new conditions? Ok, good to know

bitter hearth Dec 19, 2024, 2:09 PM

#

no I don't use closed source personally
its just what I have heard

errant dust Dec 19, 2024, 2:09 PM

#

But you heard this now, when they declared that their servers are currently running idle

#

Still good to know

#

This is the announcement for those interested:

#

Hey. we got a ton of servers for our models next year, and they're currently sad and idle. So for the remainder of the year we're doing a holiday RELAX-ATHON. Relax mode is now available to all members (including basic) with near-zero wait time.

More specifically this applies to
Standard jobs, variations, sref, permutation jobs

It does not apply to:
Upscales, weird jobs, editor, outpainting, retexture

We've also increased the queue sizes and maximum simultaneous jobs
You can now do up to 12 simultaneous relax mode jobs
You can now have 24 jobs in your waiting queue
(these numbers may change based on server availability)

This is a pretty huge amount of free computing power and we hope you can use it over the holidays to have a lot of fun and take some crazy aesthetic journeys.

Happy holidays! ❤️

#

Sounds also llike MJ7 is on the way

#

I have not tested even the new release of Imagen 3 frankly, which is free and came out yesterday.

#

I still may bite for fun.

rapid pivot Dec 19, 2024, 2:15 PM

#

errant dust I have not tested even the new release of Imagen 3 frankly, which is free and ca...

Can you use it online?

#

errant dust Dec 19, 2024, 2:17 PM

#

rapid pivot Can you use it online?

You can only use it online. It is DeepMind/Google's image service. But free.

#

They added different ratios, so now it is not locked into 1:1 images

bitter hearth Dec 19, 2024, 2:18 PM

#

errant dust But you heard this now, when they declared that their servers are currently runn...

its just marketing to get people to subscribe really

errant dust Dec 19, 2024, 2:18 PM

#

bitter hearth its just marketing to get people to subscribe really

SUre, but unlimited images for the cheapest plan is a good deal

rapid pivot Dec 19, 2024, 2:18 PM

#

errant dust You can only use it online. It is DeepMind/Google's image service. But free.

I meant more like is it on a website or do I have to download an app

#

waow

errant dust Dec 19, 2024, 2:18 PM

#

oh... no, just website

rapid pivot Dec 19, 2024, 2:19 PM

#

rapid pivot Dec 19, 2024, 2:22 PM

#

errant dust oh... no, just website

They still don't let you generating with people

#

I can't generate Santa using a jetpack, 0/10 sadcat

#

glif-stable-diffusion-3-5-large-insane-iw5qs5fxukt60rhkaumj2e2z.jpg

errant dust Dec 19, 2024, 2:23 PM

#

Yeah? I never tried. Let me see what happens

rapid pivot Dec 19, 2024, 2:24 PM

#

Yeah they tried to "diversify" things on the older models and got backlash for it

errant dust Dec 19, 2024, 2:25 PM

#

Worked for me

#

#

rapid pivot Dec 19, 2024, 2:26 PM

#

errant dust

Hmm I was trying on Gemini

#

Maybe only there

errant dust Dec 19, 2024, 2:26 PM

#

Well, I said Imagen, not Gemini, so cannot comment

#

https://labs.google/fx/tools/image-fx

rapid pivot Dec 19, 2024, 2:27 PM

#

rapid pivot Dec 19, 2024, 2:27 PM

#

errant dust Well, I said Imagen, not Gemini, so cannot comment

Gemini is their text bot, it can generate stuff

errant dust Dec 19, 2024, 2:27 PM

#

As a rule I always get rid of the middleman

#

🙂

rapid pivot Dec 19, 2024, 2:28 PM

#

But because of that it might be getting blocked like the old models

errant dust Dec 19, 2024, 2:28 PM

#

Like Copilot and Dalle3.

rapid pivot Dec 19, 2024, 2:28 PM

#

errant dust As a rule I always get rid of the middleman

It's whatever, closed source AI will never be as good

rapid pivot Dec 19, 2024, 2:29 PM

#

errant dust https://labs.google/fx/tools/image-fx

Not available in my country, rip

errant dust Dec 19, 2024, 2:29 PM

#

That's not a very grounded opinion IMHO. Preference won't make it true.

#

Oh, and nor in my country. I use a VPN

#

I mean, all models, open source or closed source are made by companies big and small. There is no magic sauce in creating a model that makes the open source inherently better. You can create tools and LoRAs and so on, but the base model? The sheer cmpute alone is outside the geenral community's reach for now.

#

This is in contrast to open-source software projects directly developed by the community

rapid pivot Dec 19, 2024, 2:33 PM

#

errant dust Oh, and nor in my country. I use a VPN

Wait it's available in Brazil

errant dust Dec 19, 2024, 2:33 PM

#

Imagen 3?

rapid pivot Dec 19, 2024, 2:33 PM

#

I just updated the page and worked

#

Yeah

#

I got an image from it

errant dust Dec 19, 2024, 2:33 PM

#

Huh. It wasn't a couple of months ago

#

I got messages about availability soon blablabla

rapid pivot Dec 19, 2024, 2:34 PM

#

I went to the faq and saw Brazil there and was like wtf

#

Went back and it worked waow

#

The UI in that site is pure suffering on mobile

#

sadcat

#

@lavish sparrow

errant dust Dec 19, 2024, 2:45 PM

#

Solid artwork though

pseudo owl Dec 19, 2024, 3:00 PM

#

Hunyuan can gen 20sec vids now with ctx window and not using extra vram. Will use extra time obviously though.

bitter hearth Dec 19, 2024, 3:48 PM

#

rapid pivot Yeah they tried to "diversify" things on the older models and got backlash for i...

oh I remember this yeah

#

was in the early Gemini days

remote holly Dec 19, 2024, 4:20 PM

#

bitter hearth you can send your Flux, Cascade, SD 1.5 or SDXL latents directly to SD 3.5 with ...

You mean the latent image ?

remote holly Dec 19, 2024, 4:21 PM

#

pseudo owl Hunyuan can gen 20sec vids now with ctx window and not using extra vram. Will us...

Amazing

bitter hearth Dec 19, 2024, 4:50 PM

#

remote holly You mean the latent image ?

yeah the latent image

#

pink spaghetti noodle in ComfyUI

fleet meteor Dec 19, 2024, 5:49 PM

#

pseudo owl Hunyuan can gen 20sec vids now with ctx window and not using extra vram. Will us...

Yooo that´s some good quality

fleet meteor Dec 19, 2024, 5:52 PM

#

errant dust I have not tested even the new release of Imagen 3 frankly, which is free and ca...

I just tested it... its looking good

#

Too bad its closed source but at least its free (no, im not gonna subscribe or upgrade to it XD)

bitter hearth Dec 19, 2024, 6:03 PM

#

looks nice

fleet meteor Dec 19, 2024, 6:14 PM

#

🤣

pseudo owl Dec 19, 2024, 6:23 PM

#

Flux Schnell 1step, 1024x512

remote holly Dec 19, 2024, 6:23 PM

#

bitter hearth yeah the latent image

Using different latents change the output image ?

fleet meteor Dec 19, 2024, 6:25 PM

#

pseudo owl Flux Schnell 1step, 1024x512

Nice

bitter hearth Dec 19, 2024, 6:34 PM

#

remote holly Using different latents change the output image ?

yeah the VAE makes a different image

#

you use the VAE of the final model

remote holly Dec 19, 2024, 6:44 PM

#

bitter hearth you use the VAE of the final model

thats realy cool , i will try theses nodes

#

new ltx video version 0.9.1 !!! https://github.com/Lightricks/ComfyUI-LTXVideo

GitHub

GitHub - Lightricks/ComfyUI-LTXVideo: LTX-Video Support for ComfyUI

LTX-Video Support for ComfyUI. Contribute to Lightricks/ComfyUI-LTXVideo development by creating an account on GitHub.

#

new vae , built in stg and image compression !

bitter hearth Dec 19, 2024, 6:55 PM

#

wow nice

#

LTX is good

remote holly Dec 19, 2024, 7:04 PM

#

yeah

#

i love they listen the community and add the image compression and built in stg

#

i am looking forward controlnet and lora support

#

ltxv with controlnets have the potential to be the new animatediff with DiT

remote holly Dec 19, 2024, 7:29 PM

#

anatomy is not good but i love colors

#

is it possible to correct the hands with inpainting ?

pseudo owl Dec 19, 2024, 7:29 PM

#

remote holly is it possible to correct the hands with inpainting ?

Worked for sdxl, never tested on sd3.5 but probably

pseudo owl Dec 19, 2024, 7:30 PM

#

remote holly new ltx video version 0.9.1 !!! https://github.com/Lightricks/ComfyUI-LTXVideo

New ltxv looks amazing so far, left is old one, right is new one

remote holly Dec 19, 2024, 7:30 PM

#

realy amazing !

#

the movement and camera are better

#

no crazy deformations with stg

pseudo owl Dec 19, 2024, 7:32 PM

#

Yeah it got a nice 5 finger hand, and it’s actually faster too. Previous ltxv already took like 15sec on a 4090 with stg. This one is 15% faster I believe

remote holly Dec 19, 2024, 7:33 PM

#

what is this new option :
Additional initial latent optional input to chain latents for high res generation.

remote holly Dec 19, 2024, 7:33 PM

#

pseudo owl Yeah it got a nice 5 finger hand, and it’s actually faster too. Previous ltxv al...

15% ? a realy good improvement

remote holly Dec 19, 2024, 8:10 PM

#

4s/it for 5s and 30 steps !

hallow lion Dec 19, 2024, 8:11 PM

#

pseudo owl New ltxv looks amazing so far, left is old one, right is new one

Even the apple is faking it

craggy crest Dec 19, 2024, 8:26 PM

#

remote holly Dec 19, 2024, 8:30 PM

#

i notice that now the model producs better camera motion

#

with 50 steps

#

amazing

lavish sparrow Dec 19, 2024, 9:23 PM

#

#

lavish sparrow Dec 19, 2024, 9:39 PM

#

#

nomnomnom

#

"Let me devour your heart"

pseudo owl Dec 19, 2024, 9:59 PM

#

New Ltxv 0.9.1

lavish sparrow Dec 19, 2024, 10:00 PM

#

short haven Dec 19, 2024, 10:15 PM

#

blood microchips

lavish sparrow Dec 19, 2024, 11:10 PM

#

remember to water your plants

dull star Dec 19, 2024, 11:20 PM

#

remote holly with 50 steps

oh wow this is really good

lavish sparrow Dec 19, 2024, 11:35 PM

#

lavish sparrow Dec 19, 2024, 11:51 PM

#

time for the little monster to sleep

lavish sparrow Dec 20, 2024, 7:47 AM

#

and morning coffee, important too

lavish sparrow Dec 20, 2024, 8:04 AM

#

upbeat girder Dec 20, 2024, 9:23 AM

#

craggy crest

how can you do this?

civic trail Dec 20, 2024, 9:44 AM

#

lavish sparrow Dec 20, 2024, 9:57 AM

#

lavish sparrow Dec 20, 2024, 10:02 AM

#

upbeat girder how can you do this?

i tried.

#

abstract rabbits xD

#

#

?

#

"i'm tired, boss"

#

#

"T5": "Abstract swirling flames dance across a canvas, interwoven with ethereal, flowing lines representing voices and emotions. Shades of deep reds and oranges contrast with cool blues and purples, symbolizing bittersweet desire and peace. Interlaced figures, barely discernible, embody connection and eternal devotion, their forms merging into a harmonious chaos that speaks of burning passion and eternal union.",
"CLIPG": "flames, ethereal lines, red-orange, blue-purple, intertwined figures, voices, eternity",
"CLIPL": "Abstract flames and flowing lines symbolize burning passion and eternal love, merging figures in a vibrant, bittersweet palette.",
"ARTSTYLE": "Abstract Expressionism"}``` the fact that sd3.5 L even gets this close, is 100% amazing...

toxic bone Dec 20, 2024, 10:54 AM

#

"The fact that..." [opinion]

lavish sparrow Dec 20, 2024, 10:58 AM

#

the fact that i'm getting images i'm getting entertained by -> i 100% do not care other people might not like it. i like it. that's all that matters to me. why would anyone's opinion matter in the first place? but i know there's people in this channel that might like it too.

lavish sparrow Dec 20, 2024, 11:18 AM

#

lavish sparrow Dec 20, 2024, 4:19 PM

#

#

lavish sparrow Dec 20, 2024, 4:49 PM

#

craggy crest Dec 20, 2024, 5:24 PM

#

upbeat girder how can you do this?

that's sd 3.5 large. It's a test for a lora i'm creating.

craggy crest Dec 20, 2024, 5:25 PM

#

lavish sparrow i tried.

too cute award!

turbid grotto Dec 20, 2024, 5:44 PM

#

bitter hearth its good yeah

and of course no comfy wrapper sadcat

bitter hearth Dec 20, 2024, 5:46 PM

#

I saw a comment once saying
instead of waiting for Comfy wrapper, just go to the tool where its at
and I've started doing that lol

turbid grotto Dec 20, 2024, 5:47 PM

#

bitter hearth I saw a comment once saying instead of waiting for Comfy wrapper, just go to the...

Sadly, I have a skill issue

#

Yea, I should learn

#

but later agony

#

What plans do Stability have? Will sd3.5 be the last img model?

bitter hearth Dec 20, 2024, 5:50 PM

#

its ok everyone has skill issue
not sure what's gonna happen with Stability

#

feels like video is the focus of everywhere for a while

finite osprey Dec 20, 2024, 5:54 PM

#

I wish sd3.5 would hit 2mpx natively : (

pseudo owl Dec 20, 2024, 5:55 PM

#

Yeah when will stability cook a sota video model, same with Black Forest labs and they’ve been promising it for a very very long time.

bitter hearth Dec 20, 2024, 5:55 PM

#

they took a lot of VC money so the funding is there

pseudo owl Dec 20, 2024, 5:56 PM

#

finite osprey I wish sd3.5 would hit 2mpx natively : (

Even flux usually gets artifacts around that res, shuttle 3 is probably the best at native 2k gen, not 100% sure. You can always just do a 2nd pass.

finite osprey Dec 20, 2024, 5:58 PM

#

2nd passes still gives artifacts for me, but probably skill issues. Unless you talk about upscaling which I dont like

bitter hearth Dec 20, 2024, 6:00 PM

#

pseudo owl Even flux usually gets artifacts around that res, shuttle 3 is probably the best...

demoncore is good

#

they trained it on flux de-distilled

#

it can go to 2560x2560 even

#

I agree shuttle is probably the best model out there currently, for quality per FLOP

finite osprey Dec 20, 2024, 6:02 PM

#

do they run on a 3070 ?

bitter hearth Dec 20, 2024, 6:03 PM

#

on linux yeah

#

shuttle-3-diffusion-Q4_K_S.gguf - 6.88 GB will fit

finite osprey Dec 20, 2024, 6:04 PM

#

will try

bitter hearth Dec 20, 2024, 6:05 PM

#

SVDquant would be the best thing for you
its currently tricky to use
they are gonna make it easier apparently

finite osprey Dec 20, 2024, 6:09 PM

#

#soon

#

#

Thanks @bitter hearth goodjob

bitter hearth Dec 20, 2024, 6:24 PM

#

I recognise the Shuttle look lol

finite osprey Dec 20, 2024, 6:24 PM

#

The fact shuttle is written on his arm is a giveaway

bitter hearth Dec 20, 2024, 6:24 PM

#

lol didnt even see

lavish sparrow Dec 20, 2024, 6:25 PM

#

bitter hearth Dec 20, 2024, 6:26 PM

#

Shuttle beats dev quite regularly

#

it can't do every style/subject but when it works its great

finite osprey Dec 20, 2024, 6:27 PM

#

what are its weaknesses?

bitter hearth Dec 20, 2024, 6:28 PM

#

it goes painterly sometimes

#

I always want photographic

#

also it is not a soft model by any means

#

which is both a pro and a con

lavish sparrow Dec 20, 2024, 6:38 PM

#

bitter hearth Dec 20, 2024, 6:40 PM

#

very nice green

finite osprey Dec 20, 2024, 6:51 PM

#

fleet meteor Dec 20, 2024, 6:51 PM

#

SD3.5

finite osprey Dec 20, 2024, 7:07 PM

#

stiff cypress Dec 20, 2024, 7:20 PM

#

So I have SD forge....so how to can use sd3?

elder urchin Dec 20, 2024, 8:02 PM

#

Can we train loras on SD3/SD3.5?

#

Also, is it worth it go from SD2 to 3/3.5 for anime?

civic trail Dec 20, 2024, 9:28 PM

#

pseudo owl Dec 20, 2024, 9:52 PM

#

elder urchin Also, is it worth it go from SD2 to 3/3.5 for anime?

Yes 3.5 should be much better then sd2, but maybe not anime specific sdxl finetunes like animagine.

craggy crest Dec 20, 2024, 11:05 PM

#

pseudo owl Yes 3.5 should be much better then sd2, but maybe not anime specific sdxl finetu...

SD 3.5 large. prompt: anime girl yelling at an anime boy - no loras, no finetuned check points, just the base model. SD 3.5 L and SD 3.5 M are fantastic models that people refuse to use. Large doesn't NEED loras - but people refuse to even try to use it without them

elder urchin Dec 20, 2024, 11:07 PM

#

I want loras for specific characters

#

I doubt the base model can run all of the characters I want on good quality

craggy crest Dec 20, 2024, 11:34 PM

#

elder urchin I doubt the base model can run all of the characters I want on good quality

you might be surprised - but that's a very specific thing. in general, however, for some odd reason the mindset in this community seems to be 'can't generate without lora' - like a lora is a magic wand that'll give them wings, or an invulnerability potion, or something. and if they don't have a lora, they won't even try prompting the base model.

#

craggy crest Dec 20, 2024, 11:54 PM

#

frail shoal Dec 21, 2024, 1:06 AM

#

frail shoal Dec 21, 2024, 1:08 AM

#

craggy crest SD 3.5 large. prompt: anime girl yelling at an anime boy - no loras, no finetu...

using only sd3.5 medium as second stage for pixart sigma. Sadly sd3.5 large is too big for my pc

#

and sd3.5m does shitty composition on it's own

craggy crest Dec 21, 2024, 2:01 AM

#

frail shoal using only sd3.5 medium as second stage for pixart sigma. Sadly sd3.5 large is t...

the new nvidia gpus should be coming out after the first of the year, and that should drop the price on everything else. maybe you can upgrade soon?

#

have you tried the GGUFs that were put out, out of curiosity?

frail shoal Dec 21, 2024, 2:03 AM

#

craggy crest have you tried the GGUFs that were put out, out of curiosity?

yes i did, but is too slow, also it does not have the quality of sd3.5m. The images above were made with pixart sigma 15 steps + sd3.5m 35-50 steps in a step-swap workflow.

frail shoal Dec 21, 2024, 2:04 AM

#

craggy crest the new nvidia gpus should be coming out after the first of the year, and that s...

will upgrade soon, but we will see if nvidia will give us more vram this time

craggy crest Dec 21, 2024, 2:04 AM

#

frail shoal will upgrade soon, but we will see if nvidia will give us more vram this time

you can always buy additional vram

frail shoal Dec 21, 2024, 2:05 AM

#

craggy crest you can always buy additional vram

no i can't, a gpu comes with a fixed vram

craggy crest Dec 21, 2024, 2:05 AM

#

well, we'll wait and see what they release

sacred jewel Dec 21, 2024, 2:42 AM

#

craggy crest you can always buy additional vram

https://tenor.com/view/bullish-crypto-hype-i-want-more-lets-do-this-gif-1852269844701917269

Tenor

craggy crest Dec 21, 2024, 3:35 AM

#

SD 3.5 large

summer ginkgo Dec 21, 2024, 5:58 AM

#

Whitfield Diffie (SD3.5 Large Turbo).. turned him into father time... a bit 🥲😂 hf space

fervent dragon Dec 21, 2024, 7:23 AM

#

waow

sweet quartz Dec 21, 2024, 12:17 PM

#

move

alfredzhao_83531_a_beautiful_girl_stand_near_a_car_wear_a_cat_0fe1b756-2aa8-4faf-916e-6b0998fd156a.png

rapid pivot Dec 21, 2024, 12:49 PM

#

Don't move

#

sage burrow Dec 21, 2024, 6:40 PM

#

craggy crest Dec 21, 2024, 9:42 PM

#

SD 3.5 large

#

#

#

hallow lion Dec 21, 2024, 11:20 PM

#

Rathorse. A sing of our times.

sage burrow Dec 21, 2024, 11:27 PM

#

frail shoal Dec 21, 2024, 11:37 PM

#

sd3.5 large turbo + medium

#

anyone finetuning sd3.5 medium ?

#

also have people started using more sd3.5 now, or most of them are still on flux

bitter hearth Dec 21, 2024, 11:52 PM

#

frail shoal also have people started using more sd3.5 now, or most of them are still on flux

check Civit numbers

#

I don't wanna check myself cos Civit's site lags everything up

frail shoal Dec 21, 2024, 11:56 PM

#

jsut saying that flux loras had exploded when it came out, and it seems not that much dor sd3.5, there is almost nothing

bitter hearth Dec 21, 2024, 11:58 PM

#

oh I thought you were asking a question rather than making a rhetorical point

sage burrow Dec 22, 2024, 12:16 AM

#

frail shoal also have people started using more sd3.5 now, or most of them are still on flux

I like 3.5 better

sage burrow Dec 22, 2024, 12:17 AM

#

frail shoal jsut saying that flux loras had exploded when it came out, and it seems not that...

I have no idea why, sd3 loras are far far easier to create, especially when doing uhm, civitai themes

#

It's true though, barely any 3.5 loras in comparison

frail shoal Dec 22, 2024, 12:18 AM

#

sage burrow I like 3.5 better

i do like it better for the creativity, but flux has better image coherency.

frail shoal Dec 22, 2024, 12:19 AM

#

sage burrow I have no idea why, sd3 loras are far far easier to create, especially when doin...

it's hard to switch when you are already happy with flux ?

sage burrow Dec 22, 2024, 12:19 AM

#

There's some stuff that sd3 can do that flux cannot

frail shoal Dec 22, 2024, 12:20 AM

#

seeing the ton of loras for flux, i was tempted to switch again to it lol

frail shoal Dec 22, 2024, 12:20 AM

#

sage burrow There's some stuff that sd3 can do that flux cannot

like ?

sage burrow Dec 22, 2024, 12:21 AM

#

frail shoal like ?

Anything nsfw, but also melting clocks was extremely difficult to create loras for with flux!

frail shoal Dec 22, 2024, 12:22 AM

#

flux has something special though, like symmetric details. idk how they achieved that

sage burrow Dec 22, 2024, 12:25 AM

#

Fortunately they are both free, and run locally 😄

bitter hearth Dec 22, 2024, 12:25 AM

#

frail shoal flux has something special though, like symmetric details. idk how they achieved...

probably just the size

#

most 20B DiTs have good structure

frail shoal Dec 22, 2024, 12:26 AM

#

sage burrow Fortunately they are both free, and run locally 😄

yeah, im thinking to try flux dev stage 1 + sd3.5 medium stage 2, to remove that plastic feel from flux gens

sage burrow Dec 22, 2024, 12:26 AM

#

Anyone made a Hunyuan Video lora yet? 🙂

craggy crest Dec 22, 2024, 12:26 AM

#

frail shoal jsut saying that flux loras had exploded when it came out, and it seems not that...

flux is no longer a shiny new toy

pseudo owl Dec 22, 2024, 12:27 AM

#

sage burrow Anyone made a Hunyuan Video lora yet? 🙂

theres several now actually I believe

craggy crest Dec 22, 2024, 12:27 AM

#

sage burrow Anyone made a Hunyuan Video lora yet? 🙂

i think i saw one mentioned a couple days ago

frail shoal Dec 22, 2024, 12:27 AM

#

craggy crest flux is no longer a shiny new toy

yeah but still, i see almost no lora or finetunes for sd3.5

sage burrow Dec 22, 2024, 12:27 AM

#

frail shoal yeah but still, i see almost no lora or finetunes for sd3.5

I'm getting around to it lol

craggy crest Dec 22, 2024, 12:27 AM

#

frail shoal yeah but still, i see almost no lora or finetunes for sd3.5

i'm sitting here cranking out sd 3.5 large loras, about one every two days. there are a LOT of them out there, both on hugging face and on civit

#

here

#

https://huggingface.co/collections/crystalwizard/my-loras-675aff7119f93a63c64ecbd8

My Loras - a crystalwizard Collection

#

start there - those are all style loras

pseudo owl Dec 22, 2024, 12:28 AM

#

The cool thing with hunyuan is since its also like a text-to-image model, you can train it on images too and it can actually gen videos of that character/style. Image training is a lot cheaper and faster.

sage burrow Dec 22, 2024, 12:28 AM

#

<-- got distracted by Hunyuan Video lol

craggy crest Dec 22, 2024, 12:28 AM

#

sage burrow <-- got distracted by Hunyuan Video lol

mage just put hunyuan in, free and unlmited, for all accounts ...

#

just sayin'

sage burrow Dec 22, 2024, 12:29 AM

#

pseudo owl The cool thing with hunyuan is since its also like a text-to-image model, you ca...

I can't wait until that aspect of it hits mage! (My computer is too slow)

frail shoal Dec 22, 2024, 12:29 AM

#

craggy crest i'm sitting here cranking out sd 3.5 large loras, about one every two days. ther...

mhhm, only sd3.5 large. Is there any chance of having a really good sd3.5M in the future, or it is still unusable as a base model ?

sage burrow Dec 22, 2024, 12:29 AM

#

craggy crest mage just put hunyuan in, free and unlmited, for all accounts ...

I noticed! 😄

craggy crest Dec 22, 2024, 12:30 AM

#

frail shoal mhhm, only sd3.5 large. Is there any chance of having a really good sd3.5M in th...

3.5 m is not unusuable as a base model, but i can be talked into creating loras for it if you want

bitter hearth Dec 22, 2024, 12:30 AM

#

there is 3.5m turbo now also

craggy crest Dec 22, 2024, 12:30 AM

#

but i'm making style loras. what sort of lora are you looking for?

bitter hearth Dec 22, 2024, 12:31 AM

#

https://old.reddit.com/r/StableDiffusion/comments/1hieril/introducing_tensorarts_stable_diffusion_35_medium/

frail shoal Dec 22, 2024, 12:31 AM

#

craggy crest 3.5 m is not unusuable as a base model, but i can be talked into creating loras ...

i have a lot of fantasy dnd like images, handpicked. If you can train a lora on them

craggy crest Dec 22, 2024, 12:31 AM

#

frail shoal i have a lot of fantasy dnd like images, handpicked. If you can train a lora on ...

sure - how many do you have?

pseudo owl Dec 22, 2024, 12:31 AM

#

craggy crest mage just put hunyuan in, free and unlmited, for all accounts ...

mage video is hunyuan?

craggy crest Dec 22, 2024, 12:31 AM

#

pseudo owl mage video is hunyuan?

yes

#

they just released it yesterday

#

so it's in beta still

#

https://www.mage.space/video

Mage | Create Videos with AI

Generate amazing videos using artificial intelligence. Create, customize and share AI-generated videos instantly.

pseudo owl Dec 22, 2024, 12:32 AM

#

nice, there was no free demo of hunyuan before and this one is unlimited😄

frail shoal Dec 22, 2024, 12:32 AM

#

craggy crest sure - how many do you have?

around 2000, but they are all very high quality.

craggy crest Dec 22, 2024, 12:32 AM

#

it's low res, keep that in mind. they will put in high res, but not till it's out of beta and that probably won't be free to all accounts

craggy crest Dec 22, 2024, 12:33 AM

#

frail shoal around 2000, but they are all very high quality.

i can't use more than 1000 for a lora, you're wanting a fine tuned check point. that's not me, that's @sage burrow

frail shoal Dec 22, 2024, 12:33 AM

#

craggy crest i can't use more than 1000 for a lora, you're wanting a fine tuned check point. ...

maybe i can choose 1000 then lol

#

i would need some time to sort them

craggy crest Dec 22, 2024, 12:34 AM

#

frail shoal maybe i can choose 1000 then lol

i'll be happy to train a lora for you if you're willing to do that, sure

sage burrow Dec 22, 2024, 12:34 AM

#

frail shoal around 2000, but they are all very high quality.

Civitai.com is really easy to create loras with, up to 1000 images. $2 approx each 🙂

frail shoal Dec 22, 2024, 12:34 AM

#

craggy crest i'll be happy to train a lora for you if you're willing to do that, sure

great

frail shoal Dec 22, 2024, 12:34 AM

#

sage burrow Civitai.com is really easy to create loras with, up to 1000 images. $2 approx ea...

yeah but i don't know the settings, i would need to pay a lot until i get something decent

#

i never trained before

sage burrow Dec 22, 2024, 12:35 AM

#

craggy crest i can't use more than 1000 for a lora, you're wanting a fine tuned check point. ...

Unfortunately I am only able to do loras so far.
(Though sometimes I'll create a huge lora, then pseudo-merge it with a checkpoint)

craggy crest Dec 22, 2024, 12:35 AM

#

sage burrow Unfortunately I am only able to do loras so far. (Though sometimes I'll create a...

which would probably work for this

frail shoal Dec 22, 2024, 12:36 AM

#

craggy crest i'll be happy to train a lora for you if you're willing to do that, sure

sent you a friend request, will be back to you once i have the 1000 sorted

craggy crest Dec 22, 2024, 12:36 AM

#

why don't you see if you can crete what @frail shoal is after. and i'll do a lora from a smaller dataset if it doesn't work

pseudo owl Dec 22, 2024, 12:36 AM

#

craggy crest it's low res, keep that in mind. they will put in high res, but not till it's ou...

yeah not too bad, speed is actually decent and the cinema aspect ratio res is 672x368 seems like.

craggy crest Dec 22, 2024, 12:36 AM

#

yeah. i made several videos right after it released, they worked well

frail shoal Dec 22, 2024, 12:36 AM

#

although i must warn you, there are a lot of women. not all, some dragons and a lot of fighting scenes, but a lot of women still

sage burrow Dec 22, 2024, 12:37 AM

#

frail shoal yeah but i don't know the settings, i would need to pay a lot until i get someth...

The preset ones work extremely well (that's what I use most often). Just remember to switch image size from 512 up to 1024

craggy crest Dec 22, 2024, 12:37 AM

#

pseudo owl yeah not too bad, speed is actually decent and the cinema aspect ratio res is 67...

be aware that it's very picky about aspect ratio. if you want someone walking as the subject, you need (click customize) to use 9:16 AR

#

otherwise you might get just their stomach

frail shoal Dec 22, 2024, 12:37 AM

#

sage burrow The preset ones work extremely well (that's what I use most often). Just remembe...

there is a preset about sd3.5m ?

craggy crest Dec 22, 2024, 12:37 AM

#

frail shoal there is a preset about sd3.5m ?

a preset what?

frail shoal Dec 22, 2024, 12:38 AM

#

my images are not all in 1024 though, they have different aspect ratios. would i need to crop them before ?

frail shoal Dec 22, 2024, 12:38 AM

#

craggy crest a preset what?

i thik becky meant in civit ai

craggy crest Dec 22, 2024, 12:38 AM

#

frail shoal my images are not all in 1024 though, they have different aspect ratios. would i...

i make mine all 1024 - some people don't. i just want the AI to have a dataset that's consistant

sage burrow Dec 22, 2024, 12:39 AM

#

frail shoal there is a preset about sd3.5m ?

3.5 in general there are. (Akso flux)

frail shoal Dec 22, 2024, 12:39 AM

#

craggy crest i make mine all 1024 - some people don't. i just want the AI to have a dataset t...

i try to have a decent group for each aspect ratio

sage burrow Dec 22, 2024, 12:39 AM

#

frail shoal my images are not all in 1024 though, they have different aspect ratios. would i...

I'm a lazy lora maker, I never do lol. I think you are supposed to though

craggy crest Dec 22, 2024, 12:40 AM

#

sage burrow I'm a lazy lora maker, I never do lol. I think you are supposed to though

you don't have to - there are pros and cons for each

#

if they are all the same AR, then you can have issues if you try to generate somethign that's not that AR

sage burrow Dec 22, 2024, 12:40 AM

#

So, anyone know a good free video upscaler?

craggy crest Dec 22, 2024, 12:41 AM

#

on the other hand, you might get a better training if they are all the same AR

craggy crest Dec 22, 2024, 12:41 AM

#

sage burrow So, anyone know a good free video upscaler?

capcuts

frail shoal Dec 22, 2024, 12:41 AM

#

i cropped like a 100 and inpainted some details into things that were needed. but it takes a lot of time. i also turned a guy character into a woman, because i really liked that one lol

craggy crest Dec 22, 2024, 12:41 AM

#

on their magic tools page

sage burrow Dec 22, 2024, 12:41 AM

#

craggy crest capcuts

Thank you

#

Hunyuan Video is decent at sword swinging characters btw! Better than still images last o tried

analog bolt Dec 22, 2024, 11:48 AM

#

Is there any fancy way of upscaling with SD 3.5 (I'd like to get 4k image), or do I just generate standard 1k image and then pass through SD upscaler?

analog bolt Dec 22, 2024, 1:02 PM

#

bitter hearth Dec 22, 2024, 1:04 PM

#

yeah if you can split the tiles up
then run a captioning model, control net and IP adapter on each tile, you can get much better results

analog bolt Dec 22, 2024, 1:05 PM

#

bitter hearth yeah if you can split the tiles up then run a captioning model, control net and ...

https://tenor.com/view/i-know-some-of-these-words-mhmm-clueless-words-i-know-gif-5609882

Tenor

bitter hearth Dec 22, 2024, 1:05 PM

#

LOL

#

I'll make a workflow for you later if you want

analog bolt Dec 22, 2024, 1:13 PM

#

I've used one workflow from da internets with basically default setup (just removed inpainting) and the upscale produces artifacts.
I guess that's because it's using 512 tiles, not 1024 which are default for sd 3.5l?

bitter hearth Dec 22, 2024, 1:38 PM

#

SD 3.5L like 1024 yeah

analog bolt Dec 22, 2024, 3:43 PM

#

Setting tile size to 1024 did get rid of the artifacts, but there's clear chessboard pattern for brigthness

bitter hearth Dec 22, 2024, 3:46 PM

#

was there controlnet

analog bolt Dec 22, 2024, 3:53 PM

#

workflow should be included in image

finite osprey Dec 22, 2024, 3:53 PM

#

Trying dropping the setting to around 0.10 in the upscale toolbox

analog bolt Dec 22, 2024, 3:54 PM

#

denoise?

bitter hearth Dec 22, 2024, 4:42 PM

#

when denoise is low it means it only changes the image a little bit

analog bolt Dec 22, 2024, 4:44 PM

#

that was a question to @finite osprey. And I did reduce denoise to 0.1 and grid is still there.

finite osprey Dec 22, 2024, 4:46 PM

#

Not quite sure then maybe you should post your upscaler screenshot here so that people can help more

frail shoal Dec 22, 2024, 4:46 PM

#

Are there any fancy flow samplers that work with sd3 .5 in comfyui ?

bitter hearth Dec 22, 2024, 4:50 PM

#

yeah the clown stuff all works https://github.com/ClownsharkBatwing/RES4LYF
in stock Comfy samplers, Euler ancestral and DPM++ 2SA also work

#

as far as I understand

#

the thing that needs to be done for an SDE/ancestral sampler to work with rectified flow models, is for the SDE to be variance preserving rather than variance exploding

analog bolt Dec 22, 2024, 7:01 PM

#

What's with the bot constantly pinging alextitanov and mayz? annoying

craggy crest Dec 22, 2024, 7:21 PM

#

#

#

remote holly Dec 22, 2024, 8:27 PM

#

structured prompts work with T5 xl ?

#

prompt ```
<subject>
fashion presentation, professional video
</subject>

<actor>
a woman with black bobbed hair
</actor>

<clothes>
a white shirt, a very long black navy blue velvet coat, black wide pants, a blue shark-shaped cap, blue shoes
</clothes>

<action>
the woman is standing holding her hat
</action>

<background>
a bright room, a black background with a starry sky
</background>

<camera>
the camera films the woman from above
</camera>

<extra>
dynamic shadows, light reflections, fashion, professional video, filming, cinematic view, hyper realistic, 4k, masterpiece, smooth, detailed and sharp faces
</extra>

remote holly Dec 22, 2024, 9:23 PM

#

i love the colors and aesthetics but the hands ruin everything

#

prompt :

fashion presentation, professional video
</subject>

<style>
fashion, long exposure, professional photo, vhs, hyper realistic, cinema
</style>

<actor>
a woman with black bobbed hair
</actor>

<clothes>
a large black tee-shirt dress with blue bubble patterns, black boots with blue arrow patterns, a blue velvet hat
</clothes>

<action>
the woman is standing holding her hat over opaque pink water in space
</action>

<background>
a dark space background with stars and the planet earth in the distance
</background>

<extra>
reflections, dynamic shadows, light reflections, fashion, cinematic view, hyper realistic, 4k, masterpiece, smooth, detailed and sharp faces
</extra>```

heavy oriole Dec 22, 2024, 10:49 PM

#

frail shoal around 2000, but they are all very high quality.

I would also be willing to give it a try if you can share the images. Feel free to friend request if interested.

frail shoal Dec 22, 2024, 11:47 PM

#

fervent dragon Dec 23, 2024, 8:20 AM

#

thomas

muted dove Dec 23, 2024, 9:22 AM

#

pseudo owl Hunyuan can gen 20sec vids now with ctx window and not using extra vram. Will us...

How did you get a longer video? What's "CTX window"?

bitter hearth Dec 23, 2024, 10:52 AM

#

they mean context window
but I am not sure how it was done

#

if it was just passing the last frame as a new initial frame that doesn't rly count

#

its still useful to do that but its different

turbid grotto Dec 23, 2024, 12:57 PM

#

did anyone manage to install svdquants in comfy on windows?

muted onyx Dec 23, 2024, 3:10 PM

#

Is it okay to talk about the safeguard on StableDiffusion 3.5 Medium?
I have set pipe.safety_checker = None but there is still significant safeguarding

bitter hearth Dec 23, 2024, 3:14 PM

#

muted onyx Is it okay to talk about the safeguard on StableDiffusion 3.5 Medium? I have se...

this is a diffusers thing not a StableDiffusion 3.5 Medium thing

muted onyx Dec 23, 2024, 4:22 PM

#

bitter hearth this is a diffusers thing not a StableDiffusion 3.5 Medium thing

Should I switch to another UI (great if I can get suggestions), or can I somehow switch it off from Diffusers?

pseudo owl Dec 23, 2024, 4:34 PM

#

muted dove How did you get a longer video? What's "CTX window"?

not an expert, Kijai is the one who implemented it, but basically the steps are divided into segments(If you do 40steps, 20sec video with 5sec segments, then 10steps per segment) and there is an overlap factor so the next segment "remembers" what happened before.

It should be a bit better then passing last frame as next frame as neon said and the passing last frame technique won't even really work since hunyuan has no i2v yet.

#

Makes flux linear and makes it do 8k generation 6.3x faster at that res: https://github.com/Huage001/CLEAR

GitHub

GitHub - Huage001/CLEAR: Official PyTorch implementation of paper "...

Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". - Huage001/CLEAR

craggy crest Dec 23, 2024, 4:44 PM

#

I'm surprised no one's released in image 2 vid for hunyuan yet

pseudo owl Dec 23, 2024, 4:49 PM

#

craggy crest I'm surprised no one's released in image 2 vid for hunyuan yet

tencent said they are planning on releasing it on january, and looks like they have controlnets too but not sure when they will release that.

craggy crest Dec 23, 2024, 4:49 PM

#

that'll be fantastic if they meet that.

#

controlnet would just be an extra bonus

muted dove Dec 23, 2024, 4:51 PM

#

pseudo owl not an expert, Kijai is the one who implemented it, but basically the steps are ...

Thanks for explaining, but I still don't understand it. How can I use it in Comfy, is there a workflow/nodes to download?

pseudo owl Dec 23, 2024, 4:53 PM

#

muted dove Thanks for explaining, but I still don't understand it. How can I use it in Comf...

Kijai's wrapper has support for it, not sure if comfyui natively supports it: https://github.com/kijai/ComfyUI-HunyuanVideoWrapper

muted dove Dec 23, 2024, 4:55 PM

#

I use those nodes (comfy version just gives me blurry output), but I don't see anything that explains how to create a longer video. Increasing the frames just takes forever and crashes my PC...but that's another issue.

#

I'm using a 4090

bitter hearth Dec 23, 2024, 4:56 PM

#

muted onyx Should I switch to another UI (great if I can get suggestions), or can I somehow...

you can switch it off in diffusers

pseudo owl Dec 23, 2024, 4:57 PM

#

muted dove I use those nodes (comfy version just gives me blurry output), but I don't see a...

Should be something that says HyVideoContextOptions?

bitter hearth Dec 23, 2024, 5:02 PM

#

pseudo owl Makes flux linear and makes it do 8k generation 6.3x faster at that res: https:/...

wow thanks I missed this one

#

really important paper

#

PipeFusion is what I was looking at currently, it should stack with that

muted onyx Dec 23, 2024, 5:27 PM

#

bitter hearth you can switch it off in diffusers

May I ask how?
pipe.safety_checker = None does not seem to be sufficient

pseudo owl Dec 23, 2024, 5:31 PM

#

muted onyx May I ask how? pipe.safety_checker = None does not seem to be sufficient

wdym by safeguard? Is it giving black images or what?

muted onyx Dec 23, 2024, 5:31 PM

#

pseudo owl wdym by safeguard? Is it giving black images or what?

By safeguard I meant censor 🙈

pseudo owl Dec 23, 2024, 5:33 PM

#

muted onyx By safeguard I meant censor 🙈

I mean sd3.5 medium is not really trained for nsfw stuff if you mean that, ponyxl is a better choice then.

bitter hearth Dec 23, 2024, 5:44 PM

#

maybe try thesehttps://github.com/CompVis/stable-diffusion/issues/331#issuecomment-1562198856 https://stackoverflow.com/questions/77616823/stablediffusionpipeline-always-producing-empty-picture-probably-because-of-saf

bitter hearth Dec 23, 2024, 5:44 PM

#

pseudo owl I mean sd3.5 medium is not really trained for nsfw stuff if you mean that, ponyx...

he's getting dinged by a safety checker thing diffusers has

craggy crest Dec 23, 2024, 7:11 PM

#