#🆕｜sd3 | Stable Diffusion | Page 121

pseudo owl Nov 5, 2024, 9:50 PM

#

You can just input multiple images and they can understand them(you can ask questions or give instructions) They support video understanding and I believe oryx 1.5 and llava onevision support 3d understanding too.

I think MiniCPM v2.6 is the only one supported by ollama so far and I prefer that for vision understanding. They are around 7b-8b in size. I forgot that theres also Qwen2 vl which is pretty great(supports img, multi-img, vids long as 20min).

toxic bone Nov 5, 2024, 9:50 PM

#

was using 3.1 8b and some extension

pseudo owl Nov 5, 2024, 9:51 PM

#

Now there are even open models that can natively generate speech output and understand speech.

toxic bone Nov 5, 2024, 9:51 PM

#

pseudo owl Now there are even open models that can natively generate speech output and unde...

there's a lot of things all iterating along, and will soon converge and change the way we live.

cursive frigate Nov 5, 2024, 9:52 PM

#

pseudo owl You can just input multiple images and they can understand them(you can ask ques...

I've been using this one for a little over a month now and its been great. Was just hoping there have been some new stuff to torture my pc with.

https://ollama.com/doreilly/minicpm26_q5_k_m

doreilly/minicpm26_q5_k_m

Get up and running with large language models.

mortal mesa Nov 5, 2024, 9:52 PM

#

i dont find myself doing thing with vision models but way back when i did i used Moondream, that not even on the radar anymore? the newer stuff eclipses it?

toxic bone Nov 5, 2024, 9:53 PM

#

theres a weird thing you can do with flux an sd3. use one of these vision models to describe a long detailed prompt about something that was generated, and then give that back as a prompt and get a very similar image

#

i used florence large for that a few times

pseudo owl Nov 5, 2024, 9:54 PM

#

mortal mesa i dont find myself doing thing with vision models but way back when i did i used...

Moondream is still really fast and amazing but its hard to instruct it, it only gives captions or for vqa. You can feed that caption to a normal llm for better instruct capability but its somewhat limitated.

cursive frigate Nov 5, 2024, 9:54 PM

#

toxic bone theres a weird thing you can do with flux an sd3. use one of these vision models...

That is how I use it all the time

toxic bone Nov 5, 2024, 9:54 PM

#

cursive frigate That is how I use it all the time

it's a wild feedback loop. black box level magic

cursive frigate Nov 5, 2024, 9:56 PM

#

toxic bone it's a wild feedback loop. black box level magic

This is the workflow I use and the results are pretty good. The base of the workflow came from Aitrepreneur but I changed a lot of it to do more.

#

I made a section for just feeding the prompt LoRA keywords because AI sucks at keeping that text exactly the way it needs to be to trigger LoRAs and another section where I can tell it some basic instructions for the image and style I want and it also analyses the image and provides a prompt then I have it combine all of that to generate the image. The results have been really good.

#

If you want the workflow let me know and I will upload an image with it baked in.

toxic bone Nov 5, 2024, 10:03 PM

#

cursive frigate I made a section for just feeding the prompt LoRA keywords because AI sucks at k...

oh comfyui is not my scene right now. thanks though! i am coming back to dive into comfyui again soon. Factorio space age came out recently though so all the noodling part of my brain is , occupied currently

#

actually loaded fooocus today for a bit

cunning lintel Nov 5, 2024, 10:18 PM

#

cursive frigate Nov 5, 2024, 10:20 PM

#

toxic bone oh comfyui is not my scene right now. thanks though! i am coming back to dive in...

ComfyUI is great once you start digging in and figuring out how to modify it to do the things you want. I used focus a little bit about a year ago. It was pretty decent.

toxic bone Nov 5, 2024, 10:23 PM

#

cursive frigate ComfyUI is great once you start digging in and figuring out how to modify it to ...

i've used a lot of comfy. i'm no stranger to node graphs. factorio is just oen giant node graph of a game and that brain gear is engaged on that task. once factorio itch is scratched, i'll experiment with it again no doubt. maybe by christmas

#

i got time off then

cursive frigate Nov 5, 2024, 10:32 PM

#

toxic bone i've used a lot of comfy. i'm no stranger to node graphs. factorio is just oen g...

Looking forward to see some of your results. They always seem to be good when you put images in the chat.

bitter hearth Nov 5, 2024, 11:46 PM

#

I've had a theory for about 6 months now that if we can get good enough captioning models then we can fix tiled upscale

#

cos the reason tiled upscale gets duplicated subjects is that we are using the same prompt across all the tiles

craggy crest Nov 6, 2024, 12:51 AM

#

bitter hearth cos the reason tiled upscale gets duplicated subjects is that we are using the s...

you sure about this?

bitter hearth Nov 6, 2024, 12:59 AM

#

that's the usual explanation yeah

#

like imagine if it was just a photo of a cat on a rug

#

when generating the initial image every tile has a cat in it, cos there is only 1 tile

#

but imagine if we are at 4x upscale, there are 16 tiles, and many don't contain a cat yet the prompt says cat

mortal mesa Nov 6, 2024, 1:11 AM

#

ive prompted separate tiles, to much work for my use case which is pretty much nothing

craggy crest Nov 6, 2024, 1:26 AM

#

https://build.nvidia.com/nvidia/consistory

NVIDIA NIM

consistory model by nvidia | NVIDIA NIM

Generates consistent characters across a series of images without requiring additional training.

craggy wave Nov 6, 2024, 1:36 AM

#

Hi there ! Anyone here have a comfyUI workflow for sd 3.5 large with Lora Loader + hire fix feature ? Thanks 🙏

dusky thistle Nov 6, 2024, 1:42 AM

#

dusky thistle Nov 6, 2024, 2:32 AM

#

kind acorn Nov 6, 2024, 3:08 AM

#

Paint a picture of sunset and lone ducks flying, autumn water together in the sky

#

@dusky thistle Paint a picture of sunset and lone ducks flying, autumn water together in the sky

craggy crest Nov 6, 2024, 4:18 AM

#

kind acorn Paint a picture of sunset and lone ducks flying, autumn water together in the sk...

bitter hearth Nov 6, 2024, 4:18 AM

#

mortal mesa ive prompted separate tiles, to much work for my use case which is pretty much n...

yeah I have a workflow where I hand prompted 25 tiles it was too much effort

#

hoping vision models can do that well soon

bitter hearth Nov 6, 2024, 5:02 AM

#

new MiaoshouAI tagger version https://huggingface.co/MiaoshouAI/Florence-2-large-PromptGen-v2.0

#

it can give output for T5 and ClipL now

noble coyote Nov 6, 2024, 6:54 AM

#

President Donald Trump - again - PotUSA number 47!

craggy crest Nov 6, 2024, 7:00 AM

#

noble coyote President Donald Trump - again - PotUSA number 47!

shouldn't that be in #🌶｜off-topic

noble coyote Nov 6, 2024, 7:01 AM

#

"I think it should be in knead two kneau!" 🥳

craggy crest Nov 6, 2024, 7:02 AM

#

noble coyote "I think it should be in knead two kneau!" 🥳

i so want to make a bread pun, but i'm too tired to think

noble coyote Nov 6, 2024, 7:02 AM

#

Florence2/Flux img2img

noble coyote Nov 6, 2024, 7:04 AM

#

craggy crest i so want to make a bread pun, but i'm too tired to think

Yeah - you're in here all-hours - I've had 7 hours shuteye since our paths crossed!!!

craggy crest Nov 6, 2024, 7:04 AM

#

noble coyote Yeah - you're in here all-hours - I've had 7 hours shuteye since our paths cross...

it's just now midnight for me

noble coyote Nov 6, 2024, 7:04 AM

#

Mid-West USA

craggy crest Nov 6, 2024, 7:04 AM

#

arizona

#

no daylight savings

noble coyote Nov 6, 2024, 7:05 AM

#

Beautiful State - I love the series of Route 66 photos of Hackberry Az (by Carol M Highsmith)

#

Are your clocks set for 'winter time' yet?

craggy crest Nov 6, 2024, 7:06 AM

#

i love driving on parts of the old route 66

#

there are still burma shave signs on part of it

noble coyote Nov 6, 2024, 7:06 AM

#

Its 7 a.m. here in London UK

craggy crest Nov 6, 2024, 7:06 AM

#

good morning then :)

noble coyote Nov 6, 2024, 7:06 AM

#

Yes, Route 66 is a wellspring of Americana

craggy crest Nov 6, 2024, 7:07 AM

#

a lot of it is.

noble coyote Nov 6, 2024, 7:07 AM

#

craggy crest good morning then :)

"Good Morning Arizona!!!!!!!!!" 🥳

craggy crest Nov 6, 2024, 7:08 AM

#

noble coyote "Good Morning Arizona!!!!!!!!!" 🥳

there was a guy in here earlier trying to figure out how to generate. he never did, but he donated a very nice prompt "Paint a picture of sunset and lone ducks flying, autumn water together in the sky"

#

noble coyote Nov 6, 2024, 7:09 AM

#

I saw CSBW's version - an eagle in place of some ducks!!! 😄

craggy crest Nov 6, 2024, 7:09 AM

#

i saw that too, i think he added that on the sly

noble coyote Nov 6, 2024, 7:09 AM

#

craggy crest

Nice, very soothing

craggy crest Nov 6, 2024, 7:09 AM

#

i wrote that one down, i quite like it

noble coyote Nov 6, 2024, 7:12 AM

#

Here's a vast prompt I got from another forum ... freely posted ... if you're into B&W?

📎 Prompt_from_Discord.txt

muted dove Nov 6, 2024, 7:12 AM

#

Surprisingly nice image for such a bad prompt 😄

#

The ducks I mean

craggy crest Nov 6, 2024, 7:13 AM

#

muted dove Surprisingly nice image for such a bad prompt 😄

yeah. that was with 3.5 large, no loras. very plain vanilla

craggy crest Nov 6, 2024, 7:13 AM

#

noble coyote Here's a vast prompt I got from another forum ... freely posted ... if you're in...

yeah, but i'd be likely to render each sentence as it's own prompt

muted dove Nov 6, 2024, 7:13 AM

#

"Lone" ducks, flying "together" 🤦🏻‍♂️

craggy crest Nov 6, 2024, 7:13 AM

#

too many tokens in that

craggy crest Nov 6, 2024, 7:14 AM

#

muted dove "Lone" ducks, flying "together" 🤦🏻‍♂️

actually, there's a comma after ducks and so: "autumn water together in the sky" i really don't know how autum water can be together, but if it's in the sky, it's normally rain

noble coyote Nov 6, 2024, 7:14 AM

#

muted dove Surprisingly nice image for such a bad prompt 😄

"There are no bad prompts!"
Prompts are rescued by the alchemy of SD.
Prejudice against "short-prompts" should be disparaged 😄

craggy crest Nov 6, 2024, 7:15 AM

#

noble coyote "There are no bad prompts!" Prompts are rescued by the alchemy of SD. Prejudice ...

an apple spinning in space

#

noble coyote Nov 6, 2024, 7:16 AM

#

craggy crest too many tokens in that

It produces amazing results - I think the prompt-maker doesn't realise that only about one tenth of that text ever gets used!

noble coyote Nov 6, 2024, 7:16 AM

#

craggy crest an apple spinning in space

OK, so you have the most left-field and surreal workflow!!! 😄

#

I love the way you have the cougar "looking at that spinning apple!"

craggy crest Nov 6, 2024, 7:17 AM

#

noble coyote OK, so you have the most left-field and surreal workflow!!! 😄

rofl. i gave you a prompt, and then posted an image from the skip layers test i'm doing. they really don't relate to each other

noble coyote Nov 6, 2024, 7:18 AM

#

🥳

craggy crest Nov 6, 2024, 7:18 AM

#

told you i was tired ;)

noble coyote Nov 6, 2024, 7:18 AM

#

It brings out your sense-of-humour

craggy crest Nov 6, 2024, 7:19 AM

#

here's the first paragraph of that monster prompt you posted

noble coyote Nov 6, 2024, 7:19 AM

#

craggy crest here's the first paragraph of that monster prompt you posted

It works.

craggy crest Nov 6, 2024, 7:19 AM

#

noble coyote It brings out your sense-of-humour

that it does. i get fairly silly

noble coyote Nov 6, 2024, 7:20 AM

#

Silly is good. This room can get a tad doomy at times 😦

craggy crest Nov 6, 2024, 7:20 AM

#

second paragraph

noble coyote Nov 6, 2024, 7:20 AM

#

#

craggy crest Nov 6, 2024, 7:21 AM

#

third paragraph

noble coyote Nov 6, 2024, 7:21 AM

#

craggy crest Nov 6, 2024, 7:21 AM

#

5 prompts for the price of one ;)

#

untold valley Nov 6, 2024, 7:22 AM

#

ppl need to add token*counter to comfyui cant have over i belive 256 tokens or something like that

muted dove Nov 6, 2024, 7:22 AM

#

craggy crest third paragraph

The whole thing

craggy crest Nov 6, 2024, 7:22 AM

#

noble coyote Nov 6, 2024, 7:23 AM

#

There is a URL available (!) which parses your prompt and returns the number of tokens it contains

craggy crest Nov 6, 2024, 7:23 AM

#

muted dove The whole thing

lost the black and white somewhere. love the way it wrote the words out though

craggy crest Nov 6, 2024, 7:23 AM

#

noble coyote There is a URL available (!) which parses your prompt and returns the number of ...

this is for stable https://sd-tokenizer.rocker.boo/ BUT it is model specific, each model will tokenize differently

Stable Diffusion Tokenizer

Informs you about how your prompt/words gets turned into tokens, privately. For Stable Diffusion models, CLIP models

noble coyote Nov 6, 2024, 7:24 AM

#

Yes - the prompt-owner either has a great grasp of language - or he's sleeping with an LLM!

dusky thistle Nov 6, 2024, 7:24 AM

#

craggy crest Nov 6, 2024, 7:25 AM

#

he's sleeping with an llm - however, at least for flux and SD3.X - since the images in the dataset were captioned with CogVLM - getting an LLM to create prompts for them works perfectly

#

you just have to remember to tell it how many tokens

noble coyote Nov 6, 2024, 7:25 AM

#

#

#

craggy crest Nov 6, 2024, 7:26 AM

#

noble coyote

because everyone should have beer with their breakfast

muted dove Nov 6, 2024, 7:27 AM

#

Ferrari inspired armour design

noble coyote Nov 6, 2024, 7:27 AM

#

When I visited Hungary many moons ago - so many people breakfasted on beer!!!

noble coyote Nov 6, 2024, 7:28 AM

#

muted dove Ferrari inspired armour design

YMCA Fireman!!!

craggy crest Nov 6, 2024, 7:28 AM

#

craggy crest Nov 6, 2024, 7:28 AM

#

noble coyote When I visited Hungary many moons ago - so many people breakfasted on beer!!!

usually you put it IN the pancakes, however

noble coyote Nov 6, 2024, 7:29 AM

#

The prompt consisted of "the white witch eats pancakes and drinks beer with the red knight!"

#

A secondary prompt dropped the beer and pancakes, and substitued "does the laundry"

muted dove Nov 6, 2024, 7:30 AM

#

It's really struggling with monochrome!

craggy crest Nov 6, 2024, 7:30 AM

#

#

prompt: a group of giggling girls chasing geese

noble coyote Nov 6, 2024, 7:30 AM

#

^..^<

muted dove Nov 6, 2024, 7:31 AM

#

noble coyote Nov 6, 2024, 7:31 AM

#

#

craggy crest Nov 6, 2024, 7:31 AM

#

@-->--->---

noble coyote Nov 6, 2024, 7:32 AM

#

dusky thistle Nov 6, 2024, 7:32 AM

#

#

noble coyote Nov 6, 2024, 7:32 AM

#

Hi, you look familiar?!

craggy crest Nov 6, 2024, 7:33 AM

#

<%%%%|==========>

noble coyote Nov 6, 2024, 7:33 AM

#

Sword

craggy crest Nov 6, 2024, 7:33 AM

#

--~~~=:>[XXXXXXXXX]>

noble coyote Nov 6, 2024, 7:33 AM

#

<*))))))))))><

craggy crest Nov 6, 2024, 7:33 AM

#

:)

#

@@@@:|

#

@@@@:)

#

that works better

dusky thistle Nov 6, 2024, 7:34 AM

#

noble coyote Nov 6, 2024, 7:35 AM

#

Indigenous beefcake

craggy crest Nov 6, 2024, 7:37 AM

#

black and white portrait photo of an elderly native american in the 1800s

noble coyote Nov 6, 2024, 7:37 AM

#

untold valley Nov 6, 2024, 7:37 AM

#

Oh damn, Trump won.

noble coyote Nov 6, 2024, 7:38 AM

#

PotUSA 47

dusky thistle Nov 6, 2024, 7:39 AM

#

noble coyote Nov 6, 2024, 7:39 AM

#

muted dove Nov 6, 2024, 7:40 AM

#

What a disaster and a terrible image of the American population. They were just starting to gain some credibility back with the rest of the world. I fear it'll be a worse shit-show than last time too.

craggy crest Nov 6, 2024, 7:40 AM

#

prompt: diana ross and the supreme sandwich

dusky thistle Nov 6, 2024, 7:40 AM

#

untold valley Oh damn, Trump won.

so far beyond fucked up i don't even know what to say

noble coyote Nov 6, 2024, 7:40 AM

#

craggy crest Nov 6, 2024, 7:40 AM

#

muted dove What a disaster and a terrible image of the American population. They were just ...

can we not use this channel for politics? there's an #🌶｜off-topic channel for stuff that's not Stable diffusion

dusky thistle Nov 6, 2024, 7:40 AM

#

muted dove What a disaster and a terrible image of the American population. They were just ...

it will be much, much worse

#

noble coyote Nov 6, 2024, 7:41 AM

#

The USA is entering a period of Unstable Diffusion

muted dove Nov 6, 2024, 7:41 AM

#

Weighing up her options after finding out who the next prez will be.

dusky thistle Nov 6, 2024, 7:42 AM

#

absolute nightmare tbh

#

noble coyote Nov 6, 2024, 7:42 AM

#

The evil cabal of Farage Musk and PotUSA

#

GM all y'all

#

Welcome to a new dawn

dusky thistle Nov 6, 2024, 7:44 AM

#

noble coyote Nov 6, 2024, 7:44 AM

#

Your excellent w/f can take up to 15 minutes per image on my 8Gb VRAM PC 🥳

craggy crest Nov 6, 2024, 7:47 AM

#

noble coyote Your excellent w/f can take up to 15 minutes per image on my 8Gb VRAM PC 🥳

there's a 12 step program for that

dusky thistle Nov 6, 2024, 7:47 AM

#

#

thse take about 50 sec on mine

noble coyote Nov 6, 2024, 7:48 AM

#

Great!

craggy crest Nov 6, 2024, 7:48 AM

#

they don't take any time at all on mine.

noble coyote Nov 6, 2024, 7:48 AM

#

I wonder how my (desired) 5090 will do?

craggy crest Nov 6, 2024, 7:48 AM

#

(cause i can't run 'em)

noble coyote Nov 6, 2024, 7:48 AM

#

🙂

craggy crest Nov 6, 2024, 7:48 AM

#

waiting for him to get his node into manager

noble coyote Nov 6, 2024, 7:49 AM

#

I just updated RES4LYF

dusky thistle Nov 6, 2024, 7:50 AM

#

dusky thistle Nov 6, 2024, 7:51 AM

#

craggy crest waiting for him to get his node into manager

did you try installing from the link using the manager

#

noble coyote Nov 6, 2024, 7:52 AM

#

Is installing via Manager any worse or better than a git pull?

craggy crest Nov 6, 2024, 7:52 AM

#

dusky thistle did you try installing from the link using the manager

not yet, no. i've been slogging through one skip layer after the next for 48 hours now

craggy crest Nov 6, 2024, 7:52 AM

#

noble coyote Is installing via Manager any worse or better than a git pull?

yes

noble coyote Nov 6, 2024, 7:52 AM

#

How?

craggy crest Nov 6, 2024, 7:52 AM

#

you mean git clone

noble coyote Nov 6, 2024, 7:53 AM

#

Git pull as in update?

craggy crest Nov 6, 2024, 7:53 AM

#

but you hve to have it before you can update

noble coyote Nov 6, 2024, 7:53 AM

#

So a Manager-inuced install wins hands down over a git clone? How?

dusky thistle Nov 6, 2024, 7:54 AM

#

not sure

noble coyote Nov 6, 2024, 7:54 AM

#

OK

dusky thistle Nov 6, 2024, 7:54 AM

#

but he doesn't have pywt installed in the right spot

#

i guess manager handles that

craggy crest Nov 6, 2024, 7:54 AM

#

craggy crest Nov 6, 2024, 7:54 AM

#

noble coyote So a Manager-inuced install wins hands down over a git clone? How?

it doesn't

dusky thistle Nov 6, 2024, 7:54 AM

#

you can also just comment out the import pywt btw

noble coyote Nov 6, 2024, 7:54 AM

#

In truth, it probably is no difference

dusky thistle Nov 6, 2024, 7:54 AM

#

it just won't let you use the wavelets noise type then

#

but it's one of 17

craggy crest Nov 6, 2024, 7:54 AM

#

noble coyote In truth, it probably is no difference

there is, actually

noble coyote Nov 6, 2024, 7:55 AM

#

But what?

craggy crest Nov 6, 2024, 7:55 AM

#

sometimes manager's install works great. and othertimes it barfs and git clone is what you wind up with

noble coyote Nov 6, 2024, 7:56 AM

#

What is the essential difference between a git clone install/Manager install?

craggy crest Nov 6, 2024, 7:56 AM

#

when one doesn't work, the other usually does?

noble coyote Nov 6, 2024, 7:57 AM

#

OK, but even if both work - which one is better?

#

#

#

untold valley Nov 6, 2024, 7:58 AM

#

noble coyote What is the essential difference between a git clone install/Manager install?

you do it yourself vs a program does it for you

craggy crest Nov 6, 2024, 7:58 AM

#

noble coyote OK, but even if both work - which one is better?

neither, both, doesn't really correlate

untold valley Nov 6, 2024, 7:58 AM

#

same thing end result

craggy crest Nov 6, 2024, 7:58 AM

#

"does not compute"

dusky thistle Nov 6, 2024, 8:01 AM

#

untold valley Nov 6, 2024, 8:01 AM

#

anyone been able to recreate 1.5 skin textures with 3.5 yet? like somehow removing the "stylized" format that it outputs. its nice, eye pleasing but sometimes you want that generic non-retouched looking image

dusky thistle Nov 6, 2024, 8:01 AM

#

untold valley anyone been able to recreate 1.5 skin textures with 3.5 yet? like somehow removi...

it can do better than 1.5 skin textures

#

tbh, sdxl can too

untold valley Nov 6, 2024, 8:02 AM

#

dusky thistle it can do better than 1.5 skin textures

need like prompt or keywords, ive been struggling w it

noble coyote Nov 6, 2024, 8:02 AM

#

#

muted dove Nov 6, 2024, 8:05 AM

#

untold valley need like prompt or keywords, ive been struggling w it

Maybe it's down to the workflow and not just the prompt. You shouldn't need to prompt for skin detail.

untold valley Nov 6, 2024, 8:07 AM

#

yeah maybe its user error

dusky thistle Nov 6, 2024, 8:08 AM

#

muted dove Nov 6, 2024, 8:09 AM

#

untold valley yeah maybe its user error

The prompt I typed for this was just a face

#

This was with freckles, but the refiner went overboard with it 😄

dusky thistle Nov 6, 2024, 8:15 AM

#

#

#

a close up amateur cell phone photo of a woman smiling in her messy apartment

untold valley Nov 6, 2024, 8:19 AM

#

maybe its just that we are genning in a higher res but you can kinda see how they have some sort of plasticity,

#

not my images from *photoreal civitai page, but it just feels different, resolution may be the key. but sdxl,3,3.5 dont have this type of airbrush, high make up feeling. idk maybe im just seeing things and need sleep.

dusky thistle Nov 6, 2024, 8:23 AM

#

those do look a lil airbrushed yea

muted dove Nov 6, 2024, 8:24 AM

#

This looks perfectly normal to me

dusky thistle Nov 6, 2024, 8:24 AM

#

muted dove Nov 6, 2024, 8:24 AM

#

That's a noisy mess 🤷🏻‍♂️

dusky thistle Nov 6, 2024, 8:25 AM

#

#

that's the point

#

amateur cell phone photo should look like this

muted dove Nov 6, 2024, 8:26 AM

#

Why? It doesn't look good. Not even like a poor amateur photo.

dusky thistle Nov 6, 2024, 8:26 AM

#

disagree

muted dove Nov 6, 2024, 8:26 AM

#

It's ok as small images on Discord, but full size they're bad.

dusky thistle Nov 6, 2024, 8:27 AM

#

one shot quick generations with no refinement using sd35M

#

the faces you posted above look pretty plastic

#

it's more convincing on the low end quality side

#

most camera photos are amazingly crap

muted dove Nov 6, 2024, 8:31 AM

#

dusky thistle it's more convincing on the low end quality side

I think yours are too far in the low end, so they don't look good or realistic. I'm not being confrontational, just my own opinion 🙂

dusky thistle Nov 6, 2024, 8:31 AM

#

muted dove I think yours are too far in the low end, so they don't look good or realistic. ...

search around for phone resolution photos on google

#

without any other context

#

what i posted is better than most of em

#

they're blurry, hazy, full of artifacts

muted dove Nov 6, 2024, 8:33 AM

#

Nobody has phones that bad nowadays though 😄

untold valley Nov 6, 2024, 8:33 AM

#

ok awesome at least im not losing my mind and you kinda sorta are getting it that left one leans in the right direction, but u see the plasticity thing on the rights forehead.

dusky thistle Nov 6, 2024, 8:33 AM

#

muted dove Nobody has phones that bad nowadays though 😄

most photos look worse than that

#

and maybe theoretically the camera phones are good but they're usually not used that well

#

glare, dirty lenses, blur from the hand shaking, poor lighting, etc

#

half the time you can barely make out the structure of the iris

#

#

without the cell phone part

#

sd35M

muted dove Nov 6, 2024, 8:37 AM

#

untold valley ok awesome at least im not losing my mind and you kinda sorta are getting it tha...

Isn't that just bright light reflecting off moist skin, or do you mean the texture in that area?

untold valley Nov 6, 2024, 8:37 AM

#

muted dove Isn't that just bright light reflecting off moist skin, or do you mean the textu...

texture

untold valley Nov 6, 2024, 8:38 AM

#

dusky thistle

kinda sorta zeroing in this and the left hand image of glaxy pink tank top shirts going the right way

muted dove Nov 6, 2024, 8:38 AM

#

That could be down to a combination of using Flux as a refiner and the sampler/scheduler choice.

noble coyote Nov 6, 2024, 8:38 AM

#

muted dove Nov 6, 2024, 8:42 AM

#

Skin is textured, I think it looks acceptable in this one. 🤷🏻‍♂️

#

Anyone got a pin?

#

untold valley Nov 6, 2024, 8:46 AM

#

muted dove I think yours are too far in the low end, so they don't look good or realistic. ...

what prompt did you use for image on the left if you are pry to share? think that was the closest

muted dove Nov 6, 2024, 8:47 AM

#

untold valley what prompt did you use for image on the left if you are pry to share? think tha...

They were all the same prompt.

untold valley Nov 6, 2024, 8:48 AM

#

so just "a face"

muted dove Nov 6, 2024, 8:48 AM

#

This one was...

a young tanned __nationality__ woman with fair skin and a petite physique. She has a round face with a soft, natural makeup look, and her hair is styled in a casual, side-swept bob. She is wearing a fitted, sleeveless, pink ribbed tank top that accentuates her small to medium-sized breasts. Her attire includes thin, transparent straps that add a modern, minimalist touch. The woman is wearing round, gold-rimmed glasses that frame her face, giving her a studious appearance. Her expression is happy, with a smile, and she is looking directly at the camera with a confident stance, one arm raised to adjust her hair.
In the background, a dark street scene

dusky thistle Nov 6, 2024, 8:48 AM

#

noble coyote Nov 6, 2024, 8:49 AM

#

muted dove Nov 6, 2024, 8:49 AM

#

@untold valley I do use an LLM in my workflow, so that prompt isn't necessarily what is used for the end result.

untold valley Nov 6, 2024, 8:49 AM

#

dusky thistle

baby shark do do dooo dooo baby shark! that hats awesome.

untold valley Nov 6, 2024, 8:51 AM

#

muted dove <@563203398443204608> I do use an LLM in my workflow, so that prompt isn't neces...

thank you both for helping, i have some ideas i may experiment with. they aesthetic may yet be there. a prompt that big will not make it consistent but ill dig thru and see if there are any key tokens there

dusky thistle Nov 6, 2024, 8:52 AM

#

muted dove Nov 6, 2024, 8:52 AM

#

baby shark

dusky thistle Nov 6, 2024, 8:53 AM

#

doododoo

untold valley Nov 6, 2024, 8:55 AM

#

dusky thistle

yeah we got close! the idea is there. thanks!

noble coyote Nov 6, 2024, 8:55 AM

#

muted dove Nov 6, 2024, 8:58 AM

#

dusky thistle Nov 6, 2024, 8:58 AM

#

#

muted dove Nov 6, 2024, 9:01 AM

#

dusky thistle Nov 6, 2024, 9:03 AM

#

#

#

untold valley Nov 6, 2024, 9:08 AM

#

a lot of workflows now incorporate LLM's keep trying to see examples but its crazy how much you can tell a ai wrote it. we have come full circle, using ai to generate ai prompts to generate ai art.

dusky thistle Nov 6, 2024, 9:08 AM

#

muted dove Nov 6, 2024, 9:09 AM

#

dusky thistle Nov 6, 2024, 9:09 AM

#

untold valley Nov 6, 2024, 9:13 AM

#

@dusky thistle @muted dove instagram photo is helping facepalm

dusky thistle Nov 6, 2024, 9:15 AM

#

muted dove Nov 6, 2024, 9:15 AM

#

an amateur iphone photo of a face

untold valley Nov 6, 2024, 9:16 AM

#

awesome

#

iphone, instagram jfc just need to think like a white girl out to get her venti mocha latte from starbucks bobagirl

dusky thistle Nov 6, 2024, 9:17 AM

#

untold valley Nov 6, 2024, 9:19 AM

#

i could tell you used "sharp image" in your prompt lol

dusky thistle Nov 6, 2024, 9:20 AM

#

muted dove Nov 6, 2024, 9:20 AM

#

This is raw sd3.5 output

untold valley Nov 6, 2024, 9:22 AM

#

that's a really clean crisp gen

muted dove Nov 6, 2024, 9:23 AM

#

I don't like the neck

dusky thistle Nov 6, 2024, 9:23 AM

#

held up surprisngly well considering how close up it is... sd35 is a lil better than expected there

#

normally that just turns into nonsense

#

they're pretty coherent models for sure

untold valley Nov 6, 2024, 9:24 AM

#

regarless of anything sd3.5m is a great model

untold valley Nov 6, 2024, 9:24 AM

#

dusky thistle they're pretty coherent models for sure

yeah

muted dove Nov 6, 2024, 9:25 AM

#

I'm using SD3.5L

untold valley Nov 6, 2024, 9:26 AM

#

have you tried medium?

muted dove Nov 6, 2024, 9:26 AM

#

Yes

dusky thistle Nov 6, 2024, 9:26 AM

#

i bounce back and forth all the time

muted dove Nov 6, 2024, 9:26 AM

#

Me too, they're all just different models

dusky thistle Nov 6, 2024, 9:26 AM

#

everything recently above is medium

#

yeah def

#

no doubt the training set was different

muted dove Nov 6, 2024, 9:27 AM

#

I don't like how SD3.5 left the face blurry, but refining with Flux makes freckles look like the pox.

dusky thistle Nov 6, 2024, 9:29 AM

#

#

flux tends to make skin look plastic

#

it's better to have something be noisy and shitty looking than too clean

#

very refined sets off our BS detectors

muted dove Nov 6, 2024, 9:30 AM

#

Sometimes I'd agree, but Flux does tidy up a lot of the "mess". It fixes hands and fingers for example.

untold valley Nov 6, 2024, 9:31 AM

#

i will say based on that comparison the textures on materials like brick etc stand out, the skin texture not so much but the bricks/cement 🤌

muted dove Nov 6, 2024, 9:32 AM

#

...and the watch detail

#

...eyes, hair... 😄

untold valley Nov 6, 2024, 9:35 AM

#

kinda reminds me of like 2.0 good at "landscapes" horrible everything else

#

except better at many things

#

minus the humans

#

thing that bugs me ab 3.5 is it loves loves the 3/4 shot

#

doesnt like to zoom out

dusky thistle Nov 6, 2024, 9:37 AM

#

dusky thistle Nov 6, 2024, 9:38 AM

#

muted dove Sometimes I'd agree, but Flux does tidy up a lot of the "mess". It fixes hands a...

yeah i'm referring more to textures, details

#

jpeg artifacts are good for fooling the eye obv

#

the smudge patterns phones give too

#

#

#

bold fossil Nov 6, 2024, 9:42 AM

#

Underwater world with colorful fish, coral reefs, and sunken ship, illuminated by natural light filtering through water, in a hyper-realistic style

limpid thunderBOT Nov 6, 2024, 9:44 AM

#

Thank you for using comcom analytics.
"comcom analytics" supports all community managers (moderators and server owners) by stats, visualization, and analytics.

If you have any questions, feel free to ask us!
Your dashboard
Help
Support server

Other languages
en: help
ja: help Japanese

dusky thistle Nov 6, 2024, 9:46 AM

#

bold fossil Underwater world with colorful fish, coral reefs, and sunken ship, illuminated b...

Here is the image you requested.

muted dove Nov 6, 2024, 9:47 AM

#

Love the window/water detail in this (raw sd3.5L)

dusky thistle Nov 6, 2024, 9:48 AM

#

#

untold valley Nov 6, 2024, 9:48 AM

#

could I ask which one of these appeals/ looks the best according to wte criteria you want?

dusky thistle Nov 6, 2024, 9:49 AM

#

#

ask dinogator

#

muted dove Nov 6, 2024, 9:57 AM

#

untold valley could I ask which one of these appeals/ looks the best according to wte criteria...

I don't like snow on the face and the eyes are worst in the 1st one. So it's a difficult choice.

untold valley Nov 6, 2024, 10:00 AM

#

I can’t make up my mind either

dusky thistle Nov 6, 2024, 10:00 AM

#

#

#

#

muted dove Nov 6, 2024, 10:19 AM

#

dusky thistle

Not that sort of "pool"! 😄

dusky thistle Nov 6, 2024, 10:21 AM

#

#

#

#

ocean mural Nov 6, 2024, 1:27 PM

#

make image of an alien galactic council

noble coyote Nov 6, 2024, 1:50 PM

#

Thwaark!!!

dusky thistle Nov 6, 2024, 4:06 PM

#

fossil pagoda Nov 6, 2024, 4:24 PM

#

241106132239_Silhouette_Art_breathtaking_detailed_masterpiece_aweinspiring_radiant_magnificen__00019_.png

dusky thistle Nov 6, 2024, 4:24 PM

#

forget if i said this last night, but i got the deis samplers working!

#

it's kinda interesting - the third order multistep samplers seem to struggle a tiny bit with SD3.5, maybe because we're using CFG

#

DEIS_3M is basically what comfyui uses when you select "DEIS", i'm finding DEIS_2m is much better

craggy crest Nov 6, 2024, 4:35 PM

#

dusky thistle DEIS_3M is basically what comfyui uses when you select "DEIS", i'm finding DEIS_...

how about releaseing a clownsampler node that's just your versions of the samplers and schedulers that can replace the ksampler node

dusky thistle Nov 6, 2024, 4:36 PM

#

craggy crest how about releaseing a clownsampler node that's just your versions of the sample...

def a good idea

#

i was thinking of having a ksampler select and a "klownsharksampler" (who knows what i'd call it, but yeah)

#

equivalents to that

#

gotta work out some good presets first... generated like 9k sd35 images now to that end

#

but yea def agree it would be good to have a simple interface for easy entry

#

#

#

#

SD3.5 Medium

#

#

#

#

living rooms are something diffusion models really struggled with before

craggy crest Nov 6, 2024, 4:44 PM

#

dusky thistle

Kitty!

dusky thistle Nov 6, 2024, 4:44 PM

#

it's great with animals

old walrus Nov 6, 2024, 5:00 PM

#

medieval empress sitting on her throne, dark fantasy, digital art illustration

bitter hearth Nov 6, 2024, 5:22 PM

#

oh no

#

I opened the workflow on the kitty image

noble coyote Nov 6, 2024, 5:28 PM

#

craggy crest Nov 6, 2024, 5:29 PM

#

when your cat has an affair with a rat

craggy crest Nov 6, 2024, 5:29 PM

#

bitter hearth I opened the workflow on the kitty image

what'd ya find?

bitter hearth Nov 6, 2024, 5:31 PM

#

chaos cloud

craggy crest Nov 6, 2024, 5:34 PM

#

bitter hearth chaos cloud

??

noble coyote Nov 6, 2024, 5:38 PM

#

£10 cash

craggy crest Nov 6, 2024, 5:38 PM

#

that was interesting

bitter hearth Nov 6, 2024, 5:46 PM

#

just a lot of nodes

#

in a big cloud

marble sand Nov 6, 2024, 5:47 PM

#

lol

noble coyote Nov 6, 2024, 5:52 PM

#

Flux and Silhuflowart2 LoRA

pseudo owl Nov 6, 2024, 6:02 PM

#

Tried will smith eating spaghetti with Mochi-1 from genmo website(apache2.0 open model)
I mean not bad but not perfect.

sacred jewel Nov 6, 2024, 6:22 PM

#

untold valley could I ask which one of these appeals/ looks the best according to wte criteria...

TESTOTHER is my preference

untold valley Nov 6, 2024, 6:24 PM

#

sacred jewel TESTOTHER is my preference

I appreciate the feedback thank you 😊

bitter hearth Nov 6, 2024, 6:25 PM

#

found a gguf of flan T5

#

https://huggingface.co/dumb-dev/flan-t5-xxl-gguf

#

its better than normal T5

#

its similar idea to common thing where people replace Clip-L with this https://huggingface.co/zer0int/CLIP-GmP-ViT-L-14

#

don't rly need a gguf for Clip its so tiny already lol

craggy crest Nov 6, 2024, 6:56 PM

#

pseudo owl Tried will smith eating spaghetti with Mochi-1 from genmo website(apache2.0 open...

he ate the fork!

sacred jewel Nov 6, 2024, 7:01 PM

#

Cool LoRA thanks for heads up 😄

noble coyote Nov 6, 2024, 7:02 PM

#

You're welcome!

rapid pivot Nov 6, 2024, 8:55 PM

#

pseudo owl Tried will smith eating spaghetti with Mochi-1 from genmo website(apache2.0 open...

This prompt

#

The nightmares ive seen...

#

sadcat

pseudo owl Nov 6, 2024, 9:39 PM

#

rapid pivot This prompt

lol, yeah its a massive increase in quality compared to previous models

sullen moss Nov 7, 2024, 12:04 AM

#

https://blackforestlabs.ai/flux-1-1-ultra/

Black Forest Labs

Introducing FLUX1.1 [pro] Ultra and Raw Modes

Black Forest Labs are proud to launch a new ultra option to Flux 1.1 PRO

#

#

#

pseudo owl Nov 7, 2024, 12:11 AM

#

Cool but sadly closed source, images do look great but are they even going to release something open now?

dusky thistle Nov 7, 2024, 12:18 AM

#

#

#

#

#

#

bitter hearth Nov 7, 2024, 1:45 AM

#

the flux pro update does look amazing

#

2048x2048 native, and they made a realism mode

sullen moss Nov 7, 2024, 1:51 AM

#

nM6A-yQAF5H3gQMNop34Z_1b7122ec552e4973a094e4307581dfa1.jpg

sullen moss Nov 7, 2024, 1:52 AM

#

bitter hearth the flux pro update does look amazing

Yep

#

OW04XcVUU6bm9HxkMfCd1_7b6bb94cf8564a79bb5fcb39be6dde61.jpg

#

ziLXhQW9tQqcVuhHX7T6m_b187d8aa21794b9c9d747e2fb136113e.jpg

#

8BWGFDNMEwYbRwDC_VfA6_5d268bbf52fa472a90b50b8ec608b475.jpg

#

14X8urpx2Mqvtu1881X2K_7fac23c0eb5142438bf8c50e125fefc0.jpg

#

D0AWm801osmkbHcykIQ19_4f3f1f9f3f8f43caaf810cd2f40c5af7.jpg

#

This is really crazy

craggy crest Nov 7, 2024, 2:11 AM

#

bitter hearth the flux pro update does look amazing

pictures or it didn't happen ;)

dusky thistle Nov 7, 2024, 3:21 AM

#

#

cedar axle Nov 7, 2024, 3:29 AM

#

SD3.5L

dusky thistle Nov 7, 2024, 3:31 AM

#

#

#

#

#

#

#

#

#

#

#

#

cedar axle Nov 7, 2024, 3:49 AM

#

dusky thistle

really fun texture on this one 😍

dusky thistle Nov 7, 2024, 3:49 AM

#

#

#

craggy crest Nov 7, 2024, 4:16 AM

#

A dark, Gothic castle floating on a mist-covered, rocky island in the sky | towering spires stonework, eerie, isolated atmosphere | surrounded by thick fog, fading into a bleak, overcast sky | minimalist composition with high contrast | surreal, haunting ambiance

dusky thistle Nov 7, 2024, 4:30 AM

#

errant dust Nov 7, 2024, 4:31 AM

#

bitter hearth the flux pro update does look amazing

I must have missed the news. What update? NVM, found it. Well, I am sending a en email complaining and demanding a refund for my Flux Dev

craggy crest Nov 7, 2024, 4:34 AM

#

errant dust I must have missed the news. What update? NVM, found it. Well, I am sending a en...

waste of time most likely

errant dust Nov 7, 2024, 4:35 AM

#

Not a problem. I am a serial email complainer

#

(curiously got me two jobs at companies I was complaining to. True story.)

#

Maybe three, if I include writing for a newspaper back in 90s. Not sure that counts, and was not by email.

dusky thistle Nov 7, 2024, 4:42 AM

#

#

#

the great north american sandfish

bitter hearth Nov 7, 2024, 5:37 AM

#

FLUX1.1 [pro] – ultra mode: This option enables image generation at four times the resolution of standard FLUX1.1 [pro], without sacrificing prompt adherence. Unlike many high-resolution models that experience significant slowdowns at higher resolutions, our performance benchmarks show sustained fast generation times—over 2.5x faster than comparable high-resolution offerings. This model is available at a competitive price of $0.06 per image.that's the thing, their price for one single image is around the same as renting an RTX 3060 12GB for 30 minutes

dusky thistle Nov 7, 2024, 5:54 AM

#

untold valley Nov 7, 2024, 6:05 AM

#

dusky thistle

I have a big complaint.... this looks so good on discord but then you click iand the rocks/waterfalls so blocky. water still looks nice tho. but still bait n switch lol

dusky thistle Nov 7, 2024, 6:07 AM

#

untold valley I have a big complaint.... this looks so good on discord but then you click iand...

sounds like you need a new monitor lol

#

https://www.bestbuy.com/site/lg-signature-97-class-m3-series-oled-evo-4k-uhd-smart-webos-tv-with-wireless-connectivity-2023/6550242.p?skuId=6550242&utm_source=feed&ref=212&loc=TVsGeneralPREMIUM&gad_source=1&gclid=CjwKCAiAxKy5BhBbEiwAYiW--0Pi08Qp59SXEkGZO22ZdCy7eptwAaKnv23aIJKF51hDGn_thJbXeBoC56UQAvD_BwE&gclsrc=aw.ds this will help you see that image more clearly

Best Buy

LG SIGNATURE 97" Class M3 Series OLED evo 4K UHD Smart webOS TV wit...

Shop LG SIGNATURE 97" Class M3 Series OLED evo 4K UHD Smart webOS TV with Wireless Connectivity (2023) at Best Buy. Find low everyday prices and buy online for delivery or in-store pick-up. Price Match Guarantee.

untold valley Nov 7, 2024, 6:08 AM

#

lolololol the blocks

#

i have two screens

#

one is like a 14 or 17" POS, my main is a nice 27" IPS one

dusky thistle Nov 7, 2024, 6:09 AM

#

damn

#

i'm rockin dual ultrawide

#

got a 57" dual 4k and a 49"

untold valley Nov 7, 2024, 6:10 AM

#

we are not all millionaires sadcat

dusky thistle Nov 7, 2024, 6:10 AM

#

me either lol

#

#

i'm just a big believer in selling passenger doors off cars to fund more hardware purchase

#

dusky thistle Nov 7, 2024, 6:29 AM

#

#

dusky thistle Nov 7, 2024, 6:59 AM

#

noble coyote Nov 7, 2024, 7:35 AM

#

Flux and Silhuflowart2 LoRA

#

craggy crest Nov 7, 2024, 7:40 AM

#

untold valley I have a big complaint.... this looks so good on discord but then you click iand...

did you actually open the original image? or just look at discord's compressed version in their image viewer?

untold valley Nov 7, 2024, 7:44 AM

#

open in browser

dusky thistle Nov 7, 2024, 8:00 AM

#

looks like sandstone cliffs to me

#

got stuff like that in my area

#

the water gets into the sandstone and separates the layers through freeze thaw cycles over the years

robust echo Nov 7, 2024, 8:19 AM

#

An old 1960s vintage photograph album depicting a parked car in front of a house --ar 3:2

dusky thistle Nov 7, 2024, 8:22 AM

#

robust echo An old 1960s vintage photograph album depicting a parked car in front of a house...

Here is the image you requested.

untold valley Nov 7, 2024, 8:27 AM

#

Clownshark has became the Ai.

robust echo Nov 7, 2024, 8:39 AM

#

a Chinese man standing before a blue lake,ride a red mountain bike, wear a sunglass, on the plate refelcting the lake

dusky thistle Nov 7, 2024, 8:47 AM

#

robust echo a Chinese man standing before a blue lake,ride a red mountain bike, wear a sung...

Here is the image you requested.

#

#

#

#

muted dove Nov 7, 2024, 9:02 AM

#

dusky thistle

Damn you Scotty, you got the teleport coordinates wrong again!

craggy crest Nov 7, 2024, 9:06 AM

#

#

lone eagle Nov 7, 2024, 10:50 AM

#

We lie down together amid the cacti. The succulent leaves closest to the ground are a marbled grey, as if turned to stone, and we become absorbed by them, feeling our way around their rounded contours with our fingers. As we gaze up, following their odd tear-shaped forms bundled together, the sprawling double cypress tree – two trunks locked in an embrace – claims our attention with its swaying branches splitting into ever finer branches and twigs ending, here and there, in clusters of cones. Initially, we can’t really tell if it’s the wind that’s causing the canopy to stir and sway that way. It forms a dark shifting frame that we enter and get lost in as one does in a forest.

noble coyote Nov 7, 2024, 12:15 PM

#

Flux Turbo LoRA w/f with 2 x KSamplers, 2 x Upscale and Sharpen

#

bitter hearth Nov 7, 2024, 3:07 PM

#

the 8 step Flux Turbo LoRA gave me about as good an image as 1200 steps of Flux Dev

#

its so amazing

#

something about these new models means they can compress like that

sacred jewel Nov 7, 2024, 3:26 PM

#

rapid pivot Nov 7, 2024, 3:34 PM

#

I remember this prompt lmao

sacred jewel Nov 7, 2024, 3:54 PM

#

sacred jewel Nov 7, 2024, 3:54 PM

#

rapid pivot I remember this prompt lmao

Yeah, revisiting old prompts with new LoRAs 😉

signal shuttle Nov 7, 2024, 3:54 PM

#

Finally just started to train a 3.5 medium lora, its going pretty well, so far i noticed that it learns styles better then flux

craggy crest Nov 7, 2024, 4:38 PM

#

lone eagle We lie down together amid the cacti. The succulent leaves closest to the ground ...

if you're trying to generate that, and that's a really lousy prompt, you need to do so in the #artisan-faq channels

pseudo owl Nov 7, 2024, 5:46 PM

#

bitter hearth the 8 step Flux Turbo LoRA gave me about as good an image as 1200 steps of Flux ...

Which one? The one from alimama?

lunar canopy Nov 7, 2024, 5:46 PM

#

@noble coyote mind accepting friend request? catlook

bitter hearth Nov 7, 2024, 5:46 PM

#

pseudo owl Which one? The one from alimama?

yeah the alimama one

#

the hyper one is also very strong its not bad either

#

the bigger your final resolution, the less harm they do

#

so they do especially well for tiled upscale

pseudo owl Nov 7, 2024, 5:51 PM

#

bitter hearth yeah the alimama one

Yikes it’s amazing at prompt following. Even auraflow, flux dev can’t get this right most of the time

#

It nailed everything

#

Prompt: A white cat on top of a blue dog sitting on a brown couch in a living room. Behind them is a window with 4 cow pictures, one in each corner. Outside the window is outer space and a ufo.

bitter hearth Nov 7, 2024, 5:53 PM

#

often acceleration loras can improve various aspects

#

people often view them as always being worse

#

but TCD is substantially better than regular SDXL, for example

noble coyote Nov 7, 2024, 5:55 PM

#

Flux Turbo (8-step with LoRA) AliMaMa

noble coyote Nov 7, 2024, 5:56 PM

#

lunar canopy <@801511644944400414> mind accepting friend request? <:catlook:10244407998464819...

'Ullo?

pseudo owl Nov 7, 2024, 6:05 PM

#

Seems pretty impressive so far, both didn’t get it 100% right, I think flux dev might win at this one tho.
Prompt: A man holding a sign that says “This is the grand contest of the teacher or the student. Will 8 steps win or 30 steps?”

bitter hearth Nov 7, 2024, 6:13 PM

#

oh if you want accurate text that is probably a pretty big exception

craggy crest Nov 7, 2024, 6:22 PM

#

untold valley Nov 7, 2024, 6:27 PM

#

Ooooo Torcello big trouble.

#

sponging

bitter hearth Nov 7, 2024, 6:41 PM

#

I think using DiTs for text will not be the way to go soon

#

Omnigen-style models are much better for text

#

the huge downside of Omnigen models is they can't add noise to correct for past mistakes like we do with diffusion

craggy crest Nov 7, 2024, 6:43 PM

#

bitter hearth the huge downside of Omnigen models is they can't add noise to correct for past ...

not sure that's a problem

bitter hearth Nov 7, 2024, 6:43 PM

#

not sure

#

it may well not be a problem

#

we're using stuff like DPM++ 2SA, Euler A and Clownshark stuff on the Rect Flow models but you're not really "supposed" to

craggy crest Nov 7, 2024, 6:45 PM

#

bitter hearth we're using stuff like DPM++ 2SA, Euler A and Clownshark stuff on the Rect Flow ...

you know what "you're not supposed to" really means?

bitter hearth Nov 7, 2024, 6:45 PM

#

on Comfy server someone was saying, the big API providers like Fal aren't doing that

#

lol

craggy crest Nov 7, 2024, 6:45 PM

#

bitter hearth on Comfy server someone was saying, the big API providers like Fal aren't doing ...

not doing what?

bitter hearth Nov 7, 2024, 6:46 PM

#

using fancy stochastic solvers
they are likely using DPM++ 2M or heun or euler or something like that

craggy crest Nov 7, 2024, 6:46 PM

#

bitter hearth using fancy stochastic solvers they are likely using DPM++ 2M or heun or euler o...

for which model?

bitter hearth Nov 7, 2024, 6:46 PM

#

oh I mean for Flux and SD3/SD3.5

craggy crest Nov 7, 2024, 6:46 PM

#

i know what FAL is using for SD3.5 - i gave them the settings.

#

not sure about flux

bitter hearth Nov 7, 2024, 6:47 PM

#

on comfy server someone speculated that FAL have made a hand-written INT8 kernel
their Flux Dev endpoint is a lot cheaper and faster than their competition

craggy crest Nov 7, 2024, 6:47 PM

#

they might have

bitter hearth Nov 7, 2024, 6:48 PM

#

but yeah the point I was making was the research tends to just want simple samplers for these modern Rect Flow models
compared to diffusion where SDE/ancestral was favoured

craggy crest Nov 7, 2024, 6:48 PM

#

bitter hearth but yeah the point I was making was the research tends to just want simple sampl...

for 3.5, they're using: sampler: euler, scheduler: simple

#

cause that's what I told them to use

bitter hearth Nov 7, 2024, 6:48 PM

#

oh thanks that's really helpful

#

so we've finally reached the point where the trajectories are straight enough to use euler

#

it was gonna happen eventually

#

if your line is straight then euler is optimal

craggy crest Nov 7, 2024, 6:49 PM

#

bitter hearth so we've finally reached the point where the trajectories are straight enough to...

did you, or did you not, praise a euler image the other day?

bitter hearth Nov 7, 2024, 6:49 PM

#

haha

#

its swings and roundabouts

craggy crest Nov 7, 2024, 6:50 PM

#

;)

bitter hearth Nov 7, 2024, 6:50 PM

#

diffusion models tend to have curvy trajectories, is the issue

craggy crest Nov 7, 2024, 6:50 PM

#

plotting out the path through an image is worse than plotting the trajectory to hit the moon with a rocket from earth

bitter hearth Nov 7, 2024, 6:51 PM

#

its a pretty rough method yeah

craggy crest Nov 7, 2024, 6:51 PM

#

"getting better all the time"

bitter hearth Nov 7, 2024, 6:51 PM

#

GANs and VAEs don't do it they just jump in one go

#

apparently SD 1.5 VAE was almost a GAN anyway, there's a lot of overlap

#

I still find it funny that with diffusion we run the models backwards

craggy crest Nov 7, 2024, 8:15 PM

#

bitter hearth I still find it funny that with diffusion we run the models backwards

computers don't care which way is up

bitter hearth Nov 7, 2024, 8:18 PM

#

yeah they seem quite chill about it

craggy crest Nov 7, 2024, 8:21 PM

#

bitter hearth yeah they seem quite chill about it

as long as you don't drop them off the second floor balcony or something, they're happy with whatever

civic trail Nov 7, 2024, 9:00 PM

#

icy coral Nov 7, 2024, 9:20 PM

#

I don't know where the beef with the author of the P-word model originated from, but I was wondering if the team behind Illustrious decided to train a model on 3.5 would they be in a better position when it comes to obtaining a license?

bitter hearth Nov 7, 2024, 9:22 PM

#

fairly sure the situation was that Stability AI were busy and were behind in terms of dealing with the licenses

#

rather than them specifically denying a license

pseudo owl Nov 7, 2024, 9:25 PM

#

p-word is pony?

rapid pivot Nov 7, 2024, 9:27 PM

#

If that's it

#

I call stupid

#

lmao

signal shuttle Nov 7, 2024, 9:29 PM

#

icy coral I don't know where the beef with the author of the P-word model originated from,...

cagliostrolabs (Creators of AnimagineXL) announced that they had plans for a 3.5 fine tune. so far they are the only only ones who publicly announced that they plan on fine tuning 3.5. i believe that they will be the first to publish a full anime fine tune for 3.5 but i may be wrong, we will have to wait and see

icy coral Nov 7, 2024, 9:47 PM

#

bitter hearth fairly sure the situation was that Stability AI were busy and were behind in ter...

Someone should probably get in contact with him then, because it looks like he's pretty convinced he's being targeted somehow

untold valley Nov 7, 2024, 10:04 PM

#

@icy coral and everyone I guess, you need to think about optics with Pony models we are 100% talking about heavy NSFW and SAI or any company likely wants to steer 1000000000ft away from it. This is because in doing so you lose potential for future funding. It’s all about a business and money and lastly public perception.

#

What SAI needs is a adept PR person for retail.

signal shuttle Nov 7, 2024, 10:08 PM

#

Man flux ultra is so good, like what

icy coral Nov 7, 2024, 10:11 PM

#

untold valley <@426024298096885770> and everyone I guess, you need to think about optics with ...

I'm guessing the answer is no then

untold valley Nov 7, 2024, 10:11 PM

#

signal shuttle Man flux ultra is so good, like what

It’s generating nice images but can you run it in your own pc like sd3.5M? bobagirl

untold valley Nov 7, 2024, 10:12 PM

#

icy coral I'm guessing the answer is no then

No what?

signal shuttle Nov 7, 2024, 10:12 PM

#

untold valley It’s generating nice images but can you run it in your own pc like sd3.5M? <:bob...

No, but its fun to use while waiting for a major fine tune of SD 3.5

bitter hearth Nov 7, 2024, 10:13 PM

#

flux ultra is incredible quality but closed source models are absurdly expensive
for the price of one flux ultra image you can rent a 3060 for 30 minutes

signal shuttle Nov 7, 2024, 10:20 PM

#

MAN I REALLY WANT A OPENSOURCE MODEL THAT CAN DO SICK STUFF LIKE THIS

pseudo owl Nov 7, 2024, 10:22 PM

#

signal shuttle MAN I REALLY WANT A OPENSOURCE MODEL THAT CAN DO SICK STUFF LIKE THIS

prompt? I'm sure open models can do this with a bit of extra settings.

craggy crest Nov 7, 2024, 10:23 PM

#

@bitter hearth https://www.reddit.com/r/StableDiffusion/comments/1gm1kxm/ever_look_at_real_photos_and_see_ai_artefacts_in/

From the StableDiffusion community on Reddit: Ever look at real pho...

Explore this post and more from the StableDiffusion community

signal shuttle Nov 7, 2024, 10:23 PM

#

pseudo owl prompt? I'm sure open models can do this with a bit of extra settings.

my prompt was made with Claude "Vibrant anime key visual poster, "The Adventures of a Clumsy Female Knight!" prominently displayed in stylized Japanese text, featuring a cheerful young woman with messy blonde hair in oversized, ill-fitting armor stumbling forward, sword awkwardly held aloft, against a backdrop of a whimsical medieval castle and lush fantasy landscape, dynamic action lines emphasizing her clumsy movement, soft pastel color palette with pops of bright accents, highly detailed character design in the style of Studio Ghibli meets "KonoSuba", expressive eyes and exaggerated facial features conveying both determination and embarrassment"

craggy crest Nov 7, 2024, 10:24 PM

#

signal shuttle MAN I REALLY WANT A OPENSOURCE MODEL THAT CAN DO SICK STUFF LIKE THIS

standard anime lora can do that. all the base models can do that with the right prompt

signal shuttle Nov 7, 2024, 10:24 PM

#

craggy crest standard anime lora can do that. all the base models can do that with the right ...

Can they also do the text correctly?

craggy crest Nov 7, 2024, 10:24 PM

#

signal shuttle Can they also do the text correctly?

it's non-english characters, so it'll look correct but not be readable.

#

but we had this discussion yesterday

signal shuttle Nov 7, 2024, 10:25 PM

#

craggy crest it's non-english characters, so it'll look correct but not be readable.

Look above the non-english letters and you will see english words

#

Its small and in black

pseudo owl Nov 7, 2024, 10:25 PM

#

signal shuttle Its small and in black

the english sentence is not hard to do at all, its the detail which can be done with a lot of upscaling and refining.

craggy crest Nov 7, 2024, 10:26 PM

#

signal shuttle Look above the non-english letters and you will see english words

#

sd3.5 base, no loras

signal shuttle Nov 7, 2024, 10:27 PM

#

craggy crest sd3.5 base, no loras

Large?

craggy crest Nov 7, 2024, 10:27 PM

#

yes

#

prompt: a blond haired anime cartoon knight, happy, wild hair. across the top of the image is written the text "the adventures of a clumsy female knight"

bitter hearth Nov 7, 2024, 10:28 PM

#

craggy crest <@456226577798135808> https://www.reddit.com/r/StableDiffusion/comments/1gm1kxm/...

lol yeah looks like a bad AI generation

craggy crest Nov 7, 2024, 10:29 PM

#

craggy crest Nov 7, 2024, 10:29 PM

#

bitter hearth lol yeah looks like a bad AI generation

and yet, it's a photo - just goes to show it's really not possible to tell what is ai generated and what isn't, any more

bitter hearth Nov 7, 2024, 10:30 PM

#

yeah essentially not possible anymore

#

the impressive thing about Flux Pro Ultra is just the size without tiling, done within 10 seconds

#

it doesn't even need to be a new technology though, hand-optimised pipeline on Nvidia H200s, plus more finetuning and distillation shenanigans can go far

mortal mesa Nov 7, 2024, 10:38 PM

#

digital cameras(cell phones) don't necessarily take true pictures like film

bitter hearth Nov 7, 2024, 10:39 PM

#

yeah film was nicer IMO

craggy crest Nov 7, 2024, 10:50 PM

#

craggy crest Nov 7, 2024, 10:51 PM

#

bitter hearth yeah film was nicer IMO

course, there's the issue of the actual brains that are processing the data coming in through the eyes of the human that's viewing whatever it is. not everyone sees the same thing in the same way. re: that blue dress on the internet a couple of years ago

#

if you want to get really technical, you're not seeing the actual objects at all, just the light that is bouncing off them

river sleet Nov 7, 2024, 10:55 PM

#

You aren't even seeing that. You're seeing vague shapes and a tiny circle in the middle of your vision that's actually clear and sharp. Everything else is a hallucination made up by your brain.

craggy crest Nov 7, 2024, 10:55 PM

#

river sleet You aren't even seeing that. You're seeing vague shapes and a tiny circle in the...

yeah, the light is activating sensors and hopefully your brain will decode the data, and extrapolate in the same way everyone else's does. but what if it doesn't?

signal shuttle Nov 7, 2024, 10:56 PM

#

craggy crest Nov 7, 2024, 11:10 PM

#

bitter hearth yeah film was nicer IMO

bitter hearth Nov 7, 2024, 11:12 PM

#

nice, really sharp and good colours

mortal mesa Nov 7, 2024, 11:23 PM

#

the illusive leapordtigerpuma

craggy crest Nov 7, 2024, 11:24 PM

#

mortal mesa the illusive leapordtigerpuma

he was Dr. Moreau's kitty

mortal mesa Nov 7, 2024, 11:24 PM

#

hah nice

craggy crest Nov 7, 2024, 11:26 PM

#

he's self-polinating

pseudo owl Nov 7, 2024, 11:54 PM

#

Some interesting prompts with mochi

craggy crest Nov 8, 2024, 12:30 AM

#

rapid pivot Nov 8, 2024, 12:38 AM

#

pseudo owl Some interesting prompts with mochi

Trump, drugs and one interesting prompt

rapid pivot Nov 8, 2024, 12:38 AM

#

craggy crest

Twitter after the x accident sadcat

craggy crest Nov 8, 2024, 1:00 AM

#

@dusky thistle @bitter hearth https://youtu.be/EvmML-aFRsQ?si=Cw1f-PXzOSSzg7tn

YouTube

Purz

This Week In AI with Purz: Detail Daemon, Image Filters, Comfy Canv...

ComfyUI Detail Daemon
https://github.com/Jonseed/ComfyUI-Detail-Daemon

ComfyUI Image Filters
https://github.com/spacepxl/ComfyUI-Image-Filters

ComfyUI ComfyCanvas
https://github.com/taabata/ComfyCanvas

ComfyUI Resynthesizer
https://github.com/brayevalerien/ComfyUI-resynthesizer

ComfyUI Replicate API Nodes
https://github.com/replicate/comfyui...

▶ Play video

bitter hearth Nov 8, 2024, 3:05 AM

#

the canvas looks amazing

craggy crest Nov 8, 2024, 3:38 AM

#

bitter hearth the canvas looks amazing

yeah. so many comfy nodes I want to go download and fill my drive up with

#

bitter hearth Nov 8, 2024, 3:41 AM

#

I've essentially switched to diffusers/pytorch only at this point

#

I did enjoy comfy a lot though

craggy crest Nov 8, 2024, 3:52 AM

#

bitter hearth I did enjoy comfy a lot though

gonna miss out on all the toys ...

#

you know anyone that's made any 3.5 medium finetunes?

halcyon yarrow Nov 8, 2024, 3:54 AM

#

has anyone here tried mochi? I just tried it and it takes me 12 minutes with 8GB vram to render 31 frames or 2 seconds @ 15fps lol

bitter hearth Nov 8, 2024, 4:01 AM

#

seems too early for 3.5 medium finetunes but there's been some progress in koyha, ST communities etc apparently

#

people are split between more models now, and that's probably gonna get worse as time goes on

rapid pivot Nov 8, 2024, 4:08 AM

#

It's good and bad

craggy crest Nov 8, 2024, 4:12 AM

#

halcyon yarrow has anyone here tried mochi? I just tried it and it takes me 12 minutes with 8GB...

it's pretty promising

halcyon yarrow Nov 8, 2024, 4:12 AM

#

same reason i don't want BFL to release a flux-dev 1.1 it's just going to split the community further and invalidate all our work

#

14.75 minutes to render 73 frames aka 4.8 seconds

#

i'm going to keep increasing my frame count to find my max

craggy crest Nov 8, 2024, 4:13 AM

#

halcyon yarrow same reason i don't want BFL to release a flux-dev 1.1 it's just going to split ...

pretty sure the companies don't care about whether the community gets split or previous work invalidated. they're going to keep researching and releasing. and the community isn't going to care whether others are training, they're all going to do their own thing - as always

halcyon yarrow Nov 8, 2024, 4:14 AM

#

i can totally wait 15 minutes for a 5 second video clip if it's worthy and high quality

halcyon yarrow Nov 8, 2024, 4:14 AM

#

craggy crest pretty sure the companies don't care about whether the community gets split or p...

i agree, feelngs won't stop progress basically lol

#

and progress won't dictate communty sentiment

craggy crest Nov 8, 2024, 4:15 AM

#

yup

#

@bitter hearth

halcyon yarrow Nov 8, 2024, 4:16 AM

#

looks like something clown would make, good stuff

craggy crest Nov 8, 2024, 4:17 AM

#

halcyon yarrow looks like something clown would make, good stuff

skip layer tests for 3.5 medium. workflow is in each image if you want to play around with it

halcyon yarrow Nov 8, 2024, 4:19 AM

#

ive seen that in a few youtube videos and a few like "how tos" in civitai it's liike the trendy new thing to try, just picking at the model's layer to see how output changes given the same input, interesting stuff indeed

craggy crest Nov 8, 2024, 4:20 AM

#

halcyon yarrow ive seen that in a few youtube videos and a few like "how tos" in civitai it's l...

SLG - skip layer guidance - was released with 3.5 as a way of tuning the image and fixing some of the issues it might have.

mortal mesa Nov 8, 2024, 4:20 AM

#

sooo this is pretty wild [MOVIE-SHOTS] In an enchanting tale of nature's wonders, [SCENE-1] shows <Sophie> observing butterflies in a sunlit meadow, her expression one of awe and delight, [SCENE-2] transitioning to <Sophie> sketching the butterflies in her notebook, her brow furrowed in concentration, [SCENE-3] wrapping up with her lying back in the grass, gazing at the sky with a contented smile, surrounded by nature's beauty.

craggy crest Nov 8, 2024, 4:21 AM

#

you do need to make sure you're on the latest level of comfyUI though or you won't have the new node and the new scheduler

mortal mesa Nov 8, 2024, 4:21 AM

#

https://huggingface.co/ali-vilab/In-Context-LoRA

bitter hearth Nov 8, 2024, 4:22 AM

#

halcyon yarrow same reason i don't want BFL to release a flux-dev 1.1 it's just going to split ...

I don't think the community splitting is neccesarily bad

#

back with SD 1.5 and SDXL it was more needed to rely on this big ecosystem of fine tunes

#

but the models come out of the box stronger now, and loras can be done in 2 minutes on Fal

bitter hearth Nov 8, 2024, 4:26 AM

#

mortal mesa https://huggingface.co/ali-vilab/In-Context-LoRA

really cool paper, didn't realise Flux could be trained as an edit model so fast

halcyon yarrow Nov 8, 2024, 4:27 AM

#

just saying imagine if SDXL 1.5 came out or SDXL 2.0, I don't think we can compare SD 1.5 to SDXL since it's a complete archicture change, I'm talking about a company doing an incremental model release, it would suck to have to keep around 2 loras that amount to the same thing for 2 similar models becase they're not cross compatible

#

that's why i'm glad flux 1.1 was liimited to just pro so it doesn't fragment the community pool

bitter hearth Nov 8, 2024, 4:28 AM

#

I don't think its that big a deal anymore

#

people can mostly just use the base models now

#

and recreate loras themselves if needed as it can be done so fast

halcyon yarrow Nov 8, 2024, 4:30 AM

#

I guess storage is cheap enough where it's not that big of a deal but still sucks in other ways like having to keep 2 sets of vaes for each release. i just wish when flux does the next release it's a major release with significant changes that make sense to split the community for, i for once am not okay with incremental updates to major models

#

for SD3 vs SD3.5 it makes sense, SD3 was dead anyways and SD3.5 kicked it back to life, I'd consider 3.5 a worthy incremental change where they just messed with the training data and made everyone happy lol

mortal mesa Nov 8, 2024, 4:32 AM

#

bitter hearth really cool paper, didn't realise Flux could be trained as an edit model so fast

im trying them its unexpected to me

#

The four-panel image showcases a playful bubble font in a vibrant pop-art style. [TOP-LEFT] displays "Pop Candy" in bright pink with a polka dot background; [TOP-RIGHT] shows "Sweet Treat" in purple, surrounded by candy illustrations; [BOTTOM-LEFT] has "Yum!" in a mix of bright colors; [BOTTOM-RIGHT] shows "Delicious" against a striped background, perfect for fun, kid-friendly products.

bitter hearth Nov 8, 2024, 4:33 AM

#

halcyon yarrow I guess storage is cheap enough where it's not that big of a deal but still suck...

I wouldn't store stuff necessarily just download fresh when you need it

halcyon yarrow Nov 8, 2024, 4:35 AM

#

bitter hearth I wouldn't store stuff necessarily just download fresh when you need it

lol i'm like the hoarders of lora my collectioni is at 1.7TB 👼

bitter hearth Nov 8, 2024, 4:35 AM

#

I get what you are saying, it would be convenient for there to just be one big model with everything on it

#

1.7TB wow

#

okay yeah your perspective makes a lot more sense then

halcyon yarrow Nov 8, 2024, 4:36 AM

#

mortal mesa ```The four-panel image showcases a playful bubble font in a vibrant pop-art sty...

can you try the film one? I'd try it buy my GPU is busy rendering a video

#

I'm going to try this prompt as-is with that film-storyboard one

[MOVIE-SHOTS] In a vibrant festival, [SCENE-1] we find <Leo>, a shy boy, standing at the edge of a bustling carnival, eyes wide with awe at the colorful rides and laughter, [SCENE-2] transitioning to him reluctantly trying a daring game, his friends cheering him on, [SCENE-3] culminating in a triumphant moment as he wins a giant stuffed bear, his face beaming with pride as he holds it up for all to see.

CivitAI would have a field day if someone reposts those LORAs on there

#

this is exactly what CivitAI loves the 4-panel story board, film board, side by side concepts and it's not available there yet

mortal mesa Nov 8, 2024, 4:38 AM

#

halcyon yarrow can you try the film one? I'd try it buy my GPU is busy rendering a video

the first one was film lora

halcyon yarrow Nov 8, 2024, 4:39 AM

#

oh i see thanks for the heads up, you did a good job maintaining the same syntax they asked for but i feel Sophie iisn't consistent in every frame

mortal mesa Nov 8, 2024, 4:39 AM

#

portrait-illustration.safetensors This two-panel image presents a transformation from a realistic portrait to a playful illustration, capturing both detail and artistic flair; [LEFT] the photograph shows a woman standing in a bustling marketplace, wearing a wide-brimmed hat, a flowing bohemian dress, and a leather crossbody bag; [RIGHT] the illustration panel exaggerates her accessories and features, with the bohemian dress depicted in vibrant patterns and bold colors, while the background is simplified into abstract market stalls, giving the scene an animated and lively feel.

halcyon yarrow Nov 8, 2024, 4:40 AM

#

crossbody bag you say? oh you must mean crosseyed hag lol

#

85 frames seems to be my max @ 24 minutes for essentially 5.6 seconds at 15fps

untold valley Nov 8, 2024, 6:10 AM

#

finetunes taking forever fr, way too slow. lol/s but the higher we go in params the slower things will go.

#

need that nsfw fix from time to time....

dusky thistle Nov 8, 2024, 7:49 AM

#

civic trail Nov 8, 2024, 8:05 AM

#

dusky thistle Nov 8, 2024, 8:07 AM

#

untold valley Nov 8, 2024, 8:18 AM

#

#

I think im happy w faces.

dusky thistle Nov 8, 2024, 9:21 AM

#

#

#

#

muted dove Nov 8, 2024, 9:31 AM

#

dusky thistle

https://tenor.com/view/indiana-jones-indiana-balls-sisif-sisifus-rolling-gif-17011737

Tenor

dusky thistle Nov 8, 2024, 9:31 AM

#

#

#

#

#

#

all SD3.5M

#

#

my mona lisa

#

#

#

#

#

pseudo owl Nov 8, 2024, 12:43 PM

#

This is pretty amazing for quantization of models like flux/sd3.5. It uses less vram then bnb4bit while being similar quality to 8bit. It’s also far faster then bnb4bit. Works well on smaller models like pixart too.

https://github.com/mit-han-lab/nunchaku

GitHub

GitHub - mit-han-lab/nunchaku: SVDQuant: Absorbing Outliers by Low-...

SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models - mit-han-lab/nunchaku

#

You can also apply Lora’s on it while you can’t on bnb4bit

turbid grotto Nov 8, 2024, 2:28 PM

#

pseudo owl This is pretty amazing for quantization of models like flux/sd3.5. It uses less ...

can it already be implemented in comfy? waow

craggy crest Nov 8, 2024, 3:56 PM

#

untold valley finetunes taking forever fr, way too slow. lol/s but the higher we go in params ...

that's why you create LoRAs instead of full finetuned checkpoints

craggy crest Nov 8, 2024, 3:57 PM

#

dusky thistle

gravity fails

untold valley Nov 8, 2024, 4:02 PM

#

craggy crest that's why you create LoRAs instead of full finetuned checkpoints

and fight the actual model with concepts it does not inherently knows a semblance of, and have to have stacks on stacks of loras, nah. There is a need for better finetunes, like all Sai models that have been released to the public have mostly been designed for. Training Lora's for hundreds of characters and styles and everything in between vs training 1 model, seems theres a clear winner.

craggy crest Nov 8, 2024, 4:04 PM

#

untold valley and fight the actual model with concepts it does not inherently knows a semblanc...

the reason for a LoRA -low rank adaptation - is so that you can update the model with new or revised information and not spend hours or weeks, and hundred or thounds of dollars doing a retrain on the entire model

#

a full fine tuned check point is going to be vastly more expensive and take a whole lot longer time

untold valley Nov 8, 2024, 4:11 PM

#

loras are great fill a void/niche/gaps but its nicer when the model inherently understands fundamentals. SD3.5 is a superb base. a finetune would be wonderful to train other loras on.

#

And then SDXL went on to be wildly successful. Im hoping the same for 3.5

dusky thistle Nov 8, 2024, 4:33 PM

#

bitter hearth Nov 8, 2024, 4:40 PM

#

untold valley And then SDXL went on to be wildly successful. Im hoping the same for 3.5

I feel like the quote is kinda backwards

#

for a base model is good to be very underfit

#

so that fine tunes have more room to change the model

untold valley Nov 8, 2024, 4:41 PM

#

thats what quote saying.

bitter hearth Nov 8, 2024, 4:41 PM

#

is it?

#

I read it as saying the opposite

limpid thunderBOT Nov 8, 2024, 4:42 PM

#

Last 7 days <Nov 01 2024> → <Nov 07 2024>

Member counts
345955 ↗ 345969 ↘ 345955 ↘ 345930 ↗ 345948 ↗ 345967 ↗ 345976
Action members
0 → 0 → 0 → 0 → 0 → 0 ↗ 85
Message members
0 → 0 → 0 → 0 → 0 → 0 ↗ 58
Reaction members
0 → 0 → 0 → 0 → 0 → 0 ↗ 43
More details

Summary | comcom Analytics

comcom analytics は、Discord または Slack 上で運営されているコミュニティを分析・モニタリングできる完全無料のダッシュボードです。現在、パブリックにβ版を提供しています。

untold valley Nov 8, 2024, 4:42 PM

#

saying you need good base model, not too over trained so community finetunes.

dusky thistle Nov 8, 2024, 4:42 PM

#

Yeah I think it's just phrased a little oddly

bitter hearth Nov 8, 2024, 4:43 PM

#

maybe, not sure
I never really like trying to work out what past ambiguous quotes are saying

#

anyway if he means it should be underfit I agree

dusky thistle Nov 8, 2024, 4:43 PM

#

Yeah I think that's what he means

#

That it won't look great cuz it's designed to be a little undercut

#

Underfit

bitter hearth Nov 8, 2024, 4:44 PM

#

Flux was somewhat helpful in that it taught the population that if the model is overfit its hard to get rid of the Flux Chin

untold valley Nov 8, 2024, 4:45 PM

#

bitter hearth Flux was somewhat helpful in that it taught the population that if the model is ...

lololol just yesterday I saw someone getting mad on the butt chin from Flux, ROFL

bitter hearth Nov 8, 2024, 4:45 PM

#

lol yeah

#

some of the realism loras are good though

#

sadly I didn't save the workflow but I found one that works much better than the others

#

this same debate about fit happened in LLM world when Phi models came

#

cos Phi models are underfit so they are bad on release

#

but for example Omnigen is based on Phi, they are really useful

untold valley Nov 8, 2024, 4:51 PM

#

bitter hearth sadly I didn't save the workflow but I found one that works much better than the...

time to make your own lol, sometimes i feel its more fun messing with settings than actually gening stuff

bitter hearth Nov 8, 2024, 4:51 PM

#

oh yeah this was my own workflow

#

I spent $10 on L40s making it

#

then forgot to save it lol

#

all I have is this screenshot

untold valley Nov 8, 2024, 4:54 PM

#

should be enought info there to remake it no?

bitter hearth Nov 8, 2024, 4:55 PM

#

almost yeah
sadly the nodes to put the settings up top were wired up a bit wrong so it is missing some
but most are there

#

I feel very lucky I put that label up top and the graphs on the right

#

I would have nothing otherwise

hallow lion Nov 8, 2024, 4:56 PM

#

bitter hearth all I have is this screenshot

Hey Fartoo.

dusky thistle Nov 8, 2024, 4:57 PM

#

bitter hearth Nov 8, 2024, 4:58 PM

#

hallow lion Hey Fartoo.

yeah 🙂

#

base Flux can't do him well but with realism lora Flux can

untold valley Nov 8, 2024, 4:59 PM

#

bitter hearth base Flux can't do him well but with realism lora Flux can

have you tried 3.5 see how it does with him?

hallow lion Nov 8, 2024, 5:00 PM

#

r2D2 frollic in grass

#

https://tenor.com/view/padme-anakin-smile-love-gif-17180376

Tenor

bitter hearth Nov 8, 2024, 5:01 PM

#

untold valley have you tried 3.5 see how it does with him?

not yet I think I made galaxy bottle with 3.5

dusky thistle Nov 8, 2024, 5:10 PM

#

hallow lion Nov 8, 2024, 5:16 PM

#

happemad

dusky thistle Nov 8, 2024, 5:26 PM

#

hallow lion Nov 8, 2024, 5:31 PM

#

Usual Hogwarts report.

dusky thistle Nov 8, 2024, 5:34 PM

#

hallow lion Nov 8, 2024, 5:53 PM

#

Clownsharks images are like a near death experience

#

couple of days ago I dreamt of nuclear blasts.

#

https://tenor.com/view/fifth-element-milla-jovovich-leeloo-bada-boom-boom-gif-9053853716468720447

Tenor

craggy crest Nov 8, 2024, 6:06 PM

#

untold valley loras are great fill a void/niche/gaps but its nicer when the model inherently u...

yes, well, you said "finetunes taking forever fr, way too slow." and my response was telling you why. they are going to take a long time, because you're trying to update the entire model, not just creating a small set of information that'll be used with the model at runtime. even with highpowered gpus it's going to take quite a while to do this

fleet meteor Nov 8, 2024, 6:18 PM

#

dusky thistle

Nice gen!

untold valley Nov 8, 2024, 6:23 PM

#

craggy crest yes, well, you said "finetunes taking forever fr, way too slow." and my response...

I ordered a pizza and all i got was the pepperoni. bobagirl thomas sadcat

runic tusk Nov 8, 2024, 6:25 PM

#

I ordered a frappuccino, where's my fuckin' frappuccino.

craggy crest Nov 8, 2024, 6:27 PM

#

untold valley I ordered a pizza and all i got was the pepperoni. <:bobagirl:104533832760532176...

should be some cheese in your fridge

noble coyote Nov 8, 2024, 6:34 PM

#

Here's my photo! Sign me up!!! 😄

#

SD3.5 L Turbo

#

mortal mesa Nov 8, 2024, 6:38 PM

#

The CogVideoX1.5-5B series model supports 10-second videos and higher resolutions. The CogVideoX1.5-5B-I2V variant supports any resolution for video generation.

untold valley Nov 8, 2024, 6:39 PM

#

@lunar canopy sry ping but dont know if server has mods, this is a scam.

#

Actually am not sorry ping bobagirl catlurk

mortal mesa Nov 8, 2024, 6:42 PM

#

you come here from the SAI webpage then get fished

#

like this

untold valley Nov 8, 2024, 6:48 PM

#

Oh that guy spam posting everywhere

#

Where the mods

#

catlurk

mortal mesa Nov 8, 2024, 6:51 PM

#

abandoned ship

hallow lion Nov 8, 2024, 6:53 PM

#

Titanic is sinking.

cunning lintel Nov 8, 2024, 6:53 PM

#

This discord is so wholesome it needs no mods 🤣
Sucks btw, this used to be a nice place, now i still hope to find some tidbits of interesting talk/images here, but quite often i think "ignore, ignore, don't get dragged in"

hallow lion Nov 8, 2024, 6:55 PM

#

lol

craggy crest Nov 8, 2024, 6:57 PM

#

with SLG - skipping specific layers to adjust the details, and without slg

craggy crest Nov 8, 2024, 6:58 PM

#

mortal mesa you come here from the SAI webpage then get fished

just ping @lunar canopy if there are issues you need help with

bitter hearth Nov 8, 2024, 7:00 PM

#

cunning lintel This discord is so wholesome it needs no mods 🤣 Sucks btw, this used to be a n...

AI image generation communities are very toxic compared to literally any other area of machine learning

craggy crest Nov 8, 2024, 7:00 PM

#

bitter hearth AI image generation communities are very toxic compared to literally any other a...

fulll of emotionally charged creative people - which is usually a recipe for toxic

bitter hearth Nov 8, 2024, 7:01 PM

#

yes plus the NSFW community

#

I never see drama in the more dry areas of ML like communities for graph neural networks or time series analysis models

#

its always in image or chatbot communities

mortal mesa Nov 8, 2024, 7:02 PM

#

craggy crest just ping <@729066661029871638> if there are issues you need help with

they need help not me

sterile pendant Nov 8, 2024, 7:03 PM

#

bitter hearth AI image generation communities are very toxic compared to literally any other a...

Yeah because they are a hotbed for people on the spectrum. Lots of binary logic/rigid thinking.

craggy crest Nov 8, 2024, 7:03 PM

#

bitter hearth I never see drama in the more dry areas of ML like communities for graph neural ...

scientsts tend to only get emotional if some other scientist argues with their research

#

and then they get very emotional

mortal mesa Nov 8, 2024, 7:03 PM

#

that is %1000 wrong

craggy crest Nov 8, 2024, 7:03 PM

#

mortal mesa that is %1000 wrong

you're not a scientist so how would you know

mortal mesa Nov 8, 2024, 7:04 PM

#

i have a brain

craggy crest Nov 8, 2024, 7:04 PM

#

mortal mesa i have a brain

you should know better than to give people openings

mortal mesa Nov 8, 2024, 7:04 PM

#

so you should cry when you do bad science to get people to leave you alone right

craggy crest Nov 8, 2024, 7:05 PM

#

you are, apparently, in need of coffee this morning

bitter hearth Nov 8, 2024, 7:05 PM

#

mortal mesa so you should cry when you do bad science to get people to leave you alone right

I forgot how your feud with crystal started
was it over the SD3 medium release?

mortal mesa Nov 8, 2024, 7:05 PM

#

you said one of the craziest things ive heard involving science

mortal mesa Nov 8, 2024, 7:09 PM

#

bitter hearth I forgot how your feud with crystal started was it over the SD3 medium release?

no feud, they say crazy ass stuff, wrong often, opinionated to the point of attacking people with diffent opinions, literally the only person ive seen warned, once of the few ive seen attack people. Its like the passive agressive help bot you never wanted. a social media "reputation farmer" it seems, shit is weird, most blocked person

#

society is wrong i suppose

craggy crest Nov 8, 2024, 7:10 PM

#

mortal mesa you said one of the craziest things ive heard involving science

maybe, but your normally zinging rebuttals are lacking this morning

mortal mesa Nov 8, 2024, 7:10 PM

#

you broke sd3 by demending it came out as a huge member of the community

#

the clips you have to talk to the clips

craggy crest Nov 8, 2024, 7:10 PM

#

and he dives down a radom rabbit hole

mortal mesa Nov 8, 2024, 7:11 PM

#

nah i just get annoyed

#

i dont expect anything of anyone here

craggy crest Nov 8, 2024, 7:11 PM

#

how did you go from talking about scientsts to SD3?

mortal mesa Nov 8, 2024, 7:11 PM

#

you could read, its right here

craggy crest Nov 8, 2024, 7:12 PM

#

you went right from talking about rabid scientists yelling at each other directly to SD3 being broken, without even a transistion

mortal mesa Nov 8, 2024, 7:12 PM

#

i dont hate you i just often dont like you

craggy crest Nov 8, 2024, 7:12 PM

#

mortal mesa i dont hate you i just often dont like you

i don't hate you or dislike you, but this morning you are really confusing

mortal mesa Nov 8, 2024, 7:13 PM

#

maybe put it into a LLM to explain

hallow lion Nov 8, 2024, 7:13 PM

#

🍿 popcorn?

craggy crest Nov 8, 2024, 7:13 PM

#

naw, i'm just gonna send you some coffee

mortal mesa Nov 8, 2024, 7:13 PM

#

ty

hallow lion Nov 8, 2024, 7:13 PM

#

maybe wizard is just misunderstood...

craggy crest Nov 8, 2024, 7:14 PM

#

i thought you blocked me, mr. coin

hallow lion Nov 8, 2024, 7:14 PM

#

im willing to give a second chance to everyone

#

and its miss

bitter hearth Nov 8, 2024, 7:19 PM

#

mortal mesa no feud, they say crazy ass stuff, wrong often, opinionated to the point of atta...

I find it hard to criticise people for being wrong when a lot of the advice I gave about image models turned out to be wrong as I learned more about them

#

there's a lot of stuff I said earlier in the year which I now think is cringe because I understand the math better

hallow lion Nov 8, 2024, 7:22 PM

#

Right, we're all friends here!

#

Models come and go

mortal mesa Nov 8, 2024, 7:22 PM

#

you all ruined sd3

hallow lion Nov 8, 2024, 7:22 PM

#

hypemad

untold valley Nov 8, 2024, 7:28 PM

#

Hello

#

Did I miss participating in the drama 🎭

#

catlurk bobagirl

craggy crest Nov 8, 2024, 7:31 PM

#

mortal mesa you all ruined sd3

no one's stopping you from using somethign other than sd3

hallow lion Nov 8, 2024, 7:31 PM

#

nah

craggy crest Nov 8, 2024, 7:31 PM

#

untold valley Hello

wasn't much. everyone's asleep

mortal mesa Nov 8, 2024, 7:31 PM

#

craggy crest no one's stopping you from using somethign other than sd3

no one asked

hallow lion Nov 8, 2024, 7:31 PM

#

very mild popcorn moment

craggy crest Nov 8, 2024, 7:32 PM

#

untold valley Nov 8, 2024, 7:32 PM

#

Wait why are Kagi and Crystal arguing? Don’t y’all share same opinions?

craggy crest Nov 8, 2024, 7:32 PM

#

untold valley Wait why are Kagi and Crystal arguing? Don’t y’all share same opinions?

we're not?

#

jsut banter

mortal mesa Nov 8, 2024, 7:33 PM

#

ya over crying scientists lol

craggy crest Nov 8, 2024, 7:33 PM

#

i actually respect Kagi's opinion quite a bit. but we do lock horns at times.

mortal mesa Nov 8, 2024, 7:34 PM

#

im a mere mortal

bitter hearth Nov 8, 2024, 7:35 PM

#

untold valley Wait why are Kagi and Crystal arguing? Don’t y’all share same opinions?

fairly sure it started over the SD3 release

craggy crest Nov 8, 2024, 7:36 PM

#

bitter hearth fairly sure it started over the SD3 release

eh, we've been tradeing barbs longer than that i think

bitter hearth Nov 8, 2024, 7:36 PM

#

ah okay

#

I find it hard to keep track of the drama

craggy crest Nov 8, 2024, 7:36 PM

#

bitter hearth I find it hard to keep track of the drama

you can't complain of being bored here ;)

untold valley Nov 8, 2024, 7:37 PM

#

Idk but crystal likes gaslighting the community over sd3 failure and it seems kagi also thinks it catlurk
Good thing 3.5 was success imo mostly cuz they removed some training restrictions.

craggy crest Nov 8, 2024, 7:37 PM

#

untold valley Idk but crystal likes gaslighting the community over sd3 failure and it seems ka...

no, i do not. i don't gaslight, ever, at all.

untold valley Nov 8, 2024, 7:37 PM

#

thomas bobagirl

craggy crest Nov 8, 2024, 7:38 PM

#

you're more than welcome to go back through several months of posts and read what the commuinty was saying between march of this year and when SD3 releaed - and read the extreme toxic whining that was constantly going on

#

they got what they asked for and demanded

untold valley Nov 8, 2024, 7:39 PM

#

Ok we can go back to posting gens.

craggy crest Nov 8, 2024, 7:39 PM

#

sure.

untold valley Nov 8, 2024, 7:39 PM

#

waow

hallow lion Nov 8, 2024, 7:39 PM

#

happemad

untold valley Nov 8, 2024, 7:40 PM

#

@craggy crest figured out to make texture better. Add node to add grain to hide it sadcat

craggy crest Nov 8, 2024, 7:40 PM

#

that looks really good

lunar canopy Nov 8, 2024, 7:40 PM

#

nicest fall gens get to be server banner catlurk

#

or...er winter?

hallow lion Nov 8, 2024, 7:40 PM

#

Ye it does, still amazes me how we can get stuff like this by typing words.

craggy crest Nov 8, 2024, 7:41 PM

#

lunar canopy nicest fall gens get to be server banner <:catlurk:1017871526978125854>

i had to read that twice. first time i thought it said nicest fall gens get banned

lunar canopy Nov 8, 2024, 7:41 PM

#

that'd be fun

#

spookycat

untold valley Nov 8, 2024, 7:42 PM

#

Winter so like igloos and penguins catlurk

craggy crest Nov 8, 2024, 7:43 PM

#

mortal mesa Nov 8, 2024, 7:44 PM

#

i dont lora much but i saw a 2gb flux and 900mb sd3.5 lora, is this a new normal

hallow lion Nov 8, 2024, 7:44 PM

#

lol 2gb flux?

mortal mesa Nov 8, 2024, 7:44 PM

#

ya its kinda nice too

untold valley Nov 8, 2024, 7:45 PM

#

Good lord

bitter hearth Nov 8, 2024, 7:45 PM

#

untold valley <@407561236339752981> figured out to make texture better. Add node to add grain ...

I like adding grain yeah

mortal mesa Nov 8, 2024, 7:45 PM

#

hallow lion lol 2gb flux?

https://civitai.com/models/796382/ultrarealistic-lora-project

craggy crest Nov 8, 2024, 7:46 PM

#

bitter hearth Nov 8, 2024, 7:47 PM

#

there's people doing Lycoris Lokrs rather than Loras for Flux also

#

there's some on Civit if you want to try them

lunar canopy Nov 8, 2024, 7:50 PM

#

untold valley Good lord

banned....

untold valley Nov 8, 2024, 7:51 PM

#

Ty

#

bobagirl

bitter hearth Nov 8, 2024, 7:51 PM

#

it happens quite often that someone comes and puts their prompt in public like its midjourney, and the prompt is either super-NSFW or crazy

hallow lion Nov 8, 2024, 7:51 PM

#

Good riddance

bitter hearth Nov 8, 2024, 7:56 PM

#

midjourney requiring discord probably cost them hundreds of millions of dollars
most bizarre business error

#

they kept saying it is too much work to make a backend which makes no sense

#

it would have cost them way less than the cost of training the model

craggy crest Nov 8, 2024, 7:58 PM

#

bitter hearth midjourney requiring discord probably cost them hundreds of millions of dollars ...

why would it have cost them anything?

bitter hearth Nov 8, 2024, 7:59 PM

#

what I was thinking was server hosting cost plus the wages of a few developers

craggy crest Nov 8, 2024, 8:06 PM

#

bitter hearth what I was thinking was server hosting cost plus the wages of a few developers

you don't pay discord a cent to use their servers. what server hosting?

bitter hearth Nov 8, 2024, 8:07 PM

#

I mean if Midjourney had made an external website on day one

#

it would have been better, but they would have had to pay server costs and dev costs

#

but I think it would have paid off massively because so many people avoid midjourney because you had to use the discord

craggy crest Nov 8, 2024, 8:08 PM

#

bitter hearth I mean if Midjourney had made an external website on day one

david wasn't trying to make money, still isnt'

bitter hearth Nov 8, 2024, 8:09 PM

#

oh I see, is it one of those companies that is aiming for goals other than money?

#

wasn't aware of that

craggy crest Nov 8, 2024, 8:10 PM

#

bitter hearth oh I see, is it one of those companies that is aiming for goals other than money...

he's a researcher. midjourney is still a research project

bitter hearth Nov 8, 2024, 8:10 PM

#

ah okay yeah that makes sense

noble coyote Nov 8, 2024, 8:20 PM

#

lunar canopy banned....

I left you some pictures 🙂

craggy crest Nov 8, 2024, 8:21 PM

#

halcyon yarrow Nov 8, 2024, 8:26 PM

#

craggy crest

is this generated liike that, stiched together or using that new lora Kagi showed off yesterday?

#

@mortal mesa thanks for showing off that in-context lora project that thing is really cool man

craggy crest Nov 8, 2024, 8:27 PM

#

halcyon yarrow is this generated liike that, stiched together or using that new lora Kagi showe...

generated. one shot. sd3.5

halcyon yarrow Nov 8, 2024, 8:28 PM

#

really cool, did you speciify for 5 frames or did it just choose 5 on it's own?

craggy crest Nov 8, 2024, 8:28 PM

#

prompt was: happyness: colorful autumn trees by artist "Shaun Tan", by artist "Mab Graves", by artist "Rien Poortvliet"

halcyon yarrow Nov 8, 2024, 8:28 PM

#

ok that answers a lot

craggy crest Nov 8, 2024, 8:28 PM

#

i didn't, but i used several names, and stable likes to create sections when you do that

halcyon yarrow Nov 8, 2024, 8:28 PM

#

were you messing with the clip skip layers thing when you made it?

mortal mesa Nov 8, 2024, 8:28 PM

#

ya its cool stuff, the loras are attempting more consistency

craggy crest Nov 8, 2024, 8:29 PM

#

halcyon yarrow were you messing with the clip skip layers thing when you made it?

nope. no slg turned on. that's medium anyway and this is sd3.5 large

halcyon yarrow Nov 8, 2024, 8:29 PM

#

mortal mesa ya its cool stuff, the loras are attempting more consistency

there's also some effects ones in there, all good stuff

#

@mortal mesa I had a feeling the CivitAI community was going to love those loras so I reposted it there and iit's already gotten almost 100 downloads in less than 5 hours https://civitai.com/models/929592/creative-effects-and-design-lora-pack-in-context-lora

craggy crest Nov 8, 2024, 8:36 PM

#

@bitter hearth no SLG vrs skiping layers 8,10,18,23,22 at scale 1

bitter hearth Nov 8, 2024, 8:37 PM

#

wow it makes it brighter

#

I thought it would only help structure but it goes beyond that

mortal mesa Nov 8, 2024, 8:39 PM

#

halcyon yarrow <@401839506493538304> I had a feeling the CivitAI community was going to love th...

wild

untold valley Nov 8, 2024, 8:40 PM

#

bitter hearth I thought it would only help structure but it goes beyond that

It afecta everything it’s directly messing with how the model creates images outa noise

bitter hearth Nov 8, 2024, 8:41 PM

#

yeah makes sense

craggy crest Nov 8, 2024, 8:41 PM

#

bitter hearth I thought it would only help structure but it goes beyond that

vastly. it does a lot more than just adjust struction.

bitter hearth Nov 8, 2024, 8:41 PM

#

this means you've gotta start liking negatives though, cos this works via the negative 😂

craggy crest Nov 8, 2024, 8:41 PM

#

bitter hearth this means you've gotta start liking negatives though, cos this works via the ne...

you have no idea just how right you are

#

but with this you can be exact. with negatives, it's hard to be this exact

#

not only do you skip layers, you adjust how much, and you have other values. you can pin point

bitter hearth Nov 8, 2024, 8:42 PM

#

PAG and SAG work on the negative too
SAG blurs the subject in the negative and PAG scrambles it

craggy crest Nov 8, 2024, 8:43 PM

#

this isn't using negative prompts

bitter hearth Nov 8, 2024, 8:43 PM

#

yeah I mean it uses the negative prediction

#

SLG drops layers when making the negative prediction but keeps them when making the positive

#

dropping the layers makes the image worse, which is okay as CFG pushes us away from the negative

#

its very weird

craggy crest Nov 8, 2024, 8:44 PM

#

bitter hearth yeah I mean it uses the negative prediction

um - sort of but not quite

bitter hearth Nov 8, 2024, 8:44 PM

#

ah maybe I misunderstood it, haven't looked in detail at the code yet

craggy crest Nov 8, 2024, 8:45 PM

#

bitter hearth ah maybe I misunderstood it, haven't looked in detail at the code yet

I can't say any more than that. at some point, development might post something.

civic trail Nov 8, 2024, 8:45 PM

#

bitter hearth Nov 8, 2024, 8:47 PM

#

craggy crest I can't say any more than that. at some point, development might post something.

maybe they could post
... an Arxiv paper 😄

craggy crest Nov 8, 2024, 8:48 PM

#

bitter hearth maybe they could post ... an Arxiv paper 😄

you know - as many things as have shown up on arXiv in the last 2 years that were junk, i think it's better everyone move their papers to huggingface

bitter hearth Nov 8, 2024, 8:48 PM

#

there are some bad arxiv papers out there yeah

craggy crest Nov 8, 2024, 8:50 PM

#

bitter hearth there are some bad arxiv papers out there yeah

there are more than 'some'

#

huggingface is really doing a nice job of currating papers now

#

(we're back to carrying on a single conversion in mutiple channels ;) )

bitter hearth Nov 8, 2024, 8:53 PM

#

AI moves so fast that

#

it might be too fast for curation

craggy crest Nov 8, 2024, 8:54 PM

#

bitter hearth it might be too fast for curation

huggingface doesn't seem to ahve any issues

bitter hearth Nov 8, 2024, 9:00 PM

#

not sure

craggy crest Nov 8, 2024, 9:01 PM

#

mortal mesa Nov 8, 2024, 9:01 PM

#

speaking of papers https://hanlab.mit.edu/projects/svdquant

SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffu...

A new W4A4 quantization paradigm for diffusion models.

#

not SVD as in SAI SVD

pseudo owl Nov 8, 2024, 9:18 PM

#

mortal mesa speaking of papers https://hanlab.mit.edu/projects/svdquant

Yeah that quant is pretty amazing, similar quality to 8bit while being much faster then bnb4bit and using less vram. It can also work with loras, no need to requant(unlike bnb4bit)

They have a space to compare,
Flux.1 Schnell bf16 vs Flux.1 Schnell SVD quant
prompt: A man holding a sign that says “Is 4bit quant better or the full bf16 model?”

bitter hearth Nov 8, 2024, 9:25 PM

#

 We will replace unsafe prompts with a default prompt: "A peaceful world."```LMAO

mortal mesa Nov 8, 2024, 9:25 PM

#

the demo loads the Gemma-2B model as a safety checker by default. To disable this feature, use --no-safety-checker

bitter hearth Nov 8, 2024, 9:27 PM

#

bare in mind the biggest speedups are gonna be with the FP4 version not the Int4 one

#

and the FP4 version doesn't look as good as the Int4 one

#

especially with RTX 5090 which will have native FP4 matmul acceleration

noble coyote Nov 8, 2024, 9:30 PM

#

Wheeeeeeeeee https://x.com/StabilityAI/status/1854989620154712474

Stability AI (@StabilityAI) on X

Straight from the Stable Diffusion Discord, @JackTorcello1 captures a raw, surrealistic creation using SD3.5 Large Turbo ⚡(1/3)

craggy crest Nov 8, 2024, 9:31 PM

#

bitter hearth ```Notice: We will replace unsafe prompts with a default prompt: "A peaceful wo...

craggy crest Nov 8, 2024, 9:32 PM

#

noble coyote Wheeeeeeeeee https://x.com/StabilityAI/status/1854989620154712474

YAY!

noble coyote Nov 8, 2024, 9:32 PM

#

googlecat

craggy crest Nov 8, 2024, 9:32 PM

#

noble coyote <:googlecat:1006082182995005512>

kitty!

craggy crest Nov 8, 2024, 9:48 PM

#

dull socket Nov 8, 2024, 9:52 PM

#

untold valley Nov 9, 2024, 12:35 AM

#

noble coyote Wheeeeeeeeee https://x.com/StabilityAI/status/1854989620154712474

GGs

#

blush

craggy crest Nov 9, 2024, 12:37 AM

#

#

halcyon yarrow Nov 9, 2024, 12:56 AM

#

@short thicket have you been working on any new models? i'm waitinig for the booru-free release

halcyon yarrow Nov 9, 2024, 2:02 AM

#

mortal mesa speaking of papers https://hanlab.mit.edu/projects/svdquant

cool share, thanks for the link i just went through it but a few points

where can I download the svd quant version of flux dev?
I'm thinking it only runs through scripts right now there isn't a comfyui node to load these models?
i can't find the script or program that will allow me to take any safetensors or gguf file and quantsize it using their technique

craggy crest Nov 9, 2024, 2:23 AM

#

without SLG

#

vrs With SLG

mortal mesa Nov 9, 2024, 2:28 AM

#

halcyon yarrow cool share, thanks for the link i just went through it but a few points - where ...

links to code and files they offer are here, no for comfy as far as i know https://huggingface.co/mit-han-lab/svdquant-models

halcyon yarrow Nov 9, 2024, 2:29 AM

#

mortal mesa links to code and files they offer are here, no for comfy as far as i know https...

yeah i looked thhrogh the code, i didn't see what script or command would take an existing and apply their quantzation to it, i realize its the deepcompressor library but it's unclear how to actually use it

#

see I think specifically the process to quantsize a model is this 3 step thing

https://github.com/mit-han-lab/deepcompressor/tree/main/examples/diffusion

where you can omit the benchmark portion on step 3 if needed but it's like at what point during any of those steps do you input the safetensors file? doesn't make sense to me

GitHub

deepcompressor/examples/diffusion at main · mit-han-lab/deepcompres...

Contribute to mit-han-lab/deepcompressor development by creating an account on GitHub.

halcyon yarrow Nov 9, 2024, 2:59 AM

#

@mortal mesa using that in-context lora lol

mortal mesa Nov 9, 2024, 3:08 AM

#

halcyon yarrow <@401839506493538304> using that in-context lora lol

ide buy it lol

halcyon yarrow Nov 9, 2024, 3:09 AM

#

yeah the concept of some paper chips in a plain brown paper bag kinda intrigues me too

#

STOIQ did a terrible job with the same prompt horrible text horrible design, I'd avoid those chips bc the bag looks so flat and like there's nothing in there

#

I noticed for the couple-generation lora it doesn't really seem to abide by rules set forth https://civitai.com/models/929592?modelVersionId=1040555 @mortal mesa check out how like more than half of the images I made it didn't make a clean split it just ignored the prompt and put them together

untold valley Nov 9, 2024, 3:16 AM

#

giving large more a chance but damn its like 6x time longer gen time, but some results so different.

#

Medium Left, Large Right

#

damn

amber crow Nov 9, 2024, 3:17 AM

#

good

short thicket Nov 9, 2024, 4:15 AM

#

halcyon yarrow <@457597359099215893> have you been working on any new models? i'm waitinig for ...

Yes, but I'm in the very early stages. I just started building an image dataset for fine tuning. I took some time off to touch grass after the time it took to prepare Magic, Matrix and V1.

halcyon yarrow Nov 9, 2024, 4:23 AM

#

took tiime off to touch grass ini other words to smoke some weed? lol

short thicket Nov 9, 2024, 4:31 AM

#

https://tenor.com/view/wink-eye-wink-gif-12352742718877841322

Tenor

#

I needed to take some time to "brain storm"

#

my goal is to build a small dataset of about 10,000 images to start.

halcyon yarrow Nov 9, 2024, 4:53 AM

#

short thicket https://tenor.com/view/wink-eye-wink-gif-12352742718877841322

while you collect, caption and otherwisie prep that dataset wouldn't iit be easier to just remove fluuxbooru and leave that cooking in the meantime? so then you can compare that with and w/o your dataset?

short thicket Nov 9, 2024, 5:13 AM

#

halcyon yarrow while you collect, caption and otherwisie prep that dataset wouldn't iit be easi...

I'm gonna keep fluxbooru in.

halcyon yarrow Nov 9, 2024, 5:16 AM

#

it's your call but it doesnt hurt to do A/B testing w and w/o it just to see how it's affecting image generations, currently not happy with how it's performing even at 61 steps check it out.... damn nvm i can't show you, I instinctively delete bad generations, needless to stay it's very sloppy and incoherent even with 61 steps and 3.5 cfg on what I'd consider a super complex prompt, meanwhile flux dev destill manages to do remarkably better

short thicket Nov 9, 2024, 5:31 AM

#

It will get better in time. I'm basically just doing this in my free time as a hobby. There is always room to improve things. But after putting out 3 models in 2 weeks, I'm gonna take some time to chill. It would be just as easy for you to get the models and merge them how you want.

craggy crest Nov 9, 2024, 5:34 AM

#

short thicket It will get better in time. I'm basically just doing this in my free time as a h...

just a thought, but a lot of people like those booru tags...

halcyon yarrow Nov 9, 2024, 5:34 AM

#

makes sense, has to stay fun if it's as a hobby, dont wanna burn out on too much 'fun' lol anyways yeah youre right it's just merging not training, just never messed with that field of stuff

halcyon yarrow Nov 9, 2024, 5:35 AM

#

craggy crest just a thought, but a lot of people like those booru tags...

the booru tags are cool my disdain is the trainnig methodology the model creator used ended up making things worse not better imo, i think the training dataset was overly ambiitious and he didn't throw enough compute at it so it feels half baked but that's just my opinion from using it, maybe i'm using the model witih the wrong settings still

craggy crest Nov 9, 2024, 5:36 AM

#

halcyon yarrow the booru tags are cool my disdain is the trainnig methodology the model creator...

that might very well be the case. a lot of people don't like waiting, they can't stand the idea that something like training a model might take weeks, and want it done in 30 minutes or less

short thicket Nov 9, 2024, 5:37 AM

#

halcyon yarrow makes sense, has to stay fun if it's as a hobby, dont wanna burn out on too much...

its super simple. here's a very basic lora merging workflow.

halcyon yarrow Nov 9, 2024, 5:37 AM

#

well to be fair he threw a bunch of H100s at it but I think his dataset was larger than his compute, the ratio very well likely could've been off, and one more thing the creator specifically mentioned in one of my comment threads it was not trained with booru tags, in other words all the images processed used natural language VLM captions rather than the booru tags they were found to be part of

short thicket Nov 9, 2024, 5:38 AM

#

halcyon yarrow well to be fair he threw a bunch of H100s at it but I think his dataset was larg...

yeah he did mention that it was undertrained. That was the entire point of leaving it in. I want to continue fine tuning on it.

halcyon yarrow Nov 9, 2024, 5:38 AM

#

short thicket its super simple. here's a very basic lora merging workflow.

so you give it based dev, the fine tune and the lora and it generates the merged model, good to know

craggy crest Nov 9, 2024, 5:39 AM

#

short thicket its super simple. here's a very basic lora merging workflow.

not sure i'd just go with 50 on all the blocks...

short thicket Nov 9, 2024, 5:39 AM

#

halcyon yarrow so you give it based dev, the fine tune and the lora and it generates the merged...

yeah. you can add multple loras in at the same time as well. Or add a model that isn't the base dev

halcyon yarrow Nov 9, 2024, 5:39 AM

#

short thicket yeah he did mention that it was undertrained. That was the entire point of leavi...

its one thing to be undertrained, but he didn't even train it with the booru tags, dude got a lot of grief from that in the comments throughout hugging face and civitai about it

#

using the in-context lora suggested by @mortal mesa using the STOIQ model on this one