#🏞|general-with-images

1 messages · Page 100 of 1

oak osprey
#

every time i restart it, all my settings are gone

sterile temple
#

you can drop a previous image into the prompt area to get some of the settings for that image back, but it won't populate all the various extensions, controlnet etc

#

but with comfyui it saves everything used to make the final image

oak osprey
#

my discord bot lets every user set up their own settings

#

and then it just keeps them

#

it's really not that hard to do 😐

#

though i don't have style presets like a1111 does i've considered it a few times, i'd probably make them global styles for all bot users like SAI does

#

each user can set up a post-prompt that gets tacked onto the end of every prompt, which we found useful for 1.5 where you constantly need a bunch of crap thrown on at the end

cyan snow
#

oh wtf

oak osprey
#

dear god LMAO

#

i forgot i was running a server with a model training

#

it's just been, uh, going

cyan snow
oak osprey
#

oh

#

i wonder if this model has elder magic syndrome now

#

its just been burning to a crisp

cyan snow
oak osprey
#

so i have a theory about 2.1

#

and i'm kind of excited to test it

#

dawned on me today i'm not exactly working with the same conditions the ByteDance researchers were

#

hmmmm that model can almost make a clean black square

#

suswhat the fuck

cyan snow
oak osprey
#

i think i know what's happening here

#

the images are 768x768

#

that picture has a scratch in it

#

is this model burnt or not

#

also what painting is this because it haunts my dreams now

#

bro KEKL

cyan snow
oak osprey
#

i know, that's the real one, the 2nd one

cyan snow
oak osprey
#

theres a LoRA that used this as a sample prompt and when i run it on base 2.1 i get the real painting

cyan snow
#

like mona lisa

oak osprey
#

their LoRA is like, holy cow though

#

totally different image

#

pretty strong weighted lora if it can override this overfitted hunk of art

cyan snow
#

hey, it got it perfectly with this seed =]

oak osprey
cyan snow
#

also, in the presentation today, Emad said something about SD3.0, i thought SDXL is 3.0?

#

also, i'm currently installing kandinsky2.1, this is interesting.

oak osprey
#

it's apparently way overtrained on Midjourney data

#

should be fun though

#

i think the demo images looked really cool

cyan snow
#

A1111 supports it now

#

apparently it isn't based on SD

oak osprey
#

it's not

#

it's using CLIP-L for text embeddings

cyan snow
#

i'l make a comparison real quick

#

both are very similar, which is odd, because of the different architecture

oak osprey
#

lmao i think that's a bug

#

try restarting auto

cyan snow
#

i mean, there is kind of a difference..?

#

restarting now

#

it doesn't give any ''unable to load model, skipping'' messages, so it is the right model

oak osprey
#

it's unlikely they're overfitted on the same thing but maybe they are

#

try other prompts, man

cyan snow
oak osprey
#

A tense standoff in a dusty Wild West town.
Wild horses galloping across a dusty plain at sunset, sharp
a handsome man, vaping a massive cloud in a coffee shop, black and white, sharp
A vampire's castle on a stormy night.
An urban alleyway filled with vibrant graffiti
A thriving city under the sea, inhabited by merpeople.
A time traveler stepping out of their machine into an unknown era.
an embarassing family portrait, photography from 1980s kodachrome style realistic
The hidden underground hideout of a superhero.
an epically massive mecha robot in a fighting stance against a skyscraper in Manhattan, destruction, explosions, professional, masterpiece, majestic
A steampunk city in the midst of an industrial revolution.
An advanced space station orbiting a distant exoplanet.
a child smoking a cigarette in a coffee shop, black and white, 1960, kodachrome

#

whoops, that last one, lmao

cyan snow
#

it's cooking

#

this is the same kind of difference like the SDXL bot does

#

my brain hurts

#

i'll try to move all my models except kandinsky from my models folder to see if that changes anything

#

IT STILL LOOKS ALMOST IDENTICAL TO MY MODEL

oak osprey
#

please

cyan snow
#

wtf

oak osprey
#

something broken

cyan snow
#

there is no way, i don't understand what's happening here

mental cipher
#

ok newbie question, how do i update to the latest version of controlnet?? or update extensions in general?

cyan snow
mental cipher
#

i clicked that but then what

#

on the side it just says behind

cyan snow
#

it updates =/

mental cipher
#

i clicked check for updates and it just shows all of them behind, how do they update?

sterile temple
#

apply and restart

mental cipher
#

i did that 2 werid

oak osprey
#

extreme curling champ monica lewinski

cyan snow
#

whoa, kandinsky also has img+img=img

oak osprey
#

because it's a pixel diffusion model 😄

cyan snow
oak osprey
#

no i'm just being silly, you can do that by converting images into latents as well

oak osprey
#

that's what i was trying to get but what the fuck @ those tiles lmao

#

i disabled VAE tiling and it's still there. that's gotta be the Controlnet Tiles

jovial umbra
#

is the .9 model in Dreams yet?

green socket
#

just on clipdrop

oak osprey
#

you can use the API 😄

#

@cyan snow here's a weird one. heavy metal was the prompt. that's it

#

why is there a god damn chewbaccaaaaa

#

my model has chewbacca syndrome?

cyan snow
oak osprey
#

yes

cyan snow
#

why didn't i hear about this?

#

is it just the base model?

oak osprey
#

they told us all to keep that quiet around you

cyan snow
#

wait, I just looked into it, it's like OpenAI's API, there are no files or anything

oak osprey
#

yes

#

horrific accident pile-up on the expressway with the oscar mayer weinermobile

#

"it's smaller than i remember" KEKL

#

LOL the truck size is so mini

cyan snow
#

I feel like i just drank about 3 liters of mcdonald's sprite

oak osprey
#

it's like the truck on the left is one of those JDMs

#

either that or that's one HUGE Cadillac

cyan snow
#

bro, i feel so dumb rn

oak osprey
#

they sent the bigger weinermobile as backup and it got into another accident. tragic

oak osprey
oak osprey
cyan snow
oak osprey
#

those cars all look totaled

oak osprey
#

my model does not have elder magic

cyan snow
oak osprey
#

it's just because it's a base model

#

it's up to you to provide the magic

#

i suggest trying to fine-tune it with MJ 5.1

cyan snow
oak osprey
#

yea

#

but it needs their 1024x1024 images at a minimum

cyan snow
oak osprey
#

yes

cyan snow
#

so it's already a finetune -_-

#

did you just make a continuation of the 2.1 training?

oak osprey
#

i did it so slowly and carefully that there was not much added or removed

#

it was just trained on realism

#

i used the laion datasets with like 150,000 imgs

#

pulled out Nikon camera pictures and used those

cyan snow
#

so, it's just renamed 2.1 basically?

oak osprey
#

?

#

i trained it at a minimum of 1024x1024 and on 3:2, 2:3, 16:9 and 9:16

oak osprey
#

it still kinda does 768x768 but it starts being good at 1024x1024, especially with the CFG rescaling extension i assume - haven't tested that

#

if you use a negative text embed it gets better. i briefly tested the aether lux one from joachim and it cleaned up all of the remaining residual noise in the random images/seeds that still have it

cyan snow
oak osprey
#

that's a month

cyan snow
#

yeah, i'm not going to be at my country for a month also

oak osprey
#

grandpa has a bluetooth hand

cyan snow
cyan snow
oak osprey
#

grandpa searching for answers

#

we must assemble a team of grandpas, like Oceans 71

#

first we must learn to use the internet

cyan snow
oak osprey
#

christ this prompt is so strong lmfaooo

#

Pixar style little girl, 4k, 8k, unreal engine, octane render photorealistic by cosmicwonder, hdr, photography by cosmicwonder, high definition, symmetrical face, volumetric lighting, dusty haze, photo, octane render, 24mm, 4k, 24mm, DSLR, high quality, 60 fps, ultra realistic

cyan snow
#

It's an image, it can't have frames per second

oak osprey
#

i grabbed that prompt from somewhere random

#

just seeing what happens with these stupid prompts

#

they also have octane render in there like 3 times

split rover
#

that is truly a prompt that was

proud dagger
#

a prompt of all time

oak osprey
#

why put photorealistic in there pepehands

#

just type "uncanny valley"

#

SDXL knows

#

did i get a bad roll jackpot again

proud dagger
#

shit you might be onto something

oak osprey
#

the motion is pretty good

#

oh, did that prompt even work on the bad model?

proud dagger
spring sail
#

zoomzoom...

#

I was just about to ask in this server how a real big post w/ nsfw on the subreddit has been up for 18 hours and right as I was about to ask it got removed LMAO

proud dagger
spring sail
proud dagger
#

it was taken down 5 hours ago

#

how long have you left your reddit tab open

oak osprey
#

oh shit, called out

#

AI Man is here

spring sail
cyan snow
#

Also, I got Kandinsky2.1 working using a different WebUI specifically made for Kandinsky2.1 and I gotta say, this damn model might be better than SD

oak osprey
#

tell चाची (aunty) all about it

#

@neon crystal i have new toy for you

primal veldt
oak osprey
#

it's too realism focused 🤣

#

it's funny because my pseudo-journey model is like, pure magic

#

this one is more grounded in reality

#

pseudo-real is interesting because it is heavily trained on real photos and has very clear images and also evenly mixed with midjourney stuff

#

so it's more of a middle ground, where you can make funky houseparties and other weird stuff involving pretty well-composed humans

wispy nest
#

Guys i need a really good at stable diffusion guy

#

Who's there knows a lotabout it?

cyan snow
oak osprey
#

this guy!

#

just speak your problem my child and the solutions will happen

past crystal
#

Inspired by Lemaire 2023 collection.windbreaker,white light gray,Seasonless, Genderless,

Effortless,full body view

cyan snow
#

also, i gave up on kandinsky, it's way slower than SD, and lower detail

past crystal
#

/Inspired by Lemaire 2023 collection.windbreaker,white light gray,Seasonless, Genderless,

Effortless,full body view

cyan snow
#

also, why is img+img not possible on SD?

#

atleast, there is no extension for A1111 that has that function

sterile temple
#

what is img+img?

cyan snow
sterile temple
#

existing images?

cyan snow
#

yeah

#

i heard that MJ is capable of that

smoky oak
#

MJ can do it pretty poorly, but hey, its a lot better than not at all haha

cyan snow
#

why isn't there an A1111 extension for it? kandinsky can do it

oak osprey
#

john cusack 😄

#

wheres the trust, john.

#

just hangin out at the park, no big deal

#

i love how my john cusack looks just like SDXL's

#

it's like an OpenCLIP thing

cyan snow
cyan snow
#

doughnut burger

#

"bruh moment"

#

the AI looked at what i want it to describe and just went bruh

wild pagoda
#

dude is popping off

cyan snow
cyan snow
cyan snow
# cyan snow

i can't beat this with SD and XL, this was made using kandinsky2.1

#

i swear man, that model is so hit or miss

#

like damn, it follows the prompts perfectly

#

i feel like kandinsky has so much potential

sterile kiln
#

Zoom in guys

spring sail
#

so I've been doing like

#

all day working on a piece of ai art in clipstudiopaint

#

and it's driving me insane that. even if I repaint every single aspect of the image if I post it as 'my art' without disclosing it started as an ai generated image I'm immediately an awful human being

#

at least in like, friend group servers or art servers

#

BUT

#

let me show off some progress on it bc >:333

#

the kimono is nowhere near done that's just like. blocking out stuff atm

#

but!! yeas. silly

#

I plan on replacing the background completely as well

#

I also plan on adding more shadows and highlights but that'll be towards da end..

#

OH and I wanna add a hair ornament. and the arm will b added back ofc

#

anyways. little ramble. enjoy this

wispy nest
#

Guts

#

Who have A BIG knowledge about stable diffusion there?

#

@tropic shell

#

Ad

tropic shell
#

H m m

#

sleepily opens eyes

#

Hello everyone

wispy nest
#

Get some rest from character ai

tropic shell
#

I just woke up...I'm alivvvveeeee

#

🧟‍♀️

wispy ether
cunning geode
#

so far my general way of generating things was always generate in batches of 4, and do high-res fix from 512 to 1024 at the same time, with a denoising strength of 0.4
yesterday someone showed me a better way of doing things.
generate stuff in larger batches, only 512, then take a good result and selectively apply highres fix to that. but put the initial gen first into controlnet tile, then put denoising strength to 0.7/0.8. that way one gets a much much higher quality and more detailed output, while also seeing a larger variety of output to choose from.

here is an example of my old method (left) vs. the new method (right)

EDIT: nvm, after more testing its a bad method. dont do it. results in extremely overdetailed, fried images.

cunning geode
#

i put it through my own model:

#

needed a few tries tho

cyan snow
#

but well, the unfinished SDXL model that the bot uses can beat kandinsky2.1

#

the only advantage kandinsky2.1 has over SDXL is that kandinsky2.1 can fuse 2 images together

cunning geode
#

someone on reddit sent me a message that my model is very good at anime, and not so good at photorealism, and that i should focus on the anime part of the model as the people who make it to the top either do very unique stuff or are the best in their field, and currently there is a race to the best realism possible, while there isnt that much competition atm for actual anime

and he said i am not doing myself any favors with the images on the frontpage as my model can create much more beautiful stuff

#

ill keep it multistyle but ill definitely update the thumbnail and example images with something better
like that dog made out of water for instance

wild sorrel
#

waow why don't you tell him that

cyan snow
cunning geode
dense tapir
wispy ether
#

Merge real life model with anime model and get anime dwaynw johnson

dense tapir
cyan snow
dense tapir
cyan snow
dense tapir
cyan snow
dense tapir
#

Too MJ like for my taste.

#

I never really did see what people saw in MJ

cyan snow
cyan snow
dense tapir
#

Glad that it did but so cartoony

#

I mean MJ is

#

I wonder but up the stairs what is that? Are we in a jar with jars?

kind quartz
#

Jar in Jar
@cyan snow what is your model name?

cyan snow
kind quartz
#

o.k. just asking if not using it 🙂

oak osprey
#

it doesn't know Olivia Chow Sad

cyan snow
kind quartz
#

it is regional prompter, i like that bank

@cyan snow have you used word model for those miniatures in jars? It is magic word imho. When used my own no lora tiltshift it is like charm.

cyan snow
kind quartz
#

i dont mean embeddings, just prompt. model of village in garden for example

cyan snow
#

ohhh, i get it

#

no, i didn't use that word

kind quartz
#

word "model" i mean 🙂

cyan snow
weak sage
oak osprey
cunning geode
oak osprey
dense tapir
oak osprey
#

tried to make shirley maclaine and it put some dude crawling up her back

stoic pivot
#

I made some characters in a t-pose and had an idea to model them in Blender, but I want to use them in VR-chat just for the LoLs, so is there any method to make 'image to 3d characters?
It do not need to create perfect result, but a plus would be if I can use with Mixamo,

oak osprey
dense tapir
#

Poor fish on the left

oak osprey
#

controlnet goes wild sometimes, man

stoic pivot
#

I wrote "cute feminine clown" and we have to trust AI that soon is smarter than us according to the old media, this is cute?

oak osprey
#

man that thing is freaky

#

james dean as harry potter

cunning geode
dense tapir
oak osprey
#

SDXL's

dense tapir
novel ravine
cyan snow
dense tapir
oak osprey
#

i don't know if this is good or not, as i don't watch anime thinkies

crimson goblet
#

Is this too generic for a possible kinetic novel or comic?

cunning geode
#

this took a ton of inpainting little details, as well as some very slight editing of little details in GIMP, and its still not perfect, but i think it looks good now

dense tapir
wild pagoda
cyan snow
dense tapir
stoic pivot
#

Add the word "robot", robot clowns have to be less creepy

dense tapir
#

lol

cyan snow
oak osprey
#

smells like cookies

dense tapir
cyan snow
cunning geode
stoic pivot
#

Test, used a Pixel art model and then a extension to remove background, after that resize to a 512px size again.

#

Darn. That was a good test, now I see it left a white line that I did not see when I made it.

dense tapir
sterile temple
#

morning! did someone mention clowns?

stoic pivot
dense tapir
sterile temple
dense tapir
smoky oak
#

Oh, the server is back

#

Anybody else get that big server blackout?

smoky oak
dense tapir
#

Yep

#

Must have crashed or been rebooted

wispy spindle
#

It's global

smoky oak
#

yeah, i got it across several partnered servers

#

wonder if it was an issue that hit their higher priority servers

dense tapir
#

Ahhh, all of my discords remained open

#

low priority though

wispy spindle
#

The bigger servers got the hits

#

Yeaaa

smoky oak
#

all of my smaller ones were fine, but the big servers I am in were down

dense tapir
#

yes, the same

smoky oak
#

Newegg, this one, Python

#

very interesting

dense tapir
#

waiting on my new case from newegg and never knew they had a discord, lol

smoky oak
#

they have had it for years and years

#

I am like one of the top 50 most active in there, and I haven't touched it in a long ass time lol

dense tapir
#

This is my first order in 17 years from them on my 20+ year old account.

sterile temple
#

I just got my new case :>

dense tapir
#

my new case is today just waiting on UPS

smoky oak
#

I am trying to find something to print now that I have my 3D printer dialed in again

boreal falcon
#

i wish id have one

#

but it will take some time until i get one

sterile temple
#

did the type of printer that uses light and a liquid plastic ever become a thing? i remember seeing a cool tech demo years back, to see the solid object rise out of the liquid

smoky oak
#

Luckily you can get great ones for very affordable prices

oak osprey
#

this must confuse the shit out of the AI

boreal falcon
#

what i got my eyes on is very expensive tho but techically i could get more affordable ones too

stoic pivot
#

Even it is not perfect, the remove background is kinda cool

mild tusk
#

it nice

#

did you photopsh remove the background or was there a thing for it?

stoic pivot
#

It is a extention for Automatic1111

smoky oak
mild tusk
#

oh neat

smoky oak
#

They are what I use now

#

Damn, my 3D model I made for printables now has me at top 60 3D artists on the site in the world lmao

#

a single design

oak osprey
smoky oak
#

I still can't believe it was my first fusion 360 design, and still by far the best to this date lmao

cunning geode
mild tusk
#

pikachu if he was in darkest dungeon

smoky oak
#

oh yeah, I messed a bit with that art style LoRA

mild tusk
#

i love it

smoky oak
#

I wish there was a more capable AI to 3D workflow

oak osprey
smoky oak
#

I got all excited and tried out that Shap E addon, but I didn't realize how severely limited it is

mild tusk
#

there is with blender

smoky oak
#

eh, I'd rather not have to use blender anymore haha

oak osprey
#

casts a spell and ties you to Zbrush forever

#

your wish is granted

smoky oak
#

I don't care much for Zbrush either

oak osprey
#

theadv3OhYes I SAID WISH GRANTED

smoky oak
#

My favorite 3D workspace has actually become Medium, believe it or not

oak osprey
#

the blog post website?

#

wat

#

not

smoky oak
#

IMO, its unparalleled for ease of sculpting and intuitive using

oak osprey
#

"believe not", my choice, final answer

sterile temple
#

is maya still a thing? I think we used that when I did a year in art college about... 18 years ago?

smoky oak
#

its been out for years, and its still the best for VR sculpting IMO

oak osprey
#

i was gonna say you should try TED talks next

smoky oak
#

Its so much faster and more effecient than 2D projected sculpting

sterile temple
#

yikes, 2630 dollarydoos a year 😐

smoky oak
#

Yeah, Maya and 3DSMax are way overpriced

weary cloak
#

Does anybody know how to update xformers?

stoic pivot
#

It will be so cool in the future when you see your friends work from home through their AI robot.

smoky oak
#

@oak ospreyhttps://youtu.be/qnTvoAzxnbM?t=598

Adobe Medium

Adobe Medium (Virtual Reality Sculpting Basics). Adobe Medium was formerly Oculus Medium and owned by Facebook. Adobe acquired them in 2019 and recently released their first version under the Adobe banner. There aren't a huge amount of major changes on the outside but lots going on under the hood.

Medium is a VR sculpting app that is probably...

▶ Play video
cunning geode
#

@smoky oak@mild tuskin case youre interested, my st-AI-le model also includes the darkest dungeon artstyle
here is an example

smoky oak
#

that time stamp shows really all you need to see

sterile temple
#

phew! I can buy 16 days of access for only 425 dollarydoos, that's a relief!

smoky oak
#

its insanely nice to use

mild tusk
#

i need to try it

#

it lora right?

smoky oak
mild tusk
#

ohhh that one

#

ye i was gonna try it

smoky oak
#

oops, wrong one

mild tusk
#

oh

smoky oak
#

Thats the better one

cunning geode
#

no i am saying i have a model, like a checkpoint, that among a lot of other styles includes the darkest dungeon style, its not a lora

mild tusk
#

ohhhhh

#

a checkpoint wtih this would be nice

cunning geode
smoky oak
#

Interesting model

cunning geode
#

i try doing something different than other people

#

cant tell you if my darkest dungeon style will be different or better than the two loras tho

smoky oak
#

I much prefer having lots of individual models that are very good at specific things over having one model that's pretty decent at doing a lot

cunning geode
#

yeah i heard that a lot lately haha

smoky oak
#

But there are some people who much prefer not having to mess around with different models

dense tapir
cunning geode
#

@smoky oak@mild tuskdo you want me to do some test prompt(s) for you in my model using said style?

mild tusk
cunning geode
smoky oak
#

Oh, no thank you, I'm good. I stick to the models I use daily, just forgot about the darkest dungeon style is all

mild tusk
#

i saw but nobody did made one :C

cunning geode
mild tusk
#

ohhh

#

ye go for it

cunning geode
#

well do you wanna see something specific?

mild tusk
#

hmmm

#

not really sure whta to see

#

anything i guess

oak osprey
mild tusk
#

i made a goblin

#

real design btw

sterile temple
#

grumpy old bones full of glitter dust

mild tusk
dense tapir
mild tusk
#

dr octopus

cunning geode
mild tusk
#

neat

cunning geode
#

the skull on the ground and the head in hand were random lol

oak osprey
#

@smoky oak this image to me should be on the tin for SD 2.0

oak osprey
#

bill engval as a school teacher in a classroom in front of a chalkboard

mild tusk
#

lets see what i get with the dd one

oak osprey
#

i didn't know i have to be specific and tell it "with desks of students" KEKL

cunning geode
mild tusk
#

give me a curse prompts

sacred fulcrum
#

Prompt: a zombie giving a math lesson in a university, dark, cartoon, green lighting, darkest dungeon style

mild tusk
#

i lvoe it

oak osprey
#

@smoky oak i could have a whole blog i just post "Images that confuse AI" from the LAION data

cunning geode
#

no idea why there is suddenly a white border

sacred fulcrum
mild tusk
#

flashback

mild tusk
#

oh yo mean checkpoint?

oak osprey
# mild tusk give me a curse prompts

a handsome mage wizard man, bearded and gray hair, blue star hat with wand and mystical haze
An old wizard's tower, filled with magical artifacts and spellbooks.
A tense standoff in a dusty Wild West town.
Wild horses galloping across a dusty plain at sunset, sharp
a handsome man, vaping a massive cloud in a coffee shop, black and white, sharp
A vampire's castle on a stormy night.
An urban alleyway filled with vibrant graffiti
A thriving city under the sea, inhabited by merpeople.
A time traveler stepping out of their machine into an unknown era.
an embarassing family portrait, photography from 1980s kodachrome style realistic
The hidden underground hideout of a superhero.

sterile temple
sacred fulcrum
mild tusk
smoky oak
oak osprey
#

they updated clipdrop

#

they say "it consistently outputs our best images"

smoky oak
#

Maybe for SDXL, but their other tools suck ass

oak osprey
#

ohhhh

#

yeah those need an update with new SDXL base i think

smoky oak
#

No, they are just bad bad lol

#

like their upscaler is bold face lies

#

and it does a frankly terrible job

sterile temple
#

hey it's me during math class 😄

smoky oak
#

Their example images are all fake as well

#

If a company gives me a section that says "Try it yourself", and then feeds me fake results, I am not vibing lol

sterile temple
smoky oak
sterile temple
#

whatever gets those conversion rates up

cunning geode
#

darkestdungeon artstyle of a zombie (((giving a math lesson in a university))), dark, (((green lighting)))

even with triple emphasis its struggling with that prompt lol

smoky oak
#

My favorite is how they bold face lie by saying their AI can turn this

#

Into this

#

All they did was lower the res of an already high res image lmao

#

Or this one

oak osprey
#

that's how those upscalers work

#

they took some stock photos and cropped the example area to show detail

#

just like how the controlnet tile page uses a puppy from 64x64 to upscale to 1024x1024

#

using just the prompt, "best quality" too

#

i imagine the puppy was originally larger than 64x64

#

i don't know if it looked exactly like the original puppy, after upscaling all the way to 1024x1024... if you roll the seed or change the prompt to like "mastepiece quality" the fur on the puppy changes

sacred fulcrum
oak osprey
sterile temple
# oak osprey

why does that look like trump? must be the square head

oak osprey
#

ohhh that explains a lot

smoky oak
oak osprey
#

oh Sadge

smoky oak
#

It's not even as good as gigapixel

#

But, they make you pay to find out what their "detailed" upscaler does

Which is why I think it's scummy that they take these images that are so clearly just downsampled

sacred fulcrum
oak osprey
#

idk, SDXL is a capable uspcaler, i think it just needs an update

smoky oak
sterile temple
smoky oak
#

Yeah, that's why I prefer ultimate upscale, much easier to get working, faster, and has controlnet support

sterile temple
#

I just want to be able to drop an image in and it uses the controlnet tile thingymajig automagically

smoky oak
#

Ultimate isn't that easy, but it's a lot easier than fiddling around with the sliders in multi

sterile temple
#

none of this divide by 8 business 😄

smoky oak
#

Yeah, ultimate doesn't have that

#

You select your final res target, the tile size, and the pixel upscaler (if you want one)

#

I have gone to absurd resolutions with ultimate upscale

sacred fulcrum
#

I'm very used to multi, maybe I should try Ultimate 😁

smoky oak
#

I did 16k ultrawide on an 8GB GPU with it. I'm confident I could do it much faster and higher quality now

cunning geode
smoky oak
#

2560x1080 base res for reference

#

here is the difference

sacred fulcrum
#

I have an amd gpu with 8gb and can't do more than 1586x1024 with multi...

smoky oak
#

Can't send the full image, it's like 200 some MB

smoky oak
#

All it does is render tiles of the image at the res you provide, and then stitch them together

#

another example from base res to final crop

#

I don't have the full res images on my phone ATM

oak osprey
#

loss=0.0926

#

hey my theory is right so far

sacred fulcrum
smoky oak
#

I got this image with 1.5 and ultimate

Please excuse the whisker crust, found a way to stop that from happening recently

smoky oak
#

It's made specifically to work with ultimate to prevent that by allowing it to reference the whole original image

oak osprey
#

that's not a controlnet tho 😄 it is in the suite but yeah it's different

#

controlnet is in between the blocks and reference is a hack on the block itself

smoky oak
#

Either way, it part of controlnet default now haha

#

All I know for certain is it does a damn good job of making sure that it stays faithful

oak osprey
#

ever seen loss that low? loss=0.00459

smoky oak
#

Only in LoRA training, never in dreamboothing or fine-tuning

oak osprey
#

this is SD2

smoky oak
#

My lowest I ever got was 0.0023, and the results was trash lmao

oak osprey
#

interesting

#

well i'm simply testing a few theories out at once, if it fucks up, nothing is lost

#

i win either way 😄

sacred fulcrum
#

I'm going to try Ultimate since multi is super slow (4 minutes per image), but the end result is exactly the same image as the original without any new detail.

smoky oak
#

4 images for an image on an AMD GPU doesn't sound that slow

oak osprey
#

are you trolling lol

#

even a vega56 can do them in like 30 seconds

#

4 minutes is awful

smoky oak
#

They are using multidifusion to 1080p

#

That was like 2 minutes on my 3060ti

oak osprey
#

well, fair, but still

#

that's just bad vs bad

smoky oak
#

Multidifusion is really slow, which is unfortunate cause most people know of it over ultimate

sacred fulcrum
oak osprey
#

i mean normal image gen

#

512x512 happens in 30 seconds

#

so if you're just after images, well, 4 minutes is a long time

sacred fulcrum
oak osprey
#

yeah 🥹 the vega56 sucks now

sacred fulcrum
#

20 steps

smoky oak
#

With 1536x1024, I could probably hit that in like 15 seconds on my 3080 using ultimate upscale

sacred fulcrum
#

rx 580...

oak osprey
#

oh i think i was doing like 50 but i used PNDM back then

#

those cars are cool

dense tapir
oak osprey
#

ho shit, the model can do william boooooth

dense tapir
oak osprey
#

@smoky oak remember how hard it was for me to make real people for so long? now i can't make fantasy stuff with the model KEKL

smoky oak
#

Yeah, its good to see it actually looking fairly good for real stuff. Very impressed and happy about that

oak osprey
#

im sad i cant make cool stuff anymore Sad

#

imagine plugging this shit into a controlnet

#

or even better, training a controlnet tile model based on this base

#

i'm so close to just offering to buy the training details

sterile temple
#

you could try mixing the two models, use weighted block merge?

oak osprey
#

nope they have a different noise schedule

mild tusk
#

cyberpunk apparently

dense tapir
oak osprey
#

if i throw midjourney+++ on there, which works on the pseudo-journey-v2 model 😁 it doesn't do much here

dense tapir
mild tusk
#

i love this

#

what are the model and prompts?

dense tapir
#

It is a new lycoris I am training

#

well, trying to train

mild tusk
#

lycoris?

dense tapir
#

lycoris locon a lora type

mild tusk
#

i see

#

mother 3 if ported

dense tapir
mild tusk
#

same

oak osprey
#

so high

dense tapir
#

Is that a model, or a lora/lycoris?

mild tusk
#

question as im unsure what do the negative prompts do?

dense tapir
mild tusk
green socket
mild tusk
#

ah i see

#

neat

oak osprey
#

uh oh, someone stop the photoreal model! it's going creative!

mild tusk
smoky oak
#

Actually not that far from some of my Synesthetic reactions lmao

oak osprey
#

it's not great to me but yet at the same time, it's not bad

#

like, it's kinda hilarious how hard it's trying for papa

#

" 🥹 i can do magic for you"
"show me Kappa"
" strains, strains, strains, farts 💨 🌈 thinkies "

#

i have looked up some traditionally epic prompts and all of them come across as very grounded in reality in this model

#

the elderly magician doing quantum experiments yada yada was just some grandpa with a sparkler

#

damn, the detail in that image reminds me of DALL-E 2

#

OpenAI is definitely sampling from the last timestep

cunning geode
#

anyone know how to upscale an image without the "smoothing" effect?

oak osprey
#

use a prompt with the img2img that adds more

#

if you just really want to do it for one image, it's really easy to use the controlnet tile example code and tweak the prompt you give it directly, as A1111 doesn't let you set them separately

round helm
#

@gritty trellis 🤫 🤫

oak osprey
#

you tunin' it now? 🙂

wispy nest
#

nm i figured it out (i think lol)

sterile temple
#

option for the API

oak osprey
#

i prompted mark zuckerberg using the computer in a totally normal way

#

i love the weird apology notes SD can make

gritty trellis
#

You said you wouldn't finetune anything fancy!

#

And yet; there you are!

round helm
#

ha not a fine tune yet lol, need a bit more time, just raw output

#

"jesus christ in a ny yankees baseball cap, by claude monet"

prisma iron
#

😄

oak osprey
#

i will have to play catch-up to update my training code i guess

sterile temple
#

I think this 1.5 has been trained on more images of him

oak osprey
# sterile temple SDXL 0.9

DPM++ 2M makes it look so damn good all the time, but hyper-smooth when the prompt isn't like, "good", it's hard to describe

neon crystal
#

Just how good is SDXL damn! Finally we have a base model that is at par with midjourney.

#

No need for prompt fluff like "beautiful eyes", "hyper realistic photograph" anymore. Almost as crisp as Midjourney generations. Follows the prompts as well as anything else I've seen ( Midjourney, Kandinsky etc ), Hands and fingers come out well most of the times. Does extreme eating contests pretty well lol ( Bing anyone ? ) Glitch is improved. Man I can go on and on.

oak osprey
#

and it used to be better! imagine that 😛

neon crystal
#

Anyone else fooling with Zeroscope XL ?

neon crystal
oak osprey
#

i still write all my own code, whatchu talkin about KEKL

neon crystal
# oak osprey

Was joking of course 🙂 I wish I had the time to deep dive and work on an aesthetic discriminator or something. Such is life I suppose

oak osprey
#

i'm looking at fine-tuning 2.0 and its surprising how much more fine details it has in it than 2.1

#

this is 2.0-v, not 2.0-base

#

it seems like 2.1 is just overcooked or something

#

think they put like 1.5 million steps on 2.0 to make it

sterile temple
#

what was 2.1 again? the appealing to outrage model? 😄

neon crystal
#

I played around with 2.0 too recently! Really liked what I saw. This leopard gecko fella I presume generation is so crisp wow!

oak osprey
#

i've tuned terminal SNR into 2.0 to get it like this

#

i've been tuning 2.1 for a while and it had all the same kinds of anatomical deformities but it lost faces. 2.0 does faces better

#

especially little faces

#

finer details in general are far superior

#

the best i can get 2.1-v with fine details is a disappointing kind of patterning state like this

#

same prompt in 2.0

#

i will keep it training just to see if i can resolve the deformities and keep the details because i think it looks great. a lot of the prompts between the two behave pretty similarly but there's others that are DEFINITELY better in 2.0

#

this one actually has a woman that's an astronaut. 2.1 does not, she turns up as some decapitated head inside a cylinder

#

interesting that this one already doesn't have dupes

#

2.0 can make toddlers light up in a coffee shop without issue, i had to work so hard to make that work in 2.1 lmfao

#

even the family photos?? nooo why KEKL

oak osprey
#

lmfao i brought my fine-tuned OpenCLIP into the fold and they look like a early 00s rap music video

#

but people look fine now when they do show up solo

#

hmm, so text encoder trained on photos is A+

#

SAI probably could have saved themselves a LOT of time

#

this shit's gonna be great. it'll not be the best thing ever but it'll be soooo different

#

looks like West World

spare lantern
#

😮

oak osprey
#

@split rover you guys should tune the text encoders a little someday 😛

#

they seem to come with fucking cursed quanta

#

imagine instead of paying to do 800k steps on AWS you guys just did the same thing i did to the text encoder over just 4 weeks

cyan snow
cyan snow
cyan snow
#

''lost in the joy of creation''

smoky oak
#

@oak osprey you here?

#

I made a post warning about the company that scammed me on Reddit, and I was pretty tame with the post. They found my claims and tried to pull a 180 and defend themselves, saying that I was the problem.

That warranted my response going into extreme detail over all the things that they did, to which they have refused to respond to.

The Post only has eight up votes, but their response has -5, and my response to them has +4, so that alone assures me that I was not alone in my feelings about their shitty practices and deceiving behavior

exotic breach
#

As a newcomer to the whole AI image generation thing I am more than a bit impressed with this new 0.9!

smoky oak
#

These were the results from the refiner paired version (most are mine, some are from other people in the community)

#

0.9 is still dope, but full refiner was legendary haha

exotic breach
# smoky oak These were the results from the refiner paired version (most are mine, some are ...

I have also been using it on the Clip Drop website and noticed the quality was a little better on there but I wanted to use different aspect ratios and had a load of credits I was not using in Dreamstudio (I was very let down by Dreamstudio and the credits were just sitting there). Until my personal finances allow me to set myself up with something better than this crappy laptop I think Dreamstudio with whatever 0.9 version they have and my CC sub for gen fill will be ok for me 🙂

smoky oak
#

Up until a couple days ago, the one on a clip drop was considerably worse. It appears as though they recently updated the clip drop model to be the better refined version of 0.9, however it still doesn't have the refiner model which is what brings the results to the next level unfortunately.

If they can offer a decently priced SDXL generation service with the full refiner model, I see it being a genuine threat to mid journey, cause it can even do good text

smoky oak
#

For now, at least

exotic breach
exotic breach
smoky oak
#

As of right now you can't, but I already have a friend in this server who will be working on a discord server bot deployment for SDXL

cunning geode
#

can somebody tell me how to upscale an image without it becoming so smooth?
i tried different samplers, different sampling steps amounts, different upscalers, with and without controlnet tile, etc
nothing works
it always is this smoothed out

smoky oak
#

sure can, what kind of res are you trying to reach?

#

oh man, that image is cursed the more I look lmao

#

@cunning geodeMeant to reply

cunning geode
#

@smoky oak thats from higher denoising but thats irrelevant

this was just simple from going from 1024 to 2048

smoky oak
#

What model?

#

Cause this looks exceptionally bad for whatever reason

sterile temple
#

I believe it's their own 1.5 model they trained without mixing

cunning geode
# smoky oak What model?

what humblemikey said but that it looks bad isnt relevant here
i know it looks bad because i put denoising to 0.6
its just the quickest example i had availble for the dmoothing effect
it doesnt matter what denoising i used, it always came out this smooth

sterile temple
#

did you try a latent upsampler with the high denoising?

cunning geode
#

i see people upscale images all the time without this smoothing effect
i dont get why it doesnt work for me

cunning geode
#

i already tried with foolhardy but little difference

#

again dont focus on the denoising or the bad image quality
this also happens to me with 0.1 denoising and better image quality
it was just the quickest example i could grab

cyan snow
#

Kandinky

#

1.5 finetune

#

i feel like latent diffusion models can do better detailing(SD) and pixel diffusion models(kandinsky) can follow prompts better

#

like, i can't get this level of coherency out of 1.5 finetunes and maybe even XL

sterile temple
#

what's the prompt?

cyan snow
#

SD just does a dog on fire, not what i asked it

#

and kandinsky does just that

cyan snow
sterile temple
cyan snow
#

=/

sterile temple
#

try adding 'concept' to your prompt

cyan snow
#

yeah, i feel like Kandinsky might be better with this kind of prompt =[

sterile temple
#

SD can make dogs out of other things, just not fire 🙂

cyan snow
#

i think i can get the highest level of detail with 1.5 finetunes or SDXL, but the most coherency out of kandinsky

#

for the most part, I'll probably still make the SDXL finetuned I said I'll make and likely mainly use it, but if i would want something very specific, Kandinsky might be the way

sterile temple
cyan snow
#

but yeah, I hope that the Russians won't beat us

sterile temple
cyan snow
#

We have been working on Stable diffusion way longer than the Russians are working on Kandinsky, so I refuse to let their model beat SDXL

sterile temple
#

it will be hard for them to catch up

cyan snow
sterile temple
cyan snow
# cyan snow

Like, I can't make these kinds of images using Stable diffusion, only using Kandinsky locally. But I'm pretty certain I will be able to make a better model than Kandinsky and MJ after fine-tuning XL.

sterile temple
sterile temple
cyan snow
#

But yeah, both Kandinsky and SDXL are impressive and different models, SDXL isn't able to fuse 2 images together like Kandinsky can

proud dagger
#

"fusing images" sounds like a thing software tricks might be able to pull off once you're running at home rather than via bot or api

#

(via eg controlnet, img2img, unclip, etc)

cyan snow
proud dagger
#

I don't know offhand what Kandinsky does but, uh, i doubt that?

cyan snow
#

That might be the reason it can pull off those images I posted earlier

cyan snow
#

It worked pretty nicely

proud dagger
#

Did... what? implemented the source code for whatever their mixing thing is?

#

The thing I doubt is that it's somehow limited to pixel models only

cyan snow
#

2.1 has different architecture than 2.0, at least as far as I know.

proud dagger
#

I know the feature works.

#

It will probably work in SD too, is my point

#

actually yeah it's literally just multi-unclip lol

#

that's definitely a valid thing to do in SD

cyan snow
#

Than why does A1111 UI don't even begin to fathom that feature?

#

This should be a thing as far as I know

proud dagger
#

because you haven't installed the controlnet extension?

cyan snow
proud dagger
sterile temple
proud dagger
#

oh, yeah, it does allow multiple, nice!

#

gotta love comfy

cyan snow
#

It's the UI with the most features

#

Well, I'm about to return home and I'll make an organised comparison of 1.5 fine-tunes, SDXL and Kandinsky 2.1

#

Kandinsky2.1

oak osprey
cyan snow
#

I personally think the Kandinsky won here

oak osprey
cyan snow
#

i can't make this with 1.5 fine tune

#

as much as i hate Russia for what they did to Ukraine, I gotta admit, they make damn fine models

proud dagger
cyan snow
sterile temple
cyan snow
#

huh, i think the SDXL might have beaten it

#

hmmm, I think after finetuning XL it might be way better then this Russian model, like Emad said

oak osprey
#

they're both trained on midjourney images 🤣

cyan snow
#

SDXL isn't trained on MJ, as far as i know

proud dagger
#

bear in mind the SDXL bot randomizes settings and models and all

#

also bear in mind SDXL interprets prompts differently vs. kandinsky, so you might need to change up your prompt to fit you goal

#

i tried running it locally and got some cool results

cyan snow
proud dagger
#

or that but darkerer

#

(I'm running a singlestage model cause im too lazy to load the refiner stage)

cyan snow
proud dagger
#

I added "dripping acid" to try to go for the style of that, which is neat too

cyan snow
#

but I'm sure we can make a far better model than both after SDXL gets released and finetuned

proud dagger
#

ye, finetuning SDXL is super powerful

#

even a small lora can do a ton

#

(I've been prepping for a followup post about loras after my last one on bare minimum training test)

oak osprey
#

still no weight access

proud dagger
#

base model definitely does cool things with just a bit of playing with prompt

cyan snow
#

my conclusion is that this is not a fair comparison because as far as i know, Kandinsky is an already finetuned model with different architecture. in might beat SDXL with those images, but after it gets finetuned it will for sure be better than Kandinsky

#

but time will tell =], we will for sure know eventually

oak osprey
#

kandinsky is a base model too and it can be fine-tuned

#

the terms base model are thrown around pretty loosely, 2.1 is called a base model but that bastard is burnt to a crisp compared to 2.0

#

as far as i can tell we're fighting all of the extra hundreds of thousands of steps that SAI put into 2.1 for no reason other than to increase their performance on benchmarks

proud dagger
#

just increase LR and burn it back out of the way

oak osprey
#

i tried

#

there's only so much you can do

#

my best results with 2.1 are still pretty "meh", even though it can make stunning photos of people, their fine details are like "what the hell?" vs 2.0 that has really strong fine details just 1 epoch into terminal SNR fine-tuning

#

@proud dagger am i correct in the assumption that converting a model from epsilon to v-pred is going to need more than a single GPU?

#

cuz it really feels like that, lol

#

600 steps in on the 2.0-v model it's starting to understand high res properly too. i love how these two still look like some bootleg rap music video

proud dagger
oak osprey
#

it's an A100 80G

#

if it'll work, i'll just let it run for a long time

#

but it's feeling like i don't need to go all teh way back to 2.0-base to get the results i was hoping to obtain

#

i gotta tell you, the built in text encoder of 2.0 makes it feel distressing when doing validation tests. everything looks so bad. there's body parts showing up out of nowhere? the faces of the people are ironically amazing compared to the anatomical gore of their arms and legs

#

175 steps on my custom OpenCLIP ViT-H/14

#

600 can do brightness now, quicker than 2.1-v picking up the new noise schedule

cyan snow
kind quartz
wanton nebula
#

it's like being right there

oak osprey
#

try to get him to show up in "my cousin larry's dirty room"

cunning geode
#

berliners unite

oak osprey
#

2.0 fine-tuning is fun

#

i know just how to treat it NPCDryadCuddle

#

and somehow its results are more creative

#

it needs a run with the 100k images of hands dataset through the unet, it seems. the text encoder already had that done, but the unet doesn't seem like it has

steady skiff
#

Predator 2023 The Movie 🎬

mild tusk
#

question what do restore faces option mean? it just fix the face?

smoky oak
smoky oak
dense tapir
#

Yeah, the lack of TC is a killer

#

ultimate upscale I can use though with real

#

sadly, 2.x has no upscale via controlnet

#

we lack style, tile, and there is one more I would have loved to have. The one you colour in things and can change just it. Forgot its name now.

#

@smoky oak Well, my new 1300W psu will be here on Saturday so that is everything now. New CPU, ram, SSD, case, and PSU which leaves saving for a new card.

cunning geode
dense tapir
#

I am lucky as my case went up yesterday 21 dollars which, as I reviewed it, makes it not worth it. Buy one of the name brand ones. At 89 it was a steal

smoky oak
dense tapir
#

3070 can handle all that

smoky oak
#

Well in that case, I can only assume the model is not playing well with upscaling, cause you have every ingredient

cunning geode
#

nah lol this was deliberate

dense tapir
#

Did you see how the reviewers are ripping into that 4060? I swear it is the worst so far.

cunning geode
#

this is what someone did

#

this is me using exact same settings

#

for some reason it comes out worse for me

#

only differences are him using vlads and not a1111, and using deliberate v1 instead of v2

#

otherwise literally same

#

i dont get it

dense tapir
#

A 3060ti is faster by leaps and bounds over the 4060 and the 3060 is slightly slower but has 12gb of ram

smoky oak
#

Difference between V1 and V2 are huge, and they likely inpainted a lot

dense tapir
#

Jayz, lol "The RTX 4060 is the strongest argument to buy AMD"

smoky oak
#

Honestly tho lol

dense tapir
#

The 4060 is a 4050 and 4060ti is a 4050ti. smh

#

As almost all reviewers have said the only card worth a damn this gen is the flagship 4090.

#

4080 is good just overpriced to the point of making it bad value so is classified as junk

#

the rest is just buy last gen or wait for next gen and hope

#

my fear is that next gen they are going to be sneaky with their neural compression and reduce the 5090 in vram to get people to buy the pro cards when it should all be going the other direction not less.

#

I already see Jensen warming up his cut down baseball bat getting ready to knee cap stuff.

cunning geode
smoky oak
cunning geode
smoky oak
#

Ok, then IDK, seems like you know every single thing that isn't wrong here

cunning geode
#

yeah i just dont know whats wrong

#

someone did 1 to 1 the same thing as me, and gets completely different and better results

#

its so weird

#

something must be borked with my install but i already reinstalled it

smoky oak
#

What are your starting ARGS for SD?

cunning geode
#

ill watch some youtube video, see if i did something wrong during the installation process

cunning geode
smoky oak
#

Hmm... Why medVRAM? 8GB is more than enough for full

#

That is a huge performance hit

dense tapir
#

yeah, med is 6 and low is 4

smoky oak
#

I ran full on my 3060ti with 8GB, and I never had any problems

cunning geode
dense tapir
#

Same deal as I watch but no way would I train on 8 and watch anything

smoky oak
#

Still, I watched YouTube videos all the side and had no problems

#

Oh wait, I run my Videos on iGPU

dense tapir
#

see

smoky oak
#

Have you ever tried that? @cunning geode

I offload all of my non high performance things to my iGPU to save performance on AI

dense tapir
#

watching YT is fine even on my 6 but turn off the browser's hardware accel or POOF goes the vram

oak osprey
#

taming 2.0 🙂

smoky oak
#

I find it so shockingly interesting how you point out that 2.0 is better than 2.1, cause damn, the results in DM's are a hard agree lmao @oak osprey

oak osprey
#

some prompts it does shit like that but they're surprisingly few

#

and also i am as surprised as you

smoky oak
#

It really does look like 2.0 was much better trained, then they slapped 1.5 million steps of shit on top and killed it

dense tapir
#

2.0 trains easier than 2.1 too. 2.1 gave us v_parameterization.

oak osprey
#

well GA is somewhat right in that they were fighting OpenCLIP's cursed quanta

#

but it just took idk 2 weeks of fine-tuning on a single 80G GPU for that to be fixed up pretty well

#

it looks like i can do another 2 weeks of it on a new photo subset

mild tusk
#

"batman i got myself a pokemon"

dense tapir
#

TheLastBen had issues training 2.1 for the longest time on colab because, as he said, they broke shit.

oak osprey
#

Emad was here briefly, asked him why no fine-tuning of OpenCLIP or CLIP is done, no answer. i never get an answer on that

smoky oak
#

2.0 and 2.1 bolts seem like unfortunate byproducts of a company reaching too far in a field that was not properly understood at the time.

That feels like they're not making the same mistake with SDXL, and instead are taking their time to figure shit out

#

*both

oak osprey
dense tapir
#

Not going to give them any credit for anything I need to see results not hype so I am waiting with hope.

oak osprey
#

@smoky oak yeah i'm glad i went on this quest because i understand our progress and what makes it stall out, much better now

smoky oak
#

2.0 has v-ptediction, and an epsilon model, apparently

dense tapir
#

maybe just know 2.1 has to have it on when training

oak osprey
#

2.1 also has an epsilon model that breaks when you enable v-prediction

dense tapir
#

2.0 also has the 512 model while 2.1 doesn't

oak osprey
#

2.1-base and 2.1-v are somewhat related but technically different models, as 2.1-v was fine-tuned from 2.0-v which is fine-tuned from 2.0-base

but 2.1-base is fine-tuned from 2.0-base

smoky oak
#

I genuinely think a lot of the fuckery of 2.1 was from the inconsistent training at 512 then 768, on top of them training it poorly with a frozen text encoder

oak osprey
#

2.1-v is so much trash because it has been over-trained upon over-trained

dense tapir
#

We agree

oak osprey
#

@smoky oak most of the training was just 256x256

#

and the 2048x2048 was stupidly cropped to 512x512 randomly

dense tapir
#

all or none not mostly 512 then let's throw 768 on top of it.

smoky oak
#

Pseudo has been showing me some incredible 2.1 fixes, now that he is correcting their fuckups, and it looks amazing

oak osprey
#

they trained about 550k steps on 256 and then 800k steps on 512 and then another 800k or so on 768, and 1.5 fucking million steps on 2048x2048 crops to make the x4 upscaler

#

burn baby, burn

#

disco inferno

dense tapir
#

Well, hopefully he has fixed giraffe neck syndrome that affects even animals.

oak osprey
#

he did

dense tapir
#

that was the first symptom of 768 ontop of 512

oak osprey
#

they needed to evenly split batches up between all of the aspect ratios, is the real issue

#

i don't know if he's meaning that giraffe neck happens in square images, too

#

i don't think ive noticed that

smoky oak