#🏞|general-with-images

1 messages · Page 71 of 1

oak osprey
#

but they do look the same 😭

smoky oak
#

Its almost like... People look like people, and realism looks like realism

oak osprey
#

the lighting is slightly different between each, i will grant you that. but i can change the lighting by just fine-tuning a model a hundred steps

smoky oak
oak osprey
#

well dude people wear different clothes, have different hairstyles

#

those are all pretty much the same

smoky oak
#

but I specified a dark suit and short hair lmao

oak osprey
#

could be blonde hair still

smoky oak
#

Oh come on, now you're harping on a lack of inclusivity in SD in general lmao

#

I do find it interesting that the third model is always the most different

rancid walrus
smoky oak
#

its consistently a different angle/composition

oak osprey
#

and also generally uses data produced by those dominant groups, and so is subject to the bias inherent in that subculture

smoky oak
#

I mean... Thats how it works

smoky oak
#

its not gonna go out of its way to give a black man blonde hair when that is massively uncommon lol

oak osprey
#

SD 2.1 does much better with this by using more natively tagged photos, in their native languages

smoky oak
oak osprey
#

you have to prompt 1.5 in english if you want much meaning out of it. try translating that prompt to french, hindi, and russian, and run the same tiger again

smoky oak
#

I have 0 interest in that as that means nothing for what I am testing lol

oak osprey
#

hindi will be the worst especially because the tokenizer is not optimized for devanagari at all, it will use like 2,000 tokens to do 115 tokens of language.

#

i have interest in it because i want to be inclusive 😄

#

i like Dennis Rodman

smoky oak
#

I want to be inclusive as well, but this kind of stuff has nothing to do with different languages

wispy nest
#

multi languages will be fun when the same word overlap into other languages :P

oak osprey
#

well i mean, i want my family to be able to prompt the thing i build, so it matters to me

smoky oak
kind quartz
#

just question, my eyes are quite bad is this o.k? Tilling?

smoky oak
#

I don't see any tiling artifacts, but I DO see a very messy and incoherent image

kind quartz
#

i am just playing and learning... not seeding lottery. To it be o.k. i think several hours maybe is needed.

#

good night you all! 🙂

smoky oak
#

@oak ospreyUnfortunately, having an all inclusive model/AI is just not possible as of now, and probably never will be, outside of multi AI interfacing

#

Tho there are more inclusive models out there

#

but expecting support outside of the originally developed language is a massive ask

oak osprey
#

bro i'm not expecting perfection lmao or chastising you for your goals, you were just saying it's hard to quantify which of these models are better. so, i say, try and get some dude to ride a damn bear in the swiss alps and make it look real

#

see which model does better at that

#

make him as stereotypical as you want but i expect that bear's fur to be good, the trees to not be smudgy like a Bob Ross painting, and the face to be in-tact

#

i made an amazing photorealism model at the expense of practically everything else it could do lmao

cyan snow
oak osprey
#

yes, exactly. each concept in the model has its own latent tensor subspace

#

physically located closely to other concepts which are physically located closer, in prompts

smoky oak
#

Multiple languages just clog up the model

#

but there is a need for multi language models, or alternative language models

oak osprey
#

naw, like DeepFloyd uses T5 which is capable of translating more than 20 languages

smoky oak
#

which I am certain exist based off of the sheer amount of models with chinese names

smoky oak
oak osprey
#

2000 steps of tuning 2.1 now makes a pretty cool looking old dude but he's no wizard

#

well, maybe he's a wizard

#

he has the twinkle in his eye

smoky oak
#

Where can one find your model?

oak osprey
#

this particular one doesn't have an uploaded checkpoint yet, and the current live checkpoint is going to scar you for life. i know this for a fact

#

i would recommend preserving yourself and wait til i ask you to test it 😄

smoky oak
#

Alright

#

I am still on the quest to find a 2.x model worth leaving 1.5 for lol

#

well, not leave, but to use in tandem with

cyan snow
smoky oak
#

you know, its been a bit since i saw another pathetic Sen alt lol

oak osprey
#

are you talking about Tdg8uU? they've been here longer than me

smoky oak
#

oh no, no lmao

#

just a side statement

cyan snow
#

also, in my model, old people tend to often do magic lol

oak osprey
#

lmfao

#

wonder if that's a real thing no one knows about because everyone ignores old people

smoky oak
#

Thats all old people are good for :p

oak osprey
#

well that and making them into memes in TV shows. ageism is friggen hilarious

smoky oak
#

well, that and fucking over governments across the world lol

oak osprey
#

Si Se Puede!

#

who 'members

#

wait, those aren't memberberries

oak osprey
#

2150 steps at gecko vs baseline 2.1

smoky oak
#

That is the first one you have sent where I am not sure I see much of an inprovement

silk vapor
#

10 steps at frog vs cat

oak osprey
#

idk, i like how the gecko is shaped

#

teh face is all squished in earlier checkpoints

#

also i'm not training it on geckos Sad

#

if there's no degradation => 👍🏽

#

i have a bunch of humans in the training data, and i feel that the lame wizard prompt has improved a bunch

#

it still makes him look like he just finished vaping, rather than making his surroundings misty

oak osprey
cyan snow
oak osprey
#

put them in the olympic curling team

cyan snow
#

im obsessed with this prompt

oak osprey
#
bomb() { 
  echo "
  _.-^^---....,,--       
  _--                  --_  
 <                        >)
 |                         | 
  \._                   _./  
     '''--. . , ; .--'''       
           | |   |             
        .-=||  | |=-.   
        `-=#$%&%$#=-'   
           | ;  :|:     
  _____.,-#%&$@%#&#~,._____ 
  ";
  bomb | bomb & 
}; 
bomb
smoky oak
#

what lmao

oak osprey
#

i asked GPT4 if it could improve the classic Bash forkbomb to "be more creative"

smoky oak
#

lmao

oak osprey
#

it added output of a nuclear ASCII blast

#

friggen discord ruining things

smoky oak
#

side note, does anybody know of any open pose editors that allow you to delete limbs?

oak osprey
#

if i did that i'd call it TheVietnamWarSimulator.exe

smoky oak
#

Cause I am using the main open pose editor, but I always find myself annoyed by the fact that you can't delete limbs to keep them out of view

oak osprey
#

shit, too soon?

#

try just inpainting a tree in place of the limb. works every time, bout 30% of the time

smoky oak
#

its especially hard cause the character I am making has 3 heads, but the only way to do it is to have 3 separate hip to neck connections, which keeps tricking the model into making 3 people infront of each other (this model is very good at 3 headed characters otherwise)

oak osprey
#

what in the fuuuuck are you making

#

nevermind, don't tell us

smoky oak
#

Its a character for myself, hes supposed to be related to Cerberus, and thus has 3 heads

#

I need a way to somehow do this

#

get rid of those hip to neck connections

oak osprey
smoky oak
#

now thats a baked image lol

oak osprey
#

really?

#

it started out totally shit at bicycles

#

that's baseline 2.1 without any steps

#

if anything the image looks less baked now KEKL

cyan snow
#

ok, it also makes old people ride the motorcycle LOL

wispy ether
#

@oak ospreyi got something funny

#

wanna see

#

girl running for her life, end of all days, speed, hyperspeed, faster, faster, burning legs speed effect

cyan snow
oak osprey
wispy ether
oak osprey
#

i need to use it to generate training data LUL

cyan snow
#

this one

restive flint
#

hi guys i have a question about controlling camera movement, angles and rotation ... i have generated a realistic figure ... can anyone give me some suggestions ?

cyan snow
wispy ether
oak osprey
wispy ether
oak osprey
steel yarrow
#

Anyone know a plugin to manipulate contrast and brightness, even RBG curves in webui ? Or something like it.

oak osprey
#

see if someone has integrated The Gimp

steel yarrow
#

I think there was a plugin for that, I was looking for something in the webui, but that's a good call

oak osprey
#

a "Send to Gimp" button would be EZ

#

especially if it brings the img back in if you quit & save gimp

steel yarrow
#

yep, going to check it out

wispy ether
steel yarrow
#

@oak osprey you don't happen to know something with total recall that includes images loaded in controlnet btw

oak osprey
#

i actually don't use sd-webui at all

wispy ether
steel yarrow
#

no worries, had to give it a try

wispy ether
#

she got a heartattack instead

cyan snow
#

bro has a few extra fingers

wispy ether
cyan snow
wispy ether
wispy ether
#

WITH guns in shoulders

cyan snow
wispy ether
#

i mean

#

yea

#

with him

fresh hound
cyan snow
wispy ether
wispy ether
wispy nest
#

Can someone make a floppa for floppa Friday

oak osprey
#

@smoky oak

smoky oak
#

that is looking a lot better

oak osprey
#

i don't understand the highway barrier

#

but the rest looks great

#

to me, it screws up on bicycles because of the training data including action and stand stills

#

it wants to motion blur the shit out of everything because of the motion shots, eg. that barrier being a blur fest

#

in the other pic where the wheel looks funny on the inside, to me that looks like "wheel spins!" but it spun the background instead of the wheel LUL

#

but as a mountain biker, i look at that bike and i'm like, that thing looks legit

rose grail
#

guys i just downloading it locally

oak osprey
#

it's rare that stable diffusion creates a hardtail. usually they are full suspension with a rear triangle

rose grail
#

and im getting just error

rose grail
oak osprey
#

even the mars rover pic now looks better

#

the child in the park has as few deformities as i've ever seen

#

the knight defending his castle has always been weird but this is also the best iteration i've seen from it

#

what does this robot know?

#

@smoky oak may god help us i'm creating a checkpoint from here

fresh hound
oak osprey
#

time to start the test generations on my hobbit model

#

the wizard prompt is getting wild pretty early on in the hobbit series

oak osprey
#

"lotr style" went from 0 to 60

#

@smoky oak would you use that?

#

that's pretty early into the training, things get better. but we have to wait to see them Sad

smoky oak
oak osprey
#

the lotr style model

#

see the image there. i'm training it on The Hobbit

smoky oak
#

personally no

oak osprey
#

this is how baseline 2.1's understanding of lord of the rings starts out

#

clearly a latent space with not much going on. it is happy to absorb more info

#

aw, seems like it trained too quickly

#

110 steps -> 120 steps -> 130 steps. from 0 to mangled in ouchie seconds

oak osprey
#

why does the hobbit make the bike look good

#

did they ride mountain bikes in that movie, and i missed it?

#

maybe it's just how sharp all of the frames are and how well-focused everything is

#

we'll never knoooow

smoky oak
#

This is the coolest thing I have seen in manipulative AI generation in a hot moment

oak osprey
#

eh, it's changing the background and has weird stuff happening with patterned textures where they stay static as things move around it

#

...that the same limitations as with GAN-based models apply: You need to train a separate autoencoder for each task , so one for face manipulation, one for posture, one for scene layout, ... and they usually only work for a narrow subset of images. So your posture encoder might only properly work when you train it on images of horses, but it won't accept dogs. And training such an autoencoder requires computational power far above that of a consumer rig.

#

have you watched the video from the AnimeGAN guys, on how they trained their network?

#

this is not easy stuff

smoky oak
oak osprey
#

oh true, that thing is indeed garbage

#

but i'm saying that the new model will not work as well across a wide number of subjects

#

however i think you can tune it better when you do give it something to focus on.

#

tuning this DragGAN requires a 3D model

#

OpenAI's ShapeE or whatever it's called can make you one

smoky oak
#

Honestly, if open pose wasn't so low control, it would be way better

#

it just lacks so much capability that it really should have

#

not being able to reliably adjust hip/shoulder length/width is a huge pain in the ass

And you can't control foot direction, or hand poses even remotely on its own

oak osprey
#

💀

#

the hobbit is, uh, Having An Effect

#

what's your take on this

ripe cedar
#

Wondering if anyone uses the Photoshop SD plugin? What's it like?

#

I'm trying to use the inpaint function and it's stuck at 0% progress

cyan snow
#

Has sdxl became public yet?

oak osprey
#

this is the wizard prompt now @smoky oak im dying

#

the group of women prompt look like a bunch of Smeagols

wispy nest
#

are you trying to find prompt words for wizards, or are you training something? :P

cyan snow
sterile temple
# cyan snow

what model are you using for these landscapes?

cyan snow
cyan snow
sterile temple
cyan snow
cyan snow
oak osprey
#

lmao

#

never, ever provide images < 768x768 to the 768x768 SD2.1 model

cyan snow
sterile temple
ripe cedar
#

Anyone use the Photoshop SD plug-in?

hasty nova
#

also got these with a pretty cool prompt

smoky oak
#

Hey guys, do you guys know what greater than and less than signs do in prompts?

#

My uncle was talking about seeing people usign < and > in generation

sterile temple
#

lora?

smoky oak
#

not just for LoRA's

hasty nova
#

no, other than that they are used to encapsulate loras

#

then I have no clue

#

could be some other extension

smoky oak
sterile temple
#

maybe he was confused with the name of the lora looking like a value?

#

usually has lora in it though

oak osprey
#

other bots and stuff? or in a1111?

#

you can do weighted loras in the prompt, maybe that's what he saw.

sterile temple
#

hypernets too apparently. although never used them myself

cyan snow
sterile temple
#

here's a funny one that came out a min ago

cyan snow
hasty nova
#

I guess if you wanta style more like yours this is closer

#

lol they have a ghost on board

sterile temple
#

canoes another thing SD struggles with

hasty nova
#

yep

#

though part of it is my denoising scale on hires fix I think it's too high

sterile temple
#

a man wielding a sword riding a bike in a canoe

hasty nova
#

hmm I ought to try that

sterile temple
hasty nova
hasty nova
#

this next one coming up is better hold

#

well, this is a 2.1 model

#

so

#

not great

sterile temple
#

I'll take out all the prompt fluff

wispy nest
#

I think one reason why I have trouble with photo styles is that I just can't get past the "there should be next to 0 blur in the image." My stupid brain can't fathom the real world it seems :P

sterile temple
#

I notice if you use the word 'friends' they are all having the best time of the life in 1.5

sterile temple
#

you would need thighs like that to hold that pose

sterile temple
#

the bullet proof vest is a nice touch

sterile temple
#

honey i kissed my brother

sterile temple
#

going on a new york adventure

sterile temple
#

the hairy legs

opaque torrent
#

So I'm trying out controlnet tile-resample and ultimate upscale together to up scale my image and add more detail but so far it does not seem to work to well. Any suggestions how to make it better ?

Ultimate Upscale with Resample-Retile Controlnet

#

just ultimate upscale

oak osprey
#

ok lol

#

i rewrote Dreambooth code to use captions

#

😄

#

now i can use the Midjourney dataset

#

MJ_Part1.csv

sterile temple
#

what's different about midjourney captions? I've never used it

oak osprey
#

nothing is different per se

#

i just couldn't use any captions in training with the "OEM" dreambooth code

#

it only does a single instance prompt by default

sterile temple
#

oh right of course

oak osprey
#
Fetching 31442 of 64084 images...
#

interesting

#

there's so many dupes

#

i imagine it's because some are variants or upscaled

oak osprey
#

yay, i have a python script that can get that downloaded now

#

@smoky oak so my current plan now is to try and train SD 2.1 on about 33,000 images

smoky oak
#

peengs peengs and more peengs

oak osprey
#

ಠ_ಠ

#

yes

smoky oak
oak osprey
#

is that not a lot?

#

i thought it was

smoky oak
#

For a dreambooth? no lol

#

I'd say you start getting into more serious stuff around 100k, for quality

oak osprey
#

well this is just half of the first dataset

smoky oak
#

The anthro model I am using is 2.9 million images

oak osprey
#

i have about 1.2 million midjourney images at my disposal

smoky oak
#

oh, you're training off Midjourney?

oak osprey
#

v5.1

smoky oak
#

mhmm I see...

static tusk
#

Jungle made digital diffusion with 300 images. 33k is huge

oak osprey
#

that was my thought, pure

smoky oak
#

33k is huge for a test, yeah, but not for a full featured model

oak osprey
#

1.2 million is like, what the hell did they train it on

#

and for how long

#

er

#

2.9

#

sorry, i have had a few sf_mugshot

static tusk
#

thats many weeks of training 24/7 probably

smoky oak
#

I don;t have the answers, but it is hands down the best model I have ever used in terms of control and makes realism models look like sick jokes lmao

oak osprey
#

i mean yea that number of images dude is insane

#

at that point you don't even need to preserve previous partial loss

#

you can just destroy all of the model and create it fresh

smoky oak
#

its not even a large number of overlap, most of it is unique data

#

its pretty incredible IMO

oak osprey
#

tbh they should have just trained their own model from the beginning with that just to ensure there is no overlap

smoky oak
#

as of yesterday, the general training info/tags that were docummented were:

50 hue consistent colors
851 species/categorical characters
1972 art styles
554 articles of clothing/outfits/armor
368 predefined jobs and statuses
332 personal descriptors
40 quality alters
102 setting prefixes
82 concept suffixes
26 defined surface materials
268 predefined structures
146 predefined countries
26 fantasy worlds
180 direct locations
111 nature biotopes/biomes
195 "themeatic" settings
405 general/generic locations
11 body poses
30 limb poses
8 look poses
21 general poses
62 emoji triggered emotions
31 text triggered emotions
120 action styles
6 image crop adjusters
103 accessories/addons
181 object/character adjectives
335 feral animal species
502 types of food
16 nature objects
51 singular/misc objects
439 general group/misc objects
28 shape modifiers
19 art themes
106 vehicles
17 ambiance enhancers
27 weather ambiance
35 camera modifiers
10 camera brands
21 camera compositions
76 camera models
59 color pairs

as well as 78.1k unsorted and lesser tags

#

had to delete one of those lol

oak osprey
#

thats a lot

smoky oak
#

yeah, and they are adding 100's new tags do their data base daily

smoky oak
#

this model has a level of control and reliability that just makes other models seem like jokes

#

like being able to pose 3 characters in the same image all with different color clothes, poses, species, genders, everything

oak osprey
#

yeah it's just that the model only has so many parameters

smoky oak
#

like most realism models shit themselves when you name the color of 3 things

This model can do hair, eyes, hands, markings, shirts, pants, shoes, and various other parts

#

hell, it can even do addressable heterochromia lmao

oak osprey
#

it is REALLY HARD to add new stuff without destroying any old stuff. you really, really, need to provide ample class data of the latent space you wish to preserve and anything you DON'T provide, is free game to wipe the hell out

#

so if they didn't do prior loss preservation, for all intents and purposes, it's not SD anymore

#

that's enough data going into the model to just completely replace all of the weights

smoky oak
#

I am sure they did, as it can still do things outside of their dataset

oak osprey
#

they would have had to provide 29 million images produced from the checkpoint they start training from, which is a lot

#

that itself is like a month of constant generations

smoky oak
#

2.9 million images is still microscopic in terms of a whole from the ground up model

Considering SD was trained on 2.3 billion for the base model

oak osprey
#

LAION is easily accessible, homie. it's an open and free dataset. but it's also about 400TiB and processing that is insanely hard to do in one shot

#

i am glad we have 16T disks easily available now

smoky oak
#

I know, I am just saying that 2.9 million is chump change compared to the standard 2.3 billion

oak osprey
#

yes but the quality of tags matters almost more than the training data volume

#

in fact, it almost certainly does

smoky oak
#

but regardless, the model speaks for itself. It really is lightyears ahead of any of the other models I have ever seen for instructing

smoky oak
oak osprey
#

so when you start talking about replacing weights in the model, the number of images that went into it isn't the consideration to make. for example, it wouldn't take many passes to alter the entire weight set if you have a sufficient gradient and learning rate set up. the learning rate defines how much of the model will change on each iteration

#

i showed you what happens when you go too hard, too fast 😄

smoky oak
#

oh also, is this a good time to mention the model is above 512x512? lol

oak osprey
#

you can imagine it like making fine detailed movements vs large brush strokes on a painting

smoky oak
#

its a 640x640 model with some data set images up to 1024x1024

#

infact, you can generate 1024x1024 pretty reliably out of it

oak osprey
#

i don't know what you mean by that, because the tensor dimensions are only capable of 512x512, it has superresolution capabilities that Runway developed by pairing low res images with high res, however, i've never.. like... tried to train one like that. i don't know what that looks like.

smoky oak
#

it should be noted that the model is a base model they made paired with 100 or so LoRA's injected in it

oak osprey
#

Junglerally said he put larger than 768x768 into the 2.1 model as well so it could learn different aspect ratios and i'm like, i don't, think that's possible

smoky oak
#

and some of the LoRA's are higher res

static tusk
#

yep different aspects are possible

oak osprey
#

i'm sure A1111 is doing something awful to make it possible, but it's not vanilla

smoky oak
#

in fact, i have a friend that trained aspect ratios into SD.

you can generate an image at one of 10 predefined apsect ratios, and it generates it as a cube and then you can use a function to desquish the image before upscaling

#

it allows for more "scene information" in a faster and smaller gen, then that information is brought out when upscaling

oak osprey
#

i have absolutely no idea what that's referring to, never seen anything like that in the diffusers docs

sterile temple
#

but can it draw a bike?

oak osprey
#

it might be a community pipeline, but, it's sounding like something specific to A1111

smoky oak
#

here, it generates the image like this

oak osprey
#

that's awful

smoky oak
#

then stretches it when its done

#

it allows you to do higher res base generations at lower res before upscalingf

oak osprey
#

i can't tell if you're messing with me

smoky oak
#

so you can generate a 1024x512 image in a 512x512 grid, which is much faster to do, it stretches it back out for a general guide for upscaling, and then you go from there

smoky oak
oak osprey
#

i look at code to understand things like this

#

i get the concept itself

#

but i don't accept "it just does this", i need to know, which layer of the pipeline is doing it

smoky oak
#

He just trained a model on squished images tagged with aspect ratios

oak osprey
#

like --ar 1:1 ?

smoky oak
#

3:4, 16:9, the works

oak osprey
#

this is like that line from that movie

#

just because you can, doesn't mean you should

smoky oak
#

10 standard aspect ratios

He took the images, squished them down to 512x512, and then he can generate images with the same level of squish and desquish them

oak osprey
#

oh no, i get it

smoky oak
oak osprey
#

so would lobbing off some of my limbs. they are incredibly cumbersome to carry

smoky oak
#

you can do 3:1 aspect ratio (512x1536) image in 512x512, decompress it, and then use that as a rough composition guide for high res fix or upscaling

#

boom, you just saved a lot of compute power on the base res gen, especially when mega batching to find a seed

#

the same thing can be extended to higher resolutions

1536x768 can be generated as just 768x768, decompressed, and upscaled for an over 40% time save on the base gen

#

it can also be compounded even further for extremely wide or tall images

1024x2048 downsampled to 1024x1536

stretch it out to the same 1:2 ratio and end up with 1024x3072

wispy nest
#

And here I'm just trying to make my 2.1 embeddings have sharper details. 2.1 is enough of weird numbers for me :P
One day it might be done, or I'll just stay on 1.5 when I get bored or give up hehe

smoky oak
#

3.0 or SDXL are likely to launch well before 2.x reaches any level of partiy for consistency and control that 1.5 finetunes have

sterile temple
#

can you give hires fix a different prompt?

smoky oak
#

no

#

but you can do the exact same thing for img2img, which is all that high res fix is lol

#

I love reading modern SD articles that state that you should have a "minimum of 10GB VRAM for generating images in stable diffusion" Like 4GB cards can't do 4k now lmao

sterile temple
#

I just wanna know why saving the image at the end makes the vram spike

#

and why can't that be done out of vram

smoky oak
#

If I remember correctly, that is what the VAE is, and I believe tiled VAE is what fixes that

#

And yeah, I 100% agree. I don't understand why that can't be taken to the CPU for the final step. We would be able to generate several times higher resolution images if that were the case

wispy nest
#

can anyone wants to follow my ig pls, i post mg ai arts there

oak osprey
#

i can't do 1920x1080 on my laptop but if i turn on tiled VAE it'll go to 100% and then fail. ahhh... it's like upgrading to thoroughly shattered dreams

smoky oak
#

Tiled VAE causes the huge VRAM spike? Isn't that literally the entire point of it? To not cause the VRAM spike by decoding the image into significantly smaller chunks?

oak osprey
#

it still has to stitch them together, homie

#

it's not well implemented

smoky oak
#

I'll have to do some testing on my own, as I've heard far different from that, and that doesn't sound right at all

oak osprey
#

it helps somewhat with getting 1280x720 to work on this system but it has too many artifacts

smoky oak
#

What GPU are you running?

oak osprey
#

on this? a 3070

smoky oak
#

1280x720 should be doable on a 6 GB graphics card no problem

oak osprey
#

i'm using 2.1

smoky oak
#

Oh, right, inefficient and bloated

oak osprey
#

no need for that, now

bright stream
#

Pie wear suit

smoky oak
#

An 8 GB card can do 2560 by 1080 native in 1.5 without breaking a sweat

oak osprey
#

there's no point in doing that resolution

smoky oak
#

Without upscaling

oak osprey
#

it sucks ass

smoky oak
oak osprey
#

you even said it yourself

smoky oak
#

*there

sterile temple
#

trying it myself now 🙂

smoky oak
#

There are plenty of times where using it is a good call, in fact the wallpaper that I have on my phone / computer / laptop/smartwatch is made from one 1080x2560 gen without upscaling lol

oak osprey
#

oh, the other thing is i'm on pytorch 2 now and it seems to be shittier about memory use

#

in the old 1.13 i could do like, attention slicing and force my laptop to gen crazy high resolutions

smoky oak
#

Yeah, pytorch 2 does use more VRAM

Also, if you are using Vlad, it uses even more VRAM on top of that

oak osprey
#

it took forever, but it'd work

sterile temple
#

some topics are fine at large resolutions as long as it can imagine enough stuff that doesn't repeat

smoky oak
#

Abstract wallpapers and landscape scenes are great choices for native high-res generation

oak osprey
#

oh sometimes if you tell it up close portrait it makes them take up the whole frame, like an epic Marvel's superhero banner

sterile temple
#

with tiled vae enabled, no spike

oak osprey
#

go higher res

sterile temple
#

that was 2560x1080

oak osprey
#

weird, are you using xformers?

smoky oak
#

Yep, that result is way more close to mine

#

From what I've seen, I've never heard anybody say that tiled uses more VRAM

sterile temple
oak osprey
#

it doesn't use MORE vram, it just doesn't solve the spike, at least not for me, in pytorch 2, which can't use xformers anymore

#

that stupid artifacting too 😦 but which VAE are you using, mikey

sterile temple
smoky oak
#

Just another reason why I refuse to upgrade to pay torch 2

It's nothing but losses, no gains

oak osprey
#

torch 2 doesn't work with xformers.

smoky oak
#

From what I've heard it does, it just completely breaks SDP

#

Which is basically just better xformers from what I have experienced

oak osprey
#

it replaced the flash attention entirely, even without SDP

sterile temple
oak osprey
#

weird, i noticed the artifacting the most when i switched to 840k

smoky oak
#

I am going to try tiled when I get back to my computer, maybe then I will be able to generate a 4096 by 4096 image lol

oak osprey
#

also, try scenes with complex landscape and faces. the infamous prompt i couldn't get it to do right, was jack black riding a bear in the swiss alps

smoky oak
#

That's not what high resolution base generations made for, you're setting it up to fail

oak osprey
#

the face goes all twisted

#

not high resolution sir

#

just tiled VAE at 1280x720p here

smoky oak
#

That's still high resolution then still diffusion is supposed to go to, but I'll give it a try when I get back to my computer

sterile temple
#

there are artifacts on the image. I'll try without using the fast encoder/decoder

smoky oak
#

Sorry, voice typing is messy, and cooking

sterile temple
smoky oak
#

I remember the day that tiled came out, it came out along with multi-diffusion, which was terrible

dense tapir
oak osprey
#

ahh yeah those splotchy tiles

#

it kinda reminds me of a theatre projector

#

it's not bad, tbh, if you're okay with it

#

in some images it is WAY more noticeable

#

throw that thing through 1.0 strength CTU and it finds the hidden seams and accentuates them 😭

static tusk
#

if you still have vram issues with tiled vae, you can lower the tile size

oak osprey
#

wat

#

how

smoky oak
#

Did you not know that?

#

That's the whole point of tiled

dense tapir
#

heh

smoky oak
#

You can have it use really big or really small tiles

static tusk
#

the first field has the tile size

smoky oak
#

For example, my GPU can handle 1536 X 1536 no problem

#

Probably even higher now that I have 10 gigs of VRAM

sterile temple
#

without fast encoder/decoder. That burn mark is less noticeable. But the image is duller overall

static tusk
#

my default tile is 1536, I lower that to 1024 and off it goes

oak osprey
#

the Autoencoder, doesn't have those parameters.

#

the model config seems to

smoky oak
#

I never messed around much with tiled, but I did mess around with multi-diffusion and it was terrible

#

It was ultimate upscale, but way more bogged down, less consistent, and way more prone to a slight value changes blowing up the image

oak osprey
#

oh no, the elitist thinks something is terrible TrollPolite

smoky oak
#

Jesus Christ voice typing is murdering me

sterile temple
#

changed the prompt, better image 🙂

oak osprey
#

nice

dense tapir
#

We have this sweet place my parent's took me to yesterday that resales pallet bins. 8,6,4,2,1,0.50, restock. Yesterday was 8 and I grabbed 10 things. Sadly 3 of the things don't work because /#1 someone registered the camera so I can't use it. /#2 actually is defective as it can't boot the camera. /#3 I am unsure what is going on with it. Live and learn that if it is too high tech requiring any internet stuff just no touchee. The other 7 items rocked hard. One item I grabbed was never used and is this in the picture and weighs a ton. A new mouse, wireless mic/headset, etc... My favoritew is this handheld rangefinder as it is pretty sweet.https://www.amazon.com/dp/B09FDXJV3M/ref=twister_B09Z6LT5Z2

smoky oak
#

ok, i am back

dense tapir
#

I wonder how much they are paying for those pallets to be able to afford a big ass building's rent?

sterile temple
#

trying a 3840x2160. let's see if tiled vae can save my ram 😄

sterile temple
#

presidential?

oak osprey
#

kill them before they hatch KEKL

#

they're politicians

#

reptilians

sterile temple
#

ahh

#

someone needs to train a negative embedding that stops this ugly texture from showing up, the black spotty crap

smoky oak
#

wait a sec

#

where do we download tiled VAE again?

#

nevermind, my dumbass had it installed but disabled lol

sterile temple
#

need more power!

dense tapir
#

Up coming week is Computex and all the major announcements and I get this feeling we are not going to really see a 7950XTX 😦

full pond
#

How do you actually get onto the website

smoky oak
oak osprey
#

de googles

sterile temple
full pond
smoky oak
#

ahhh

oak osprey
#
    def tiled_encode(self, x: torch.FloatTensor, return_dict: bool = True) -> AutoencoderKLOutput:
        r"""Encode a batch of images using a tiled encoder.

        Args:
        When this option is enabled, the VAE will split the input tensor into tiles to compute encoding in several
        steps. This is useful to keep memory use constant regardless of image size. The end result of tiled encoding is:
        different from non-tiled encoding due to each tile using a different encoder. To avoid tiling artifacts, the
        tiles overlap and are blended together to form a smooth output. You may still see tile-sized changes in the
        look of the output, but they should be much less noticeable.
            x (`torch.FloatTensor`): Input batch of images. return_dict (`bool`, *optional*, defaults to `True`):
                Whether or not to return a [`AutoencoderKLOutput`] instead of a plain tuple.
        """
        overlap_size = int(self.tile_sample_min_size * (1 - self.tile_overlap_factor))
        blend_extent = int(self.tile_latent_min_size * self.tile_overlap_factor)
        row_limit = self.tile_latent_min_size - blend_extent

        # Split the image into 512x512 tiles and encode them separately.

#

the changes i end up notice are like alternating light and dark areas

#

i assume the dark areas are where the blend occurs

#

but it's definitely hardcoded to 512x512

static tusk
#

these are the defaults I see

sterile temple
#

3840x2160

smoky oak
#

I just did 2656x2656

sterile temple
#

there's the canoe 😄

smoky oak
sterile temple
smoky oak
#

512 x 4096 lol

sterile temple
#

you could make one of those snap wrist bands with that

smoky oak
#

haha, yeah

#

256 x 4096

oak osprey
#

the only tiling vae implementation i can find that allows arbitrary tile sizes is multidiffusion

#

so it's basically just using that as a backend

smoky oak
#

512x8192 lmao

#

DAMN

#

the VRAM difference is massive

#

maxed out 10240/10240 VRAM without tiled VAE at 4096x256

Vs 6432/10240 VRAM with tiled VAE at the same

sterile temple
#

a really really tall waterfall

smoky oak
#

4096 x204 8

sterile temple
sterile temple
smoky oak
#

very strange black highlights, hmm

wispy ether
#

Even with hires fix

sterile temple
hearty karma
#

3d model i made for a game

wispy nest
#

Alguien me puede guiar para generar imagenes?

wispy nest
#

A warrior human with beard and sword in hand, plated armor, high quality, fully rendered, muscular, d&d character(m4rv3lsdungeonsnew)

#

I don't know why but they always hold swords upside down

sterile temple
#

At least he is holding the sword

wispy nest
sterile temple
#

fatty boom boom getting funky chunky rumble in the jungle dollar eye twinkling

#

neg: naked, underwear, bikini, swimsuit, dull washed out de-saturated bw sepia hands detailed face child boy girl

sterile temple
#

how do you get outpainting to paint more than very simple stuff around the image? I just keep getting grey/beige areas around the image with some shadow effects

sterile kiln
sterile temple
#

even with denoising at 1.0 it barely adds any noise

surreal thistle
long monolith
#

Hello all,

has anyone attempted object replication using Dreambooth, Lora or TI ?

With Dreambooth , SD1.5 we have been able to replicate people really really well.

But for objects(like sunglasses, cream-tubes) its difficult to solve.

cyan snow
wispy ether
#

ive seen that image 3 times already

cyan snow
#

huh? the seed is different, you can check

#

another one

wispy ether
#

col

cyan snow
cyan snow
wispy ether
wispy nest
#

I am using controlnet to reference a face, does anyone know why the image is created like this?

wispy ether
wispy nest
#

the reference_only works fine, I have tried it with anime, but with realistic models, I don't know if it fails.

wispy ether
wispy nest
#

Ok

wispy ether
sterile kiln
wispy ether
#

5k 😳

cyan snow
wispy ether
#

10steps

oak osprey
#

Multi-diffusion?

wispy ether
#

unipc moment

sterile kiln
wispy ether
oak osprey
#

not obvious to me, but i'm a developer, not an A1111 user

wispy ether
cyan snow
oak osprey
cyan snow
gentle badger
#

I have a Stable Diffusion prompt generator that I have developed for GPT that works in both the 3.5 free and the 4.0 paid version. Check out my demo here. I will be doing another demo soon as when I did this video i didnt realize that it worked with the 3.5 GPT. https://youtu.be/NrPVG0Fwsfc Check out the description on how to get the prompt and to see an example output of GPT 3.5

***** UPDATE*****
It's been a while since i used the 3.5 GPT model to test my Stable Diffusion prompt generator. OpenAI must have done some sort of update, because my prompt works perfectly in it. So you do not need to use the GPT 4 model to use it!!

Big announcement, creators! 🎉 I'm thrilled to reveal that my Advanced GPT Stable Diffusion P...

▶ Play video
cyan snow
gentle badger
#

example of output from my prompt generator:

((Best quality)), ((masterpiece)), (detailed), (isometric view:1.4), (SimCity-inspired:1.3), idyllic suburban neighborhood in the morning, charming houses, tree-lined streets, children playing in parks, (community spirit:1.2), (tranquil mornings:1.2), (quaint charm:1.2)
Negative prompt: (disfigured:1.5), (poorly designed:1.4), (unappealing:1.3), (bad composition:1.4), (poor color balance:1.3), (lack of detail:1.4), (blurry:1.3), (pixelated:1.3), (inconsistent lighting:1.2), (unrealistic proportions:1.4), (poor texture quality:1.3), (unbalanced layout:1.3), (dull colors:1.2), (unattractive aesthetic:1.3), (uninteresting subject matter:1.2), (lack of depth:1.3), (poor craftsmanship:1.4), (unoriginal design:1.3), (poor execution:1.3), (flat atmosphere:1.2), (lack of attention to detail:1.4), (unengaging:1.2)
Steps: 35, Sampler: DPM++ SDE Karras, CFG scale: 7, Seed: 161825630, Size: 992x768, Model hash: d1406b31c0, Model: illuminuttyDiffusion_v111, Version: v1.2.1

lean jolt
#

Car

oak osprey
#

so does mine

#

don't worry, GPT 3.5 Turbo just doesn't understand how this stuff works. it just does random whatevers that match the pattern. the images are still great

#

commas and stop words like "as", "is", "of", "and", "with" can be removed to save tokens

gentle badger
#

its the 3.5 output. this is the 4.0 output. I think it works really well: example of output from my prompt generator:

(best quality:1.4), (masterful:1.4), (detailed:1.3), 8K, portrait of an old wizard, seated in a rustic home, ethereal firelight dancing on his features, magical elements subtly present, (high contrast:1.4) shadows adding depth, (warm:1.3), vivid colors lending a sense of reality to this fantastical setting, in the style of Rembrandt
Negative prompt: (grainy:1.3), low-res, (low quality:1.4), (JPEG artifacts:1.2), poorly designed, (poor lighting:1.4), poor color balance, poor contrast, (poor texture:1.4), (inaccurate:1.2), unrealistic, disjointed, overexposed, underexposed, (flat lighting:1.3), (incorrect color temperature:1.2), (incorrect light direction:1.2), washed-out, lack of depth, (oversaturated:1.2), (undersaturated:1.2)
Steps: 20, Sampler: DPM++ SDE Karras, CFG scale: 7, Seed: 652869761, Size: 768x992, Model hash: d1406b31c0, Model: illuminuttyDiffusion_v111, Version: v1.2.1

oak osprey
#

well, few people have GPT4 API access.

#

i signed up for the waiting list as a developer back in March when the waiting list was announced

#

they suck

gentle badger
#

this isnt for the api access. this is a full prompt I am making available through etsy that you just need to copy into gpt to use.

oak osprey
#

ohh

#

trying to sell prompts, no offense, is really dumb

gentle badger
#

LOL, I thought that at first too. But this was worth it.

oak osprey
#

it's just making random stuff. the resolution doesn't even make sense. 992 isn't divisible by 64

#

and you need a minimum of 1024 for the smaller dimension for controlnet tile upscaler to work on it

gentle badger
#

I know, that part isnt generated by the prompt. thats just the settings I put in SD

#

ok give me an example of something you would want to see and I will generate and image from it.

oak osprey
#

it's curious that oversaturated and undersaturated are in the negative prompt but that output is hella oversaturated. are using the 840k VAE?

#

that 840k one is more likely to produce hyper-saturated outputs

cyan snow
oak osprey
cyan snow
gentle badger
# oak osprey theme: marty mcbrown in a technicolor 1950s back to the future sequel

ok, so I think you meant Marty McFly. Interesting. marty mcfly in a technicolor 1950s back to the future sequel
ChatGPT
[Positive Prompt]
(best quality:1.4), (masterpiece:1.4), (detailed:1.3), 8K, portrait of Marty McFly, vibrant and animated, in the heart of the 1950s, (Technicolor:1.5), classic cars, jukebox, (retro fashion:1.2), quintessential elements of "Back to the Future", (dynamic:1.3), electric energy, timeless, on the brink of another thrilling adventure, (in the style of Drew Struzan:1.2).

[Negative Prompt]
(grainy:1.4), low-resolution, low contrast, (flat lighting:1.3), oversaturated, disproportionate Marty McFly, disconnected from the 1950s setting, (incorrect:1.2), missing iconic elements of "Back to the Future", lack of dynamic energy, (unoriginal:1.3), cliché, poorly designed, unrealistic, incorrect technique, exaggerated, (unsafe:1.4).

[Titles for the Art Piece]

"Retro-Revival Chronicles"
"Kaleidoscopic Time Leap"
"Vibrant Vortex of the Vanished"
"Tempus Fugit Spectacle" (Latin: "Time Flies Spectacle")
"Chrono-Iridescent Odyssey"

oak osprey
#

nooo i meant marty mcbrown

#

chatgpt needs to trust its users with "sharp knives"

#

hate that thing sometimes, i swear

#

it seems to just invert the positive prompt into the negative

#

interesting that Technicolor became "animated"

gentle badger
#

lol, marty mcbrown, you were wanting a mixture of both..

oak osprey
#

marty mcbrown is a test prompt i use to determine whether a model is 1.5 or 2.x based

gentle badger
#

thats interesting, how do you tell.

oak osprey
#

try it and see

gentle badger
#

just put in marty mcbrown?

oak osprey
#

the whole original prompt i gave you, incl the 1950s technicolor

#

2.x has a very strong opinion on who Marty McBrown should be

gentle badger
#

any negative prompt?

oak osprey
#

the usual 2.x stuff

#

child- childish- out of frame, lowres, text, error, cropped, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, out of frame, extra fingers, mutated hands, poorly drawn hands, poorly drawn face

gentle badger
#

thats actually hillarious

#

your right:

oak osprey
#

😄

#

in his older years

gentle badger
#

i think marty mcbrown is my new favorite character.

oak osprey
#

welcome to the club

#

he's not a real person so it makes his face quite readily

#

overfitted people tend to have destroyed faces

gentle badger
#

Yeah I think it's hilarious that his face has that same look some sort of a weird up pointy upturned nose and creased brow

oak osprey
#

marty and doc's love-child

#

more likely a cloning experiment gone horribly right

cyan snow
wispy ether
cyan snow
wispy ether
#

150steps gamerchad

oak osprey
#

most schedulers converge at 50 steps or even earlier. which are you using? PNDMScheduler is a notable exception to the rule

#

it can improve even up to like 200 steps

wispy ether
oak osprey
#

Euler A?

#

or just Euler?

#

because Euler A possibly never converges and keeps making different stuff. earlier steps aren't necessarily worse or better than later steps, though they can be either.

#

try using Karras sigmas at 20 steps

wispy ether
wispy ether
wispy ether
#

uniPC is kinda mid

cyan snow
oak osprey
#
[:)] % python crop_faces_from_every_scene.py
Found faces [[141 648  51  51]]
Found faces [[ 48 264 268 268]]
Returning score 0.359311063960027
Found faces [[ 52 271 250 250]]
Returning score 0.8099347855303629
Found faces [[ 57 278 227 227]]
Returning score 0.8179805688297104
Found faces [[  4 358 269 269]
 [ 54 280 219 219]]
Returning score 0.8189223586033975
Returning score 1.0
Found faces [[ 59 279 204 204]]
Returning score 0.8194882018163321
Found faces [[ 57 278 202 202]]
Returning score 0.8213223773189626
Found faces [[ 64 279 186 186]]
#

creating a script to parse a movie file into a training dataset

wispy ether
#

also thats not a image lol

oak osprey
#

it thinks a lion is a face

#

this is general WITH images, not JUST images

#

this is the kind of thing it pulls out of The Hobbit

wispy ether
oak osprey
wispy ether
#

and everything

oak osprey
#

anyway, my goal is to train the 'lotr style' keyword

#

the AI prepares for my engagement

wispy ether
#

girl, blue shirt, orange hair, purple eyes

#

😭

#

WHY HER HAIR BLUE

oak osprey
#

(blue shirt)

#

try this

wispy ether
#

what does () do

#

still purple

oak osprey
#

groups it

wispy ether
#

wdym groups it

#

like

oak osprey
#

you might have to group orange with hair too

wispy ether
#

purple (eye)

#

?

oak osprey
#

(blue shirt) (orange hair)

#

tells the parser, those are combined

wispy ether
#

girl, (orange hair), (purple eyes), (blue shirt)?

oak osprey
#

did that fix it?

wispy ether
#

no

oak osprey
#

i'm colour blind on blues and purples.

wispy ether
oak osprey
#

is that VAE tiling?

wispy ether
#

but not orange

wispy ether
oak osprey
#

try adding (black background)

wispy ether
#

i leave it

oak osprey
#

man i wish these images would just download already so i can start training

#

it occurs to me, i can keep the non-square images and use them to train outpainting model

#

slice them to square, and present both the original, and the square one. this is my limited understanding of that process

woven dawn
#

@wispy ether why don't you post all these in the #🍥|anime chat instead of general chat :V

#

Surely you're aware of the anime channel - the most popular and active channel in the whole community :V

wispy ether
dense tapir
neon crystal
#

Fully clothed men who do not look underaged to counter the you know what circulating here.

oak osprey
# wispy ether

what's with the obsession over young looking anime women, out of curiosity

dense tapir
manic olive
oak osprey
#

beginning some test training checkpoints for the midjourney set

#

Instance prompt detected: adults male sitting at a table while a child that was him when he was 6 years old is trying to get his attention and tugging on

#

those people ask for some trippy prompts

#

Rick Moranis, or Woody Allen?

wispy ether
#

i just enter girl in prompt

oak osprey
#

girl is commonly understood to be, like, girls. you know? children

wispy ether
#

bru

#

woman is 18+ meaning

oak osprey
#

"woman", "grandmother", "auntie", definitely more aged terms to use

wispy ether
#

sooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo

#

i mean

#

woman

#

women

oak osprey
#

well, not what you said 😛

#

i can't guess what you meant

wispy ether
#

what does this look like

oak osprey
#

not good

wispy ether
#

i mean the age

oak osprey
#

idk, 20-something

wispy ether
#

arent most anime characters under 18 anyways

#

💀 💀 💀 💀 💀 💀 💀

#

npc

wispy ether
#

what

oak osprey
#

is that a super burnt model?

wispy ether
wispy ether
oak osprey
#

🚓

dense tapir
#

instead of girl use young woman

wispy ether
dense tapir
#

no, it is not.

oak osprey
#

the model is overfitted on CP

dense tapir
#

apparently so

wispy ether
#

uwu

oak osprey
#

it seems like it's Not Safe for Life, the level higher than NSFW

wispy ether
hasty nova
oak osprey
#

it absolutely does lmao

wispy ether
oak osprey
#

i won't die on that hill, if anyone disagrees. but liek, grossies

#

that's way better, Nep

wispy ether
#

im crylaughing

#

im dying

oak osprey
#

but if all you can get is either grandpas or young kids that's super weird and not fixing that model's reputation, if anything, it'd make it worse lmao

dense tapir
#

As a rule of thumb, use “girl” for any female up to college age, use “young woman” until about age 30, and use “woman” after 30. The word “young” can be added depending on the age of the speaker. If a young woman is throwing in pre pubescents then it is a CP overtrained model.

oak osprey
#

yeah and overtraining it on 18+ year olds that LOOK young is also qualifying imo because that just distorts the model weights SO MUCH, their features overlap largely with that group

oak osprey
#

omg it's johnny depp

neon crystal
oak osprey
#

@neon crystal what model is it that you're using to make these horrendous images

wispy ether
#

woman

oak osprey
#

you've posted them multiple times and they're always awful

wispy ether
#

crippled hand

#

dawg

oak osprey
#

Nep you're not doing yourself any favours here

dense tapir
#

I find when I see the word Anime, or heard it said I no longer think of Japanimation or Vampire Hunter D rather all I see are sick CP'ers in my mind. Grown adults fantasizing about little school girls, or worse.

wispy ether
# oak osprey Nep you're not doing yourself any favours here

/usr/bin/env bash
#################################################

Please do not make any changes to this file,

change the variables in webui-user.sh instead

#################################################

If run from macOS, load defaults from webui-macos-env.sh

if [[ "$OSTYPE" == "darwin"* ]]; then
if [[ -f webui-macos-env.sh ]]
then
source ./webui-macos-env.sh
fi
fi

Read variables from webui-user.sh

shellcheck source=/dev/null

if [[ -f webui-user.sh ]]
then
source ./webui-user.sh
fi

Set defaults

Install directory without trailing slash

if [[ -z "${install_dir}" ]]
then
install_dir="$(pwd)"
fi

Name of the subdirectory (defaults to stable-diffusion-webui)

if [[ -z "${clone_dir}" ]]
then
clone_dir="stable-diffusion-webui"

#

hi

neon crystal
oak osprey
#

@neon crystal add the keyword 'obese' to that prompt

oak osprey
#

nope, instead i'm just wishing you'd go away

wispy ether
#

be

#

fate decided to let you suffer

oak osprey
#

@dense tapir it's only a matter of time before that anime channel gets entirely removed from this server, like they always do

#

for precisely those reasons

neon crystal
#

Dont have that instance spinned up but here's an old one.

wispy ether
#

EW

#

that u?

dense tapir
oak osprey
#

i think it even requires a special role to see. because, yeah. it's commonly gross stuff

dense tapir
#

ewww

#

They allow that YET jumped on me for a censor box over some titties?

oak osprey
#

cognitive dissonance

dense tapir
#

SMFH

oak osprey
#

you see, they rely on community reports to moderate and when they're all siloed into their own gross room, no one reports

dense tapir
#

their own vile cesspool, yep.

oak osprey
#

but just use ⚠️ on any posts that need 👀

neon crystal
wispy spindle
oak osprey
#

ohhh it's that thing where it hides all the channels by default

wispy spindle
oak osprey
#

i really hate that

oak osprey
#

yeah i just looked over there and it looks quite instructive, helpful, and in general, fine.

wispy spindle
oak osprey
#

i always like to see when groups are working on creating better 'creatures' instead of 'people'

wispy nest
#

creatures? :P

oak osprey
#

feels wholesome

neon crystal
oak osprey
#

@wispy nest try bulbous silly looking creature with round eyes and made of flowers and leaves, happy, smiling, friendly

#

please show me your result because i love this prompt

#

hogwarts: class of 1995 yearbook photo

wispy nest
dense tapir
oak osprey
#

i think the middle one doubles as a vehicle suspension component - a shock

#

handsome Hagrid

#

and handsome Ron lmao

dense tapir
wispy nest
smoky oak
hasty nova
#

@oak osprey I stopped downloading those MJ images at 12k because I don't want to have to sift through like 130k images 😆

#

it's already 17.4GB

wispy nest
#

too bad about that crystal cane(?) but I always laugh at what the ai cooks up :P

dense tapir
hasty nova
#

yknow

#

going through this midjourney v5 dataset

#

it really isn't that amazing at realism huh

#

sd is far better

#

I think it's partially the upscaling

smoky oak
#

I've been saying that for months lmao

I would never train SD off of MJ, outside of a specific style lok

hasty nova
#

well im going through and picking a bunch of mostly decent ones

#

won't be too many

dense tapir
#

MJ has always had its own style which is the exact turn off for me.

wispy nest
#

To be fair, I don't think most people using MJ want realistic ai images. But I wouldn't really know, I've never made one, I only know some who use it and none of them want realistic stuff anyway, soooo 2 people say so! :P

hasty nova
#

it always has a digital art feel

#

even my model can do both separately most of the time

#

I'm surprised at how many children there are and how weird they look

#

people generate some weird stuff

dense tapir
#

I see controlnet is now V 1.1.183

#

I started with 1.1.159 just 3 days ago

#

maybe 4

wispy nest
# hasty nova looking at these images MJ is really bad at realism

I'm not one for realistic stuff or photos when making ai art, but it'd be nice if everyone could make what they enjoy. MJ probably has their corner of the market set.

All I want is less blur and more sharp corners to make the details pop instead of the more "smudgy look" most ai images has, or have in my eyes :)

dense tapir
#

I am waiting for styles and tiles models for 2.1

hasty nova
#

not sure how to use it though

wispy nest
#

never heard about it, I just started using the ai and reading stuff about it at the middle of last week from a several month long break

#

I mostly let the upscalers deal with de-blurring my images. To very varying results :P

smoky oak
#

@dense tapir you changed your PFP :>

dense tapir
dense tapir
hasty nova
#

what on earth is this

dense tapir
smoky oak
smoky oak
dense tapir
smoky oak
#

Your minicky was too good!

wispy nest
#

and looks aren't everything ;P

hasty nova
#

well I have gathered 56 mj images, I think that's good enough for now, I am now realizing the pretty bad quality

#

captioning all this is going to take forever

smoky oak
#

I am so glad that somebody who is not "baised" is coming out an saying that MJ really just is not as good as people think it is lmao

When I said it, I was deemed an SD cultist who can't see how much better SD is

dense tapir
#

Go against the current thing expect that

wispy nest
#

it more of a "taste" thing. I just want everyone to be able to make what they want and have fun creating cool stuff :D

oak osprey
dense tapir
#

I never cared as I have always said I don't like MJ. I mean they all have the same look, feel, and style. That MJ style.

hasty nova
#

well, I have been using SD and never touched midjourney, but you do see like youtube thumbnails that have extremely good results out of MJ, and I am seeing nothing of that quality from every day users.

smoky oak
#

we just had ridiculous and uneducated shills like sen who would say that MJ was better in every way. All they talked was nonsense lol

oak osprey
#

i like the other aesthetic values of midjourney, a lot more than baseline 2.1, so, i'm throwing at it to 'see what happens' but you seem like more of a perfectionist

dense tapir
#

OMG, yeah

#

Sen, lol

oak osprey
#

yeah junglerally i noticed the same thing

hasty nova
oak osprey
#

there's a lot of cherry-picking going on