#๐๏ฝsd3
1 messages ยท Page 67 of 1
left is AYS kolors workflow, right original workflow by kijai. add AYS seem give more adherence on prompt
sure. but not very useful for the individual person. the only thing that really is useful to them is whether the results they personally get are what they want or not
Reading this link with my tiny eyes it looked like "Im gay dot org"
Manual man, we need a manual. XD
play around on the site, you'll quickly see why it's such a powerful tool. it pits two random models against each other while only showing you the prompt. you vote L/R or tie, then it shows you afterwards which was which. the whole point is to test how flexible a model is. on the rankings page, you can click the stats button to see how a model fairs against other specific models. Oh and the prompts are random stuff people ask for (you can prompt for something as well i think, but ive never tried)
so basically, when you thrown a bunch of random prompts at a model, you get a much more realistic rating of how well the model handles various types of concepts vs the cherrypicked BS that most papers show where they have some ultra fine crafted prompt that happens to work really well with a model, since they know the data/captions that were used to train it.
you do that on svd too - however, i'm pretty sure that while it's useful to other peopel, it's not as useful to them as just learning how to talk to the models correctly and generating themselves
you should see some of the random ass prompts people throw at it, it's actually kind of rare to see cliche waifus and stuff.
people are undereducated and the average reading/writing level is that of an eleven year old
All of this is until 8b is out, so if you could get on that. ๐
what does that have to do with working with the AI, learning how it thinks, so you can get what you want out of it every time? that's what is actually useful to someone.
you can expect 8b in about 5 years, longer if people continue to harp on it
I'll also drop more sugar on you, I've been using your aam xl model a lot. It's really great as a general model as well, able to handle an impressive amount of concept beyond just "anime".
First impressions matter a lot, SD3 got off on the wrong foot
Imagine if they keep hyping it like the 2B and in the end it's not the same as they are using in the API but it's a neutered one
Hopefully in dog years. ๐
2b isn't neutered and it wasn't lobotomized
Thanks, I'm happy you like it. There are also finetunes of that one now by other people
I like Neta Art
My message did not imply 2b is neutered
in general most models built on top of Animagine are pretty great
apply transductive reasoning here: dumb/undereducated people vs highly complicated tech stuff. i'm 99.999% positive that you don't actually fully understand what's happening under the hood. you likely have a gross overview understanding, but it's just the tip of the iceberg.
AAM XL in particular is built on top of both Animagine (indirectly) and Dreamshaper. Another reason why it's able to go Turbo without too much quality loss.
Agreed, anime has a lot of dynamic camera and posing angles that break out of the standard symmetrical portrait looks.
@lavish osprey I'm wondering if my glif (that I remixed from another) has anything built in somehow that isn't mentioned? I looked through it all, and it's only SD3 and Claude helping with prompting. But somehow the images come out better than just straight up SD3 on my own computer.
To see any settings, you can just hit remix. But more likely it's something glif has built in to just make everything better, OR, is claude the awesome and it's all about prompting?
i'm 999.99% positive that i understand exactly what is happening under the hood.
Glif is using API, so SD3 Large, right? Or they allow to select SD3 Medium?
on your computer it's 100% sd3 medium, since we didn't release large yet.
Hello 
meow
well i'm sure dunning-kruger would have some things to say, but i'm not going to be mean at this hour lol. regardless, i promise you're not prompting as ideally as you think you are, which means you don't actually understand "ai" as well as you think you do.
Balls
i promise you that after spending more than 3000 hours meticulously prompting stable diffusion, changing as little as one character in a prompt, i know what i'm doing and that you do NOT know, though you think you do, my skill set. I also promise you that i'm a programmer and have probably got a deeper understanding of the code than you do.
shiny
Pineapple pizza ball
promise promise
dogs that makes lots of sound
typically dont bite
MFW I'm italian
i'm not sure if that's edible or not... where's the crust?
Inside is all crust
Chocolate and MMs?
not on tomato sauce. On marshmellow, sure
I'm fastly losing my health points
You'll become American after this one
10 health points, however 200 diabeetus points (which is like the USA mana points)
i know plenty of people that have done some kind of thing for a job for decades and are still not great at it. practice doesn't make perfect, perfect practice makes perfect. part of growing and getting better at something is accepting that you don't know it all and that there is always something to be learned. otherwise, it's just megalomania/narcissism mixed with dunning-kruger.
think what you like.
i will
and keep it to yourself
i wont
doing something wrong a LOT doesn't make you good ๐
You will
Dunning Freddie Kruger effect
how about a fruit pizza sphere
I have no clue what you guys are on about, seems very dumb
Hilarious joke, dunno if you made it on purpose
Like the people I see in games arguing who is right 
Idk, I didn't read half of it
I think we share the same braincell
Sharing is caring
I share my balls half the time
now stick that inside a bubble
Imagine doing this in the house of a vegan, they won't be able to eat
nah, bros acting like he's dr. jenkins from starship troopers saying "its afraid..." like he can magically read an ai's mind or something and trying to hit people with the git gud spiel lol...
i said learn how to talk to it. learn how it thinks. it is not that hard.
why not use the square ratio for this
you can't "talk" to something if you don't know how it "thinks"
what'd you do with the rest of the fish
well we certainly know how you think. ignoring you now.
Rolled away
fish heads in floating spheres.
Fishy situation I reckon
yeah, logically and realistically... but alright man, i get it, your ego is hurt and at this point, you're probably taking this as cyberbullying or something. see how i'm "talking" to something because i can understand how it "thinks" inside?
How are you using SD3 with 4gb VRAM ๐ญ
naw, just done trying to explain something to a person with a huge ego and lack of comprehension
He's a liar (not a cat)
There's a block option that alleviates you from certain unpleasantries. It solves a lot. ๐
Ban that mf
I just realized you can go negative in the positive prompt. Like I can write "None of them are outside." and it worked.
You all probably already knew that, but anyway
By utilizing the powers of the internet im able to use the api and be lazy in my bed while my PC is crying in the corner
They don't say, so perhaps ๐
Only their cat has 4gb of vram. They themselves have more than enough.
Underrated joke
I'm always impressed by how perfectly spherical your balls are.
Dude works out
it's Large probably, They had that contract before Medium was released. Unless they switched to Mediun for the price
i know, the problem is that you then can't see what they are saying - and sometimes it's important that you do so
Does it really matter? Not really.
Did you shave with the razor?
No razor can tame that
That's the aftermath of razor shaving (some weeks later)
That's on a cold day
Clearly some shards of razor have been left behind.
Booba
Can you output a ball lying on grass?
is it bad that I can tell this is 100% fireworks sd3 8b?
I dunno
Try it on 2b
My prompt is "A shining metallic ball with 2 arms open lying on grass"
Science has gone too far

Damn
runs away
New York Times Opinion: Is This Too Suggestive For SAI?
lol! does this count ... ?
Clearly underhanded methods were used
Because hands are under
photograph of a woman leaning sideways on a grass wall?
That grass looks like a... yeah lol I was beat to that
8b
How sd learned hands: it grows them
the side view made me rotate the image, was more hoping for a front view with flat solid green wall of grass
Extra fingers are, in fact, fruits
try adding "front view" to your prompt then
Imagine a mummy during that covid toilet rolls shortage
Kolors did a good job too. Not sure I can think of an idea for that one. Maybe in the aisle with a barren shelves with hands raised to the sky yelling why
That's nice, maybe with a face mask would be more understandable though (I guess)
Neither sd3 8b nor kolors is understanding "barren shelves". It's giving me the opposite. Maybe he's angry about too many choices.
100% sure OpenAI will make a special denoiser (or call it what you like) for AI generate images and train the next Dall-E on insane AI generated images but will be sure to edit every one with some automation in order to avoid artifacts
Try empty shelves
Glif? Or, it costs a a subscription fee to change your username, could be an old one lol
Out of a lot of attempts, managed to get a single one with "empty" ๐
That's my local supermarket after I bought all the toilet paper (I have a hyperactive gut)
Ref image maybe?
That exceeds my laziness quotient.
BRB gotta photograph myself in mummy bendages while I show despair in an empty supermarket (I have the budget)
try "empty shelves"
Empty shelves is a banned phrase from the SAI datasets because it's too naughty
I smell a Lora. Or is that something else?
No I was joking
govt intervention, cant show empty shelves
I know, but a dedicated Lora of people shouting why at the skies in grocery stores would probably find a niche market on civit
Gotta make a dataset with Dall-E 3 and then train a lora on it I guess
go to walmart and take pictures for your dataset
Wow even dalle couldn't do it
I'd rather not pose kneeling down with a desperate expression to be photographed
so ... real life photo model for clothing branding, seems an easy task for ai.
custom desing NOEDEL brand outfits. not for sale yet!
Off-grey stretchy combi outfit, top with half-long stretch skirt. Good for both parties or casual activities. Not for sale yet. Noedel brand.
sleeves not included
Hey Goo Goo Gage, I hope you get banned for life
y'all are such good friends
well crystalwizard, I certainly hope you missed that video which has been removed. was there for too long.
So how come Ella over the various Claude or Ollama ones?
I don't care about the hands, but how can I make SD3 stop using double ll ? It's Noedel, not Noedell
photoshop it!
no, sd3 should do it. back in the days using sd15 i first generated and image, then photoshopped the text over it, then do a refinement with img2img
SD3 should do text better, it's not like I'm asking for a full poem in correct layout
maybe I should ...
btw, this is my character Caitlin. Always wearing custom designed glasses, cuz she wants and can afford it.
8b, does. 2b - doesn't have that sort of enhanced ability.
I just want a working imitation of M$ Word Art (tm) inside the SD3 medium model. perhaps too much to ask for a 2003 technology?
๐
sorry, that was bordering trolling. Nevermind, I switched to generating realistic images of Caitlin anyway!
like ascii ?
lol, no. ascii is 1963 tech

give me the prompt you used there
have you ever looked at Comfyroll studio nodes
no, what is that?
Caitlin, expensive glasses for each day or outfit. Cute looks but a calculated cold character, who loves her work and does not like standing still. Her study in behavior management really helps her at the facility.
For all the talk of censorship in SD3, I find it draws women who are randomly straight up naked (but disturbingly without nipples or like anything other than continuous skin downstairs) quite often, even when I didn't prompt for it at all. Not gonna post any examples cause they're all creepy, looks like burn victims or something lol
yes
damn, it seems to be hard to get a real photo style with purple eyes from sd3
yes, it seems there is still some nsfw hidden in the model, but indeed somewhat masked/mangled
it seems they have not removed all nsfw from the dataset and trained from scratch, but instead evolved existing models based which do include nsfw images in the dataset. I think sd3 is just trained not to show the more explicit stuff that is hidden within it.
if nipple reroute to smooth plastic
does anyone have a guide to CFG and Steps numbers for SD3
maybe it's just me, but I think sd3 makes better sexy images when putting stuff like (naked, nude, explicit:1.1) in the NEG prompt
yes, go for low CFG
there was a complex discussion on here a few days ago about negative prompts
they might not do much
like between 3.8 and maybe up to 6 typically around 4.0 to 4.4 or maybe just keep it on 4.375
I am not sure as I am just now barely starting to test SD3
ah okay thanks
yeah 2 was too low and 7 too high
and if you want to go crazy, don't let the usual limits stop you
btw, I only know sd3 medium running locally
im not sure, but I think the architecture and prompting for the larger models differ
I am using a huggingface space
but the only problem is it doesn't say the sampler
I might have to do it properly in comfy to know what the actual full settings are
many samplers dont ever converge by design, kinda why i like DPM_Adaptive it picks out how many steps it needs by itself to converge
Caitlin back at work, as happy as she can get! (she loves serious work...). though maybe sd3 turned her a bit asian. Oh well, it's all about behavior for Caitlin, the good looks are just her trademark.
I've always used DPM++ 2M Karras, DPM++2S a Karras or DPM++ SDE 2M Karras
cos that gets you an ancestral one and an SDE one
as well as just DPM++ 2M Karras
Interesting
very similar prompt in SD15 - RealisticVisionV60B1 - for reference
you don't see it, but there was color bleeding all over, so had to cherry pick this one. SD15 might be able to give nice(r) results, but at the cost of some rejects
prompt: hdr photograph, head and shoulder shot, a man,1960s hippy
hdr photograph, head and shoulder shot, a man, cyberpunk 2077 hippy
i just thought it was funny that when i asked for a 1960s hippy, i got a jim croce look alike
makes me think of Frank Zappa
yeah, sort of a mix between them I think
sadly the bartender triggered the anatomy problem
ok so putting man in the negative is the way to go
my friend challenged me to train this barbie bimbo instagram model on sd3 and said it was impossible and could't be done. i'd had difficulty getting people's likenesses but i thought i could show him up. so i think i trained sd3 to do this megan millions girl using pics scrapped off her instagram. it does a lot of selfies mostly but it works.
||https://ibb.co/album/F5cDXw|| may be slightly pg13. these are all from sd3 with my trained lora. she's not my type but i love giving a good ol "in yo face" to haters.
if i could do this with 100 images over 100 epochs. minute per epoch. i believe in fine tuners
Results are good, but damn a lora on 100 images for a single subject...
negative works
maybe that's where i went wrong. other expert lora trainers are using 80000 alpha
No I didn't mean you are doing it wrong, if it's working it looks like an issue of the architecture of the model. I'm used to a max of 20 images for the best likeness
i was joking a bit lol. i have no idea what i'm doing. i'll try with less images.
Those results are VERY cherry picked i should say too. All the underlying model problems are still there
I think the average instagram model should be easy enough to train on 10 images, you'd have more difficult time with videogame aliens characters
i think you could just prompt that character without training anything and get it without too much trouble
yea its easier to train on photorealistic than on highly stylized imgs like 2d
Well... if you have to train nudity on SD3 maybe you might need some thousands of images for a lora, as Cat with 99999 gb vram implied
sd3 training is more efficient or maybe i'm just stupid. I can barely do batches of 2 on sdxl but on sd3 i can do batches of 10
wth room to spare
Do not make me come here and use my VRAM, yeah?!
๐ซฃ
Me: clueless
like, totally
I have no idea either
BALLS
https://www.youtube.com/watch?v=EZNFo5lL4iw has this energy to it
The Official HD Video straight from the Band. From the album SUPERFAST.
oh goodness!
ok so it can make a bartender correctly
but only if the bartender is batman saying hello
thosea are some interesting chairs
lol yeah
art deco in the prompt does that
if you cherry pick you get much better results, this one is fantastic
its very inconsistent
hello
hmm the image quality goes up if you stop prompting for super heroes
same prompts but with bartender instead of batman or superman
the fashion
add rococo to your prompt
can someone DM me the 3.1 2b plz
you have to prompt for it
if you cant I think its a skill issue
no way jose I copied all the skillz i needed from the hugginface
huggingface called, they want their skills back
A narwal anthro ๐
those are some magnificent cats
hey its the sd3 wizard form the promo banner, he's back with a gooder license and the promise of an even gooder model to fix and rectify the universe. SD3.5 will be great just like SD1.5 just like poetry- it will rhyme.
how to use?
thats a very vague question...
if u mean sd3 then u got the API and/or comfyui but youll have to dig a bit for the models since civitia banned them
i want to use like a midjouney in discord
then go to the midjourney channel
well there is Artisan
thank u
At dusk, a muscular man riding a bicycle at 120 KM/H on the highway, dramatic lighting, intense motion blur, dynamic pose, cinematic atmosphere, high-speed action, detailed muscles, realistic style
balls!

made with the medium opensource version
What about to lay down? ๐
Neat, I woke up to some balls!
There seems to be 3 versions of sd3 for dl. 1 w/o clips, the 10gb one w clips, and the 15gb one w clips. Does the w/o clips version require less vram? Does the 10gb one require less vram than the 15gb one?
The largest includes clips and the T5. You can load everything individually, or all together. The different sizes just give you flexibility on which parts you want to use/load.
I like SD3 now
didn't really at first
but making stormtroopers invade 17th Century France has been fun
Thank you ๐
I have the 10gb one, tempted to get the 15gb instead. They both have the clips.
I love it when software "shopoing" is free ๐
The most flexible way is to get the smallest sd3 model(doesn't have text encoders) and to then download the three individual encoders. Within comfyui, you can then use a single, double or triple clip loader to pick which ones you want
And it will take up the same amount of storage space as if you downloaded the largest sd3 checkpoint that contains all three encoders with it
one tree during winter, reflection from lake, all white --v 6.0
oh and dont worry about the fp16 version of the t5, it's mostly pointless. you could run a million A/B blind tests and would likely see them both within margin of error of each other, in terms of voting
if you have the ram/vram for it, sure, go for it
I can barely run sd3 to begin with lol (only 8gb vram)
It knows quite a few painters from that period as well, I'm case you ever want to get really specific
Dall-E 3 very likely knows even more, but the prompt expansion might fugg them up... and not to talk about the filters lol
I don't know anything about anime but this is my sci fi anime attempt
not even sure which anime that style comes from
Thank you for using comcom analytics.
"comcom analytics" supports all community managers (moderators and server owners) by stats, visualization, and analytics.
If you have any questions, feel free to ask us!
Your dashboard
Help
Support server
Other languages
en: help
ja: help Japanese
I run SD3 on 8gb just fine, still using an old RTX 2080 FE on this PC
the one thing emad delivered on: it is PC resource friendly.
Thanks emad. Say thanks kids.
well i'd also give a HUGE shoutout to comfyanon and all the others that work on comfyui. that's where the real performance comes from. his system for automatically handling model offloading at various steps and stages of the process, is what keeps the vram usage down.
yes comfy is amazing, it is as fast as foooocus but without the quality loss
6GB also run just fine
although may be a little bit slower by community standard
( 40 sec per 28 steps 1024x1024 )
that is fast at my standard
4-5 for sdxl 1024
nah thats about average if you're using an older gen card. my 2080 spits out 35 step 1024x1024 sdxl images like like 30 seconds i think
lmao
๐ค
it is a Laptop 4050 so yeah
if you know specific resolutions you like to use, i highly advise going the tensorRT route for experimenting. like if you don't need to use cnets, loras, ipa, etc, you can get like 75% reductions in generation time. it's great for exploring (they might be able to work some of those features? not sure, never tested to actually see)
like for me, the only resolutions i use are 1024/1024, 1152/896 and 1344/768, or reversed
yeah, but the problem is that most models are not trained for extreme aspect ratios like that. sure, they'll work sometimes, but think of the dataset used to train the models. there is likely VERY little content involving things like 21:9 aspect ratios. 16:9 would is likely the widest they naturally want to go, which aligns pretty well with 1344x768(still roughly 1 megapixel, so well within a typical model pixel range, and also, both numbers are divisible by 64/32/16/8 evenly). i use 1344/768 a lot because if you do a NN latent upscale by 1.5, it puts it just a hair of 1080p and is super easy to crop/resize to size
Create a highly realistic and dynamic image of the Indian cricket team celebrating their victorious moment after winning the Champions Trophy. The scene should capture the exhilaration and joy of the players as they celebrate on the cricket field. Use vivid colors and sharp details to portray the players in their blue uniforms, some holding the trophy high, others embracing, and some jumping in joy. Include elements like confetti raining down, fireworks in the sky, and a jubilant crowd in the background. The expressions on the players' faces should reflect pure happiness, pride, and excitement. Ensure the setting is a well-lit stadium, with bright floodlights, a lush green pitch, and the Champions Trophy prominently displayed. The image should evoke a sense of triumph and national pride, making the viewers feel the energy and emotion of this historic win.
Specific Details:
Players' Emotions: Capture various emotions like shouting with joy, tears of happiness, and players lifting each other in celebration.
Team Unity: Show the players in a close group, arms around each other, symbolizing team spirit and camaraderie.
Trophy Display: Ensure the Champions Trophy is clearly visible, being held by the team captain or a group of players, reflecting the significance of the win.
Background Elements: Include a cheering crowd, waving Indian flags, and banners with congratulatory messages, adding to the festive atmosphere.
Action Shots: Some players could be shown spraying champagne or doing victory laps around the field.
when i try to create an image with SD3 1368px * 2048px it just fills top and bottom with nonsense and process a square. Is there any workaround ? SDXL works fine in comparison.
That resolution isn't supported in SD3, try creating a 1:1 image or a 16:9 or a 9:16 image
9:16 i have the same issue ๐ค
Nvidia 2060 here. It runs fine with simple workflows, but not with any fancy ones.
How is 2080 old?
how much is your cfg?, and what sampler + scheduler are you using?
cfg 7, sampler dpm2, sheduler normal
try lowering the cfg to 3.5 or 4 and use heunpp2 with simple or normal and set your steps to 28 or 30
None of that fancy triple prompt workflow stuff though ๐ฆ lololol
SD3 works best with low cfgs, higher cfgs make the images look burned
I need that fb care reaction lolololol
Sadly sd3 2b looked burned a lot of the time even at 4 cfg. Rather frustrating.
Weird, i never have a problem with burned images with SD3 2B
This is with cfg 4. Seems harsh lighting.
I generally think that's an issue of prominent synthetic data in the pretraining
Euler. Heunpp2 is massively slower and doesn't yield better quality for me. I, doing 50 steps though which did make a difference.
I'm using comfy's workflow that he published.
It the regular one
they don't say how many images of each aspect ratio were in the training data, but this is the table from the SDXL paper and it has 16 9 in the form of the resolution of 1536 x 640 in particular. It apparently has some images at 2048 x 512 which is crazily wide.
still questionable output oO
with sdxl i got a lot better results
Can you paste the prompt? I'll try on mine
here are more (questionable) SD3 outputs
tried just a simple prompt:
positive: top view, Music Festival, anime style, key visual, vibrant, studio anime, highly detailed
negative: photo, deformed, black and white, realism, disfigured, low contrast
Negative prompts don't do much in SD3, using ...................... or aaaaaaaaaaaa has the same effect on the image
Based on comfy's workflow, I've found the optimal res to be 1024x1280 or 1024x1344. I don't use anything else anymore.
i need something like 1368 x 2048 as smartphone backgrounds
i will try with 1024x1344
Yeah, if you need higher, you'll have to use upscaling methods. Sd3 won't full had resolutions directly.
can you tell me your settings, or show / send your workflow.
i get heavily nonsense with SD3, idk xD (same prompt as told above)
I use the workflow on the woman picture on that page, at 50 steps
give it to the cat with 4GB vram, he needs help
Do not make me come here and use my VRAM, yeah?
I've seen Cat's images, no way they are from a bgb system only lololol
Dude's Ballz (TM) are amazing, I guess it's not a neutered cat
I have to say tho that all the low vram users I think limit the model. Can we really get an excellent model if it has o fit in 4GB?
I put mask in the prompt to avoid face issues
and it decided to do a lace mask
its quite clever at adapting to the theme
when I put friendly witch in the prompt it added flowers and a vase to also make the background more friendly
hmm same prompt but now they have covid mask
4 I gon't think is actually possible. Even 8 is pushing it (no advanced workflows). 12 is recommended.
I'm still surprised someone managed with only 6gb vram!
He uses the API
Allegedly. Meow.
The catch with the API (and glif and huggingface) is that we won't be able to add checkpoints and loras to it when they come out I've been eyeing up some cloud systems!
via glif SD3 large with CLaude helping
A werewolf sphinx lol
brb, Imma post this on some conspiracy groups ๐
Prince Thun, of the Lion Men.
A Ctulu centaur lol
the 2080 came out almost six years ago and was a top of the line card that cost in the 700-800 dollar range. now, it's roughly on par with a 4060 that costs less than half that and that draws less than half the power. in the tech world, six years is a long time.
yeah that is pretty wide, but again, they are likely a very tiny percentage of the actual dataset. wouldn't surprise me if all of the data beyond maybe 16:9 or 9:16 made up less than a few percent. this is why a lot of models will list recommended ranges and warn you not to stray too far beyond the recommendations. and when you do, you end up with the weird duplicated bodies and stuff like that
2080 is ancient in AI terms, itโs still a decent card mind you but saying itโs not dated is a stretch
yeah its likely only a few percent of the actual dataset
I tend to use hidiffusion or deep shrink for generations like that
Yeah it's still decent for 1080p gaming and other 3d related stuff, but Nvidia is two whole generations ahead, about to be three when the 50xx series launches.
Point was that an entry level card of the current generation is on par with a top of the line card from the 20xx gen
But it also meant I got a ton of usage out of it before needing to upgrade again, so there's that
mmm its better than some, i rock a 2080 TI
waiting on 50 series pricing, when i get up off the floor ill probably get a used 3090
Hmmm, cloud computing options seem to be less than monthly computer payments, for a computer which will be obsolete in 2 years, hmmm
not obsolete. the 2080 is 6 years old and still useable. just ancient
obsolete has a specific meaning
lots of old tech stil has uses
landfills!
e waste is a useless thing
Give me that useless and obsolete card 
who doesnt want more land
Yeah exactly. But I'm holding out for probably a 5070. I'm not THAT big of an AI junkie, but I still game quite a bit. I don't care about shit like 4k 360fps style gaming though, so the xx70 model cards are more than enough
you should turn this into a where's waldo
I'm assuming the 5070 will have 16gb vram, so it's good enough
i game tons so i'm always getting a new gpu. AI was the cause of me switching from AMD to Nvidia. I probably should've switched around the 1080 generation instead. Nvidia really started to shine so hard then.
I have a 7900xt in our other PC, it's dawgpoop for AI related stuff, or at least it was the last time I tried it last fall
new card every couple of years but i try to find purpose for my old cards and don't just landfill them. at my old house i'd have them on my wall but they were ugly. i might make a knolling case for them next
thanks i learned a new word today
was actually going to get the 7900 on launch week but it was apaper launch in Canada. I couldn't find any places anywhere between edmonton and vancouver that got any in stock. amd fucked around that launch hard. that was the other major factor that made me switch to nvidia
i actually learned that when sd2 came out and someone had a knolling case lora for it that was beauty
https://civitai.com/models/1203/knollingcase-embeddings-sd-v2-0 mb it was an embedding
Bruh the term knolling case is so redundant... Almost every case you'll ever see or uses 90deg angles, even if there are curves, the base is flat or the object being held is kept perpendicular to the surface the case is on
Sd3 2b. I asked for inside the brain of Cookie Monster.
Never ladfill computers! in my city there's this for eg., as well as another~~ one~~about ten or so which donates them to people to broke to buy their own computer. Probably similar in every city https://www.rebootcanada.ca/#:~:text=Supporting reBOOT Canada is simple,virtually anywhere in the country.
reBOOT Canada provides computer equipment, training and technical support to charities, non-profits and people with limited access to technology.

i will gladly accept your computer
gotta save room for the wind turbines, they ar HUGE
I gave away some from the 90's earlier this year, are you sure? ๐
Becky is one step ahead of me 
Don't get too excited, the tech they give away is generally 6 years old at least. Also people have to prove that they really well below the poverty line.
used to pick computers on garbage days and Frankenstein them into something, dont see much on the curbs like that now
My gpu is 6 years old 
when i was a kid and couldn't afford a pc that could play quake, something like $4000. I had a 486 given to me that kind of ran windows 3.1. it had a 1 speed cdrom and the discs had these jewel cases i had to put them into before popping that whole case into the drive.
I played the shit out of lemmings on that beast and learned so much
and none with GPUs I'm pretty sure lol

I have a dell workstation from 2018, and it has no GPU, darnit!
BALLS!
Haha
A colorful, swirling vortex of half-eaten cookies and crumbs inside a fuzzy blue brain. Chaotic synapses shaped like chocolate chips firing erratically. Dark, shadowy corners filled with forgotten vegetables. Frenetic thought bubbles containing jumbled letters spelling "COOKIE" in various fonts. Tiny, worried-looking Sesame Street characters trying to navigate through the cookie debris. Flashing neon signs reading "EAT" and "MORE" scattered throughout the brain tissue. A distant, echoing laugh track playing in the background. Cracked mirrors reflecting distorted images of cookies and milk. Pulsing veins carrying streams of cookie dough instead of blood.
Is that an ai generated prompt?
I think SD3 should work best with AI generated prompts as its training captions were made with CogVLM
It's Claude 3.5 sonnet. I was using gpt4o but I just started using Claude and it's SO much better for creative prompts.
AI prompts, what's next ? AI images?? 
Ah darn, these new things made by THE DEVIL, in my times we had Dall-E 3!
it doesn't inherit some greater compatibility with the model that created the captions. cogvlm generates natural language captions. the inherent compatibility is then natural language prompts.
for a lot of people this means using an LLM
How to use?
Hi
Not sure, natural language made by LLMs is not really natural
i believe you're stuck in magical thinking and don't have any evidence to support your hypothesis. cogvlm captions don't lead a model to understand LLMs better than people prompts.
You have a skewed understanding of what natural language is.
I mean I'm not entirely sure, there are many systems that recognize "synthetic" sentences and long forms, but that's another thing, when I said it works best with AI generated prompt I didn't mean necessarily in contrast to prompts written by humans but rather "it's the intended way"
natural language was intended. not a stronger compatibility to LLMs. LLMs can produce tag style prompts too
hey im having problems installing ReActor, it's not showing up when i installed it. Any way you guys could help me out?
consle probably says an error, something about no insightface installation. this is tricky on windows and i often elect to use a precompiled version of insight face. precompiled are often not ideal since that's how viruses can easily spread, but in this case insightface is popular so it's kind of easy to find a reliable one. https://github.com/Gourieff/sd-webui-reactor?tab=readme-ov-file#viii-for-windows-users-if-you-still-cannot-build-insightface-for-some-reasons-or-just-dont-want-to-install-visual-studio-or-vs-c-build-tools---do-the-following
the docs have a wicked troubleshooting section
I actually think OpenAI did the right thing by providing an LLM that was finetuned to prompt their model
I wish every publisher of a diffusion model did that
After seeing what omost can do, i'm ceratin that LLMs to prompt the model is the future. But thats for reasons outside of "cogvlm captions mean it understands LLMs better than something human written". I mean, LLMs were trained on human material to begin with
yeah its the insightface
if you don't use a precompiled version, the install requires all sorts of visual studio build tools and gets hairy to set up.
the original cogvlm is not too smart though
its not like GPT-4o
there are patterns in its captions
like repeated mistakes etc
@torn wharf i sent u a dm
you see that ninja scrolls is getting a remastered theatrical release?
I'm afraid I know nothing at all about anime
I was in best buy today picking up plugs. Stopped by the computer department for fun. I looked at the specs of the ones on display and said out loud "is that ir?!"
The person working there explained that the second any high end computers are released they are bought immediately. There's a limited amount if gpus apparently.
This was in Vancouver, Canada, a rather large city.
Not sure how true that us but Nvidia owner us buying a few more yachts I think lol
all I know about anime I learnt from looking at stable diffusion generations lol
yeah nvidia is most valued stock right now
Hey, need someone's help with gen. that has stable installed on a local machine
Long story short im away at work and will be home in like a month. Need someone with "stable" running locally (non XL, can be makeayo). Had been bored and wrote a prompt in my spare time. Wanted to see how well it performs, but it needs to been locally since its long/weighted and i wont get the same results with cloud based gen. (like never). Side note its nsfw(nothing hardcore or lolicon). I would send both positive and negative privately. Thank you
ninja scroll from 1993. it's legendary like akira. back when it was all hand drawn and they still took detail to unseen heights. I dont know anime too much but i respect animation as a whole.
am on vancouver island
no fires yet thats later in the year
Very awesome ๐
balls
Where'd you buy your system? Prob newegg or Dell or something
sort of built mine. Bought a second hand alienware area 51 pc and i've replaced most of the core components at this point. went from amd threadripper to an intel alderlake
this one was a 3 paragraph long prompt instead of one i wrote. its okay too i guess. still balls.
ice cold
top is SD3 and bottom is Kolors
not a fair comparison since SD3 a base model
but I hope it can get to Kolor's level
Kolors is excellent. Sadly it doesn't know Cookie Monster or Elmo particularly well
Is that Death Star fully operational?
Just train a lora
Did they bring out training stuff for it yet?
I managed to train it with my own script. It just replace the encoder to glm

https://civitai.com/models/571029/kolors-cotton-doll-lora-trained Here is the lora. Use it with lora loader. But you need this plugin first. ComfyUI-Kolors-MZ
First? trained kolors lora. Trained with my custom training script. Repo: https://github.com/lrzjason/T2ITrainer Prodigy, repeats 10, rank 32, capt...
wow thanks this is awesome
will try to make some kolors loras
what is kolors
in dumb cat terms 
Also in BALLS terms please
T5 moment
anything becomes cool if you add 4k background on it lmao
you might not be able to handle this one @edgy kelp
๐ฅฒ
not round enough
ball factory 
true, everyone loves using balls pretty much
If-you-know-what-I-mean
beans are horrible
I always run Foooocus with SDXL on 6Gb before I got my 3090
were you able to hires
bean corn
should be possible, you just need to enable sliced vae decoding and vae tiling. i have no idea how to do that with fooocus or comfy but in diffusers its pretty easy.
How about having some delicious bean popsicles to cool off after that?
with comfy you'd probably have to write a node - i don't think there is one that exists yet
@bitter hearth
anyone tried this?
sd3m finetune
trained on 50000+image
We could run it real quick
Where can we download it locally? Otherwise, I am out ๐
"An ancient castle draped in ivy, looks even more majestic under the setting sun"
Not sure it is worth running
Should be free on Tensor
Hmm yeah itโs a lot more different than the examples, I think probably a sampler issue?
I am not sure. That is a good question.
Can't sell the images on DA, damn
brace yourself
Someone needs to create these in rl, then sell them on food trucks! ๐
customers would be like "how are we supposed to eat these" and you'd just shout "BALLS" then closeup and go to a different location.
so then eveyrone in line would be like "why would you ask!?"
balls
I come back and so many balls
https://suno.com/song/cb2cff79-afd0-4cf2-8d5d-a7a3e05365c8 song about dem balls
90s hiphop funky song. Listen and make your own with Suno.
so many balls
I don't mind pizza being a ball but the pineapples are a disgrace.
at least he didn't mix pineapple and anchovies ๐
sounds kinda good
hmm, one 25steps 1024x1024 required me 20 minutes on my 2GB VRAM setup
it is painful ofc
You are patient sensai.
why would you not use lightning in that situation? LOL
A futuristic, abstract representation of a human brain fused with circuit boards and digital neurons, set against a deep space background. The brain should be partially translucent, revealing pulsing energy and data streams within. Incorporate vibrant, electric blue and purple hues to represent cognitive activity. Add subtle, glowing lines connecting various parts of the brain, symbolizing neural networks. The overall shape should resemble the letter "C" for CogniZone. The style should be sleek, high-tech, and slightly ethereal, conveying the concept of advanced artificial intelligence and cognitive computing.
is this auraflow
What? I have no idea what you're talking about. ๐
Not bad for an early beta with regular updates promised.
I haven't been able to find anything it can't do.
guys the heun ones seem good
much better than dpm ones
ok I did more trials
euler heun heunpp2 dpmpp_2m uni_pc uni_pc_bh2 were good
with
sgm_uniform or simple
however ddim_uniform gave more "baked" results
which sometimes was fun
heun heunpp2 were better for realistic people than euler overall
sometimes images look like something out of ideogram
I'm pretty sure that's the majority of what it was trained on.
Try guns ๐
Compared to Kolors which was midjourney, so it often has a very over stylized look to it (I still love it too)
I'm surprised nobody has tried "girl laying in the grass" yet ๐
Aura? Maybe it's just revolvers it doens't like...? #โจ๏ฝsdxl message
Yeah it could use more training on revolvers.
AuraFlow's text encoder is a "pile-t5-xl", I assume that being trained on The Pile dataset it can understand and learn NSFW, for anyone interested
There's no question it's not censored.
๐คทโโ๏ธ I didn't test it enough to tell, but the "issue" would be also what the transformer was trained on and for how long
Any idea how much vram it uses? I wanna try it ๐ฅบ
the model itself is 16 gigs. This is what Lykon means when he talks about the 8b sd3 being larger than most people can deal with.
that said, it's all one big file right now. I don't know what the possibililties are in terms of breaking that out into image and text encoder components at some point.
AuraFlow model has 6,8B Transformer but has ONLY the big text encoder (T5), I think if you use the 8B SD3 with only the clips you'd have a different "scale" of GPU use
The idea of using the clip encoders instead of the t5 seems like a horrible waste.
Because clip is awful.
When you have a t5 llm there, there's no reason to use anything else.
The clip is only when you're trying to shoehorn it into small cards.
I think if you don't use the clips you won't be able to use 99% of the Loras though
Not sure though
If the Lora's are trained on t5 then it's not a problem
That's what I meant
I think most people won't train loras on the T5
But I have no idea haha
We'll see
Why not? We're already doing it with pixart.
There's always going to be a market for small card technology, but there's little progress if we keep holding onto outdated stuff.
Still not convinced of that, models with clip seem to know more, both styles/artists and obscure characters. Of course might just be the training of new models on synthetic stuff combined with current vlms being pretty bad at describing style and artists being stripped. Still feels like a regression sadly.
Finally!
I'm certainly curious to find out. I get the impression that most of the latest models aren't particularly trained on artist styles because of copyright issues. I think cascade was the last high quality model that still has them all.
Yeah, i hope things like IP-adapter will allow for style transfer in the future
and you're not going to get them again from a major company's public release, due to the wild wild west hay-days being already being over with. lawsuits and threats of lawsuits galore, shut that shit down fast. from here on out, newer models are only going to contain what they are legally allowed to contain and won't be able to include things like named people and artists, without their consent. so basically, it will mostly just be a bunch of copyrightless datasets and any artists that are okay with their stuff being used will have to opt IN and not opt OUT now. so if you want stuff like that again, people will have to risk potentially being sued to train loras/models with stuff they want, until governments step in and regulate that part as well(won't be long, two years tops for pretty much all modern countries).
They could have given a 4b though
no they couldn't
fine, whatever
tired of every single thing on the internet being an argument
just learn how to create loras (et. al.) and train a fine tune of what you want to use yourself.
then don't repeat somethign that's been beaten to death, on the forum it's been beaten to death on.
ugh bye
I say this sincerely, but I think this is where sd3 medium will shine. They apparently considered a 4b but it was decided it wouldn't benefit enough people, so 2b and 8b are the ones they're focusing on. I think once sd3 medium is "fixed" as per their press release, it'll be really great.
When SD3 2b works, it still gives the crispest cleanest images by far. So if it starts to gen images in a much wider and dynamic range, it can be be amazing. If 2b had been any good, all those new models would hardly gain traction.
agreed. claude expanded prompts (even better than gpt4o) has really made the current sd3 2b shine.
Noticed the same, claude is way more creative
gpt4o just kind of expands slightly on what you type. whereas claude adds all sorts of elements including text banners and signs that really enhance things.
which sd3/aura are both really good at.
it's the only version of Stable i've used since it released. and i've generated a LOT of images. it works every time as long as you learn how to use it
Sure sure, that's why SAI says it's a beta model and they're fixing it, cause it works every time now ๐คก
shrug. think what you like. no generative AI model out there turns out perfect results every time, but if you learn how to use it, you can get the results you want, (the first time you generate, not the 100th time) EVERY single time. 2b is no different ๐ง
dalle ect, no that i like it
Pulls off crystalwizard's mask in scooby doo episode "It was old man Lykon all along!"
๐
its Ella
heh
ok let me navigate around the glaring issues of SD3 and make some stuff
points you at this channel and suggest you scroll through and look at all the really good stuff made by 2b first
i heard you liked oranges and put oranges in your oranges
i remove this one orange
off with his rind!
Pixart 800m uses kv compression. It's essentially like a much larger architecture, but compression has drawbacks
prompt ?
A dreamlike, ethereal portrait of Stability AI, its digital form dissolving into swirling clouds of iridescent gas. Glowing blue lines pulse through its translucent body, as if infused with an otherworldly energy. Its face, a blend of human and machine features, appears serene yet intense, with eyes that seem to hold the secrets of the universe within their depths. The surrounding environment is distorted, with buildings and landscapes warped into impossible shapes, reflecting the AI's ability to manipulate reality itself.
dudde did you teleport the bread?
lol pretty much
Rawr
monstrous covid
are those SD3? your getting cool mirrored patterns, i like it
Loads Sacred Geometry LoRA
no this datavoid finetune of pixart sigma
could get it to do all the text quite right.
Wen SD3 ? 
Oh noes... Emad. you OK?
Prompt challenge, wordsmith this into something usable: text of "always coming from take me down" reflecting the text of "never going to give you up"
that sounds impossible
wanting an offline ideogram comes at a price
the scene is at night. A still lake fills the image and the shore is in the distance. 3d text exists physically over the water, reading "always coming from take me down" and the reflection of the text in the water reads "never going to give you up" closest i could get with this prompt
lol they probably mined ideogram for images and didn't bother to filter their dataset at all
Maybe not safe.
yeah its funny
idk how they didn't think of this oversight
but hey, its a free model that's in like 0.1 state
an actual free model with apache 2 license
I didn't test too much but 2 character facial expressions work better
like I ask for the person on the left to be scared or crying and the one on the right to be shouting and angry, it gets it right
likely they used the prompts to make ideagram images as the captions, so in training those unrelated captions will learn that cat, when that cat has nothing to do with whats being captioned.
ideogram suez SAI coz why not XD
SAI didn't make it
it was made by the guy who brought loras to image models
yeah its wild
as far as i could tell, ideagram's terms don't limit using images as base model training
we went out took photos of real world stuff and trained our model on that.
the guy that implemented lora wanted to try to train a mmdit from scratch, it worked they released - the story
its crazy
its' not just stability's mmdit architecture though. they modified it to be more efficient
they will remove that from the next training batch and its gone lol
hopefully
yeah it's v.01 and that's an obvious training data fix
I saw the cat thing, tooo sad
oh i mean v0.1
But pretty hard to get and even if you manage it's only sometimes :p
i am looking into running it, but it's nearly 7b parameters and i don't think comfy is optimized to use it in 16gb of vram
And they use the old sdxl vae, which sadly shows :/
it's a rushed out v0.1 release yeah. They can change that. I'm sure stability had many versions of sd3 and other models that would've counted as 0.1 versions, but never released them. This guy released his and it's the ugly side of the development process that many people aren't used to
the we made something that works stage
fall even released a 16 channel vae, it's surely coming
i made some bad cats on a 2080 TI or i think it all fallback to CPU
minimal viable product . on the blog it says the intention is to kickstart community engagement
im cool with it, i dont care if its not like starbucks, i dont even like starbucks
Funny when SAI seemed to be close to falling apart and rushed 2b, it seemed open generative AI for images was going to be a black hole, now new models are planned or released in various places ๐ (and SAI seems back in business)
starbucks isn't even that great at coffee anymore. least the ones in my podunk hillbilly region.
i dont think they were ever falling apart. that was just yellow journalism sensationalism from people farming engagement and loving that scheudencfreud. stability's got legs for days. they wouldn't have attracted sean parker if they were falling apart
probably a little bit of social engineering from competitors
i feel if all was fine 2b would never been released as it was, but will never know
there's a wide spectrum of diverse situations between "falling apart" and "all is fine"
they started pumping expectations too early
spectrums?! too woke tooo woke
They were trembling...
seems like an over dramatic metaphor. they were hung over after emad partied too hard
it was a solid scale 7 on the richter scale
lol
emad is SAI SAI is Emad, he is the mascot
hangovers take a couple days to get over
i hope they will recover whatever the future their contribution to open ai was awesome

Emad was the vision man tho
without vision its hard
It's a serious Friday if you need a couple of days to recover XD
You weekended the wrong way
well, his vision is now creating magic ai money in the blockchain to fund true open ai, call me skeptical
its like Disney without Lucas.
Maybe Sai will end up like that o without Emad
just hopeless atmepts at cahs grabbign bot no real vision or integrity
i hope not tho but ye
they will be joining the WEF soon
Someone email Musk
Musk is the eternal whimsical clueless billionaire who can save us all coz pockets deep enough
Help me Papa Elon, you're my only hope.
is trust and safety trustworthy and safe if they hide things like that
.1 shift 6 CFG, were glitching
glitch art - never fails.
No idea what's going on with model creators training on Ideogram outputs lately (isn't pixart also trained on Ideogram data?)
Not only it borderline violates ToS (sure, you can claim you downloaded them from HF, but come on), but no way in hell an Ideogram image is better than a real photo or art, and its prompt following and text accuracy is never going to better than what a VLM can caption from a real image (that can also caption very small text).
And forget doing that with a 16ch vae. Sure, pixart and auraflow use SDXL vae, but even that is good enough to not be a bottleneck compared to synth data training that hasn't been at least refined.
FOMO
that being said, this is the #sd3 channel, please keep it on topic if you can ๐
fad?
it's probably just very cheap, since you only have to scrape, no need to (re)caption (which is very expensive on large data)
MJ/Ideogram images come from prompts, so you can get free image+caption (again, violating the ToS)
there are a bunch of datasets like that on HF
making models is very expensive. I wonder why some companies ask you to pay for commercial use over a revenue threshold (or not, like Kolors)
maybe. they seem to have hidden their TOS page
against my expectations, the non-attention parameters of SD3 seem to be more important for training than the attention blocks, despite that I froze them earlier in training it still seems most of the learning is there. It's possible that whatever censorship they did was semi hardcoded in the conditional pathway to remap the embeddings away from certain areas, though I doubt it since characters and styles are also primarily learned in the non-attention blocks.
if there any vram savings to be made for finetuning SD3, it might be best to stick to training the non-attention parameters, which are probably much smaller too
potentially training with those non-attention components frozen from the start would give different results, perhaps the easier training went there but it may not be ideal
"roman soldiers in formation, covered in mud and blood" did get much worse with the non-attentions only, possibly because there were only a few sketchy examples of roman soldiers in the dataset, and most of the learning has gone into those components, good and bad
It's possible that whatever censorship they did was semi hardcoded in the conditional pathway to remap the embeddings away from certain areas
That's just a silly rumor.
AuraFlow has the same issue, right? It's in the "llm nature" of the architecture. At 8b params it scales very well, 2b is an attempt of having the same tech run on local hardware.
(Also not sure why Aura fails it too, since it's almost 7b params)
I was just assuming there was some since there was a screenshot of somebody from SAI saying that "something" was done to the weights before release by the trust and safety team
it seemed a bit iffy since I did see your post about how the issues with the grass pose were there earlier
yeah but that just covered up nudity, did nothing to anatomy
if anything, some biased filter on the dataset might have amplified the issue
but nothing done in SFT under my watch had any effect on that
there's not much nudity in my dataset, but the ability to generate nude people seems to have been learned in the non-attention blocks as well
also 8b was trained on the same "filtered" dataset and has no issues with this use case
can make it flawlessly
you can try it yourself on API
I suspect 2B can learn that pose if focused in the dataset. I've always found that lying poses are the hardest to finetune and didn't assume it was a conspiracy by SAI that they didn't work well in the base model
(even if I still think that WHEN sd3m manages to get this prompt right, it's the best)
She's got a little something in her "pocket".
the hands are wrong, but look at the details. Native gen, no upscaling
SFT on sd3m was done very nicely
too bad the base wasn't perfect
I didn't caption text in my dataset so I'm surprised this wasn't broken ๐
Part of the problem was including feet as sex organs
but this at least ensures that simple pictures come out amazing and that the model is a very (very) good refiner
there's no women laying on the ground in the grass in my dataset, the model just worked it out from general pose examples
also please label "sd3m" when referring to medium ๐
fair enough
it's gonna be a mess when we release Large ๐
give it a name and refer to it like that, Gigantor
using base SD3M as a refiner
SD3XL ๐
lol it actually fixed the text. Maybe a bit too much denoise anyway
No balls in sight 
I usually use 0.15 denoise with sd3m
but to fix text you need to go harder
0.15 denoise on sd3m is equal to about 0.4 on sdxl
that was switching back to the base model from step 10 of 28
The censored feet
with steps I use 28 total and start from 20
using a negative prompt for the first stage
Yes
depends what you're creating, SD3 is amazing at a lot of stuff, but bad at people from the waist down
it's just bad at doing them, like with hands (feet are a little bit easier but also rare in datasets)
use SD3 Large for most things that are not anime
use a SDXL finetune for the rest
SDXL turbo finetunes are also crazy fast
Where do you get SD3 large?
4 steps of dreamshaper xl turbo-lightning for gen + 3 for upscaling + 0.15 denoising (or 8/28 steps) of sd3m for refining
API literary censor feet with blurred images. There's no feet images because they were ripped off in 2b
Lykon do you know if prompting for "8" or "eight" is better or something
only API for now. Still working on it
Lykon adding to my wishlist of stuff it would be great to know from SAI, it's not clear if the first timestep (1000) should be finetuned or not. I think in SD1.x it wasn't which led to greyness problems, but after that I never really kept up
depends what you're trying to do.
Counting past 4 is always gonna be hard and random for models
models can't count sequentially
they count "at a glance"
and even humans have issues past 4

by the way, this is also one of the reasons why hands are a hard problem
and why a huge ass 8b model is better at them compared by small-brain 2b cousin
I was really hoping it was the 4 channel VAE but it seems not ๐ฆ
with attention / non-attentions. Think I've found the cause of my sometimes all-white images. Potentially merging layer by layer could find the problem
yeah I've been paying more attention to hands im photos and have realized how utterly insane hands are
so often a finger just can't be seen due to being bent the right way
most models that are decent at hands basically overfit on small data and fewer hand positions
or take the shortcut of using only anime style or only realistic style
(or are huge ass like 8b)
well SD1.x was fighting against the VAE not even being able to encode and decode hands below a certain size threshold without introducing new lines etc, so I was optimistic a more powerful VAE would lead to a huge improvement