#🏞|general-with-images
1 messages · Page 31 of 1
Sure I suppose

Examples?
Optional
I figure best to not then gen with it
Yeah, thats what I figured, no chance it does it
would be cool
there is like 0 content out there for biblically accurate angels
all we get is the shit ass interpretation people have cherry picked
says something about humans
I wonder how sdxl would draw a caco
stairway to heaven vs a highway to hell, lol
I am not gonna say I am a satanist cause we have cooler merch but I mean... Its a plus lol
btw, I have often wondered if this is partly why 2.1 is so damn hard to train?
SKWISH
I do not remember exactly how to explain this
IIRC, it only pertains to the goof ass 768x copies of SD 2.x
something to do with handling different resolutions, but don't quote me on that
2.1 demands v_parm
Yes, cause SAI did it shitastically where only partially is it 768x768
reason 2.x had stretched necks so much and weird long faces
thing is v2 is 2.0 and v_parm is 2.0+ or 2.1
I tried once to leave it off and total mess
ah
newer SD models are hard to run because they use 768^2 res images
I just remembered
thats not the case
It's not able to do that, unfortunately, it's still applying "word" for whole prompt.
But the problem is - I think it's good in some situations, often we want to apply style to whole thing.
We need seperate "syntax" to "enclose" prompt styling just to one word or phrase, like
woman wearing |red hat| will mean only hat will be red, but there might be problems with implementations of this too...
Since we're typing hat, AI might want to do hats on other objects and it will need it's own enclosure and so...if we going to do that "perfectly" at some point, it'll mean we have to make AI actually understand what is applied to what and "overall" image styles should be it's own separate global thing
What I was talking about....AI needs to be smart, yea.

ppl I see usually get like
20it/s
I get like 10 seconds per iteration

I am just going off of what I have been seeing, and testing, and SDXL DOES listen to that level of granularity, and its tripping me up
yeah, maybe on a super high end GPU
"A photograph of a man with sunglasses, a cowboy hat, blue denim jeans, a red plaid vest, leaning against a beat up pickup truck at sunset with a hand on his hip"
It listens really damn well
based on what I've seen , it applying colors all over the place, as usually - add color all over the place , or at least color shades, not just 1 object
Like 17 on a 4080 or whatever
yeah, so a really high end GPU lol
ye
A bouquet of red white and blue flowers
Look at image colors and shades...and colors you prompted...
I want an amd GPU because I'm on Linux but not really because of AI

4090 is about 28-32 or using the new tech 40-60
those are blue jeans and a red plaid vest... IDK what to say man
tensor power
I solved the issue (mostly) of my screen freezing and my PC locking up after a few generations by using zram
Wasn't using swap before
Now it just eventually freezes and lets me move my cursor
I fixed my issue by going to 48 gigs of ram
my ryzen is not happy though as it wants its CL16 back but can't do it
I was gonna upgrade to a 3900xt
ryzen, especially first gen, is heavily depdant on cl
1.try more specific test, get 1 color and see if it will apply to other objects - I can guarantie - it will apply it to as many objects as it can + add shading on WHOLE image of this color.
2.. Another test would be to specify 2 colors to 2 specific objects only , different objects.
You'll see this 2 colors on everything too
I mean...you was there when Guiz posted images
maybe when SDXXXXXXL comes out
it will render hands correctly
when I train I want to use the ema version, right?
that is a massive skill issue lol
I think it might be a little bit better at hands and holding things, but eh, idk
SD isn't bad at holding stuff anymore I think
if so , not significantly
You guys who are doubting SDXL clearly have never used a base model, cause you would see that SDXL is curb stomping 1.5 lol\
I don't use base models at all
And don't think I can run sdxl
You are just trying to spread false info...that's what I don't like
I see it's decent as base model
How is this false? lol
it has gotten almost everything I have ever tried in it right when all the others are right maybe 15% of the time
statistically, that is a monumental improvement
"A photograph of a purple layered cake ontop of a green table cloth with flowers"
consistent colors, once again
same thing in 1.5
same in 2.1
SDXL is clearly massively better here, I don't know what else I need to say
no layers in the other two, neither of them look like photographs, and the flowers in both others took on the colors described while SDXL didn't
I tested SDXL extensively last night with some friends, and it almost never missed, while the others almost never hit lol
- it can do colors correctly if you specify each thing (but not always), but so can 1.5 models, unless they don't understand words in prompt.
- it keeps problem of applying colors to whole prompt \ image - that problem still persists and feels like it got even worse, but idk
It's improvement on understanding prompt, but not coloring specific parts
these statements seem very unfounded. Maybe its just because I have been messing with it, IDK
"A photograph of a white woman with red white and blue colored braids"
1.5
VS SDXL
I mean its pretty damn obvious, I don't know how much more you want
SDXL can also do images at night
which none of the others have been able to do
Look at this images
it applied colors you specified to all elements on the image, even background is made of shades of this colors
On both models
You are severely nitpicking things
You will see it on every gen you do if you pay attention
if you can't see the leaps and bounds of color and understanding improvements, then my condolences
I think you aren't even listening to what I'm saying...
SDXL continues to drop kick all of the other models
no I am, but its kinda ridiculous IMO
I'm saying - SDXL is better at understanding prompts, it is good base model - I'm not agruing there
Its a fantastic base model, compared to the others
But your statement about coloring is just not correct
1.5 is woke
got it
"A photograph of a husky puppy sitting on the beach at night"
1.4/1.5/2.0/2.1/SDXL
SDXL is the only one sitting, and its the only one undeniably at night
and two of the others don't have beaches
his eyes look like he's coming down from a speedball though
thats a base model thing
if you saw how horrible 1.5's base model was, and then saw how good we have gotten it, then you would see why SDXL is a massive improvement
SDXL with no tweaks listens and produces more reliable results than the best tweaked versions of 1.5 I use
I just got used to prompting on 1.5
"polygon art of a frog"
1.4/1.5/2.0/2.1/SDXL
Only one that made it polygon art, and it also has the best frog consistency and color selection
I mean look at 1.5 lmfao
if we got a base model that listens THAT BAD to get as good as it is now, imagine what we can do with the model that listened to exactly what I asked for
yeah lol
But I hope these examples show you guys just how much better SDXL listens to shit out of the box
cause the jump from 2.1 to SDXL is wayyyy bigger than all the jumps from SD 1.4-2.1
real

hopefully so
using SD for art therapy is kinda meta

Has anyone ever thought of that
im liking xl seems cool so far
inb4 someone makes money from SD and depressed people at once
"A high detail pencil sketch of a forest"
1.4/1.5/2.0/2.1/SDXL
Sure SDXL is a little stylized, but man it is just fucking these other models lmao
noice. appreciate the comparison images
2.1 did good too to be fair, at getting prompt right
it's not high detail though
idk

I love cat smoke emote
oh yea
yeah only xl is high detail
"A simplistic water color painting of an Autumn forest"
1.4/1.5/2.0/2.1/SDXL
You can see just how damn good it listens here
It only has autumn colors, most simplistic yet the best depiction, while having the best water color texture rendering as well
I don't udnerstand what "water color painting" ,means to argue on that 😄
Sytan, how about some photo real comparisons?
Sure, anything in mind?
a water color painting
you can see SDXL nailed it IMO
hmm how about a fantasy knight
something basic probably , without detail or style additions, just subjects
especially for me saying simplistic
Dave chappelle smoking a Cuban cigar
My first image I did in 2.0 was an old prompt and it gave me a trans.
I will try this one

Interesting.
yeah was trying to think of something kind generic but still has style haha
I posted it in here and the next one was as well. We all laughed, then the computer giggled and I pulled its plug.
man I want this xl model now lol. maybe i should pay the 10 usd or whatever for some gens.
"A photo realistic photograph of a fantasy knight in a forest"
1.4/1.5/2.0/2.1/SDXL@wispy nest
wow
SDXL claps again IMO
I will try this one now
do you want it in a specific style, or a photograph?
i am gonna do an oil painting
photo of woman holding spear, wearing long dress, pink hat, shiny ornaments, balloons flying in the air
- that's for "holding" items test + pink is easy to notice if it will apply to other objects.
Ornaments , balloons just for more objects.
@smoky oak A neon cyberpunk mercenary seated at the bar in a crowded pub
if you dont mind a second haha
Alright, let me catch up haha
yea 😄
"A water color painting of Dave Chappelle smoking a Cuban cigar"
1.4/1.5/2.0/2.1/SDXL@wispy nest
SDXL, killing it again lmao
oh wait...added to test I think just yesterday?
yea I did few images basically of simplified version of my swordswoman with ripped clothes lol
couldn't do ripped cloth 😦
I think it udnerstands what it means , but changes are too little , maybe it just doesn't have enough data on it
"photo of woman holding spear, wearing long dress, pink hat, shiny ornaments, balloons flying in the air"
1.4/1.5/2.0/2.1/SDXL
Looks like they all struggled with ornaments and the spear cause I don't think they had enough focus in the prompt lol
SDXL kills, again
@wispy nest
meant to reply, my bad
no spear though
yeah, the spear seemed to disappear from them all lol
Trying to show my mom stable diffusion but my GPU sucks
And it can't even load in 2.1
anyone know when 1.4 was release? just thinking what the time between it and the progress to xl has been
I'm using a model that just
Only renders characters
"Digital art of a neon cyberpunk mercenary seated at the bar in a crowded pub"
1.4/1.5/2.0/2.1/SDXL@wispy nest
She wanted a flower it started making an anime girl
SDXL wins these, all day
ooh thats nice
its understanding, composition, colors, and over all consistency are just... next level compared to the others
hm, yea, actually SDXL did better on applying colors there, less pink everywhere
SDXL I meant*
oh ok, I was gonna say lmfao
yea on everything else pink is just everywhere
all good lol
yeaahhhh
Any other ideas peoples?
I think SDXL has firmly beat everything else on every single one so far
a glowing cyber-frog standing on a tropical leaf in a forest filled with neon lights
sytan, i dont suppose you have adobe firefly access? be fun to compare it to xl haha
firefly is ASS
oh really
i saw some say it was good
just a sec
Looks like they all struggled with ornaments and the spear cause I don't think they had enough focus in the prompt lol
- I think weight might fix it, but also might lead to more issues with color, unless everything is specified
here we go
color 'spread' seems like something theyll fix in time i guess. like hands. its an odd one
Here is the same prompt in MJ, Dalle 2, firefly, bing, and SD
damn thats thorough as hell
A futuristic city skyline at sunset, blending elements of cyberpunk and Art Deco architecture
MJ, Dalle, FireFly, Bing, SD
SD wipes the floor lmao
bing is pretty decent
Its the only one with anything art deco, its way higher res (cause there is no cap), and its also just way more consistent
is bing like dalle 3 or something?
I literally cannot load a model after loading one in when SD starts
It crashes
yeah, bing uses a fancied version of dalle 2
I have a feeling like that wasn't exactly fair comparrison cause of resolution difference
struggle bus gpu life
Y'all remember when ai art generators hated every human of color
maybe the rest of the tools are good, but oh my god
hmm idk looks decent to me
looks decent, bro
this is a paid monthly service lmao
this looks decent to you? lmao
I'm happy to see more and more of them, it keeps the competition going (i hope haha)
They are a multi billion dollar company charging people to use this, they should have something that at LEAST works decent lmfao
right lmao
Firefly base image gen AI is trash
I would rather use dalle mini, cause at least then I wouldn't be having my bank drained lmao
People will still use it for other tools, whatever they was (I already forgot), since it's already integrated with photoshop
a rose i generated for my mom
lots of artists use photoshop already and paying for it, so why not
just default settings on base sd2
lol
Yeah, the other tools are cool, but I hate how many people are praising adobe, AKA one of the shittiest and most corrupt companies alive, for being "ethical" in not using other peoples information
like, good for you, now stop fucking over the people that support you, and also stop stealing money from people?
now we need google to drop one of their models to get some attention lol
and nvidia
oprah gif voice you get a model, and you get a model
2.0 is ok i think
If it is then that means my guess the issue is the V_param thing of 2.1
really is the fault
imagine a dinner where YOU are the dessert...!
yes ma'am
(she's a vamp)

Hi all, can you tell me please , which Prompts to apply to make Stable Diffusion change the interior? Not drastically, but like a new renovation and new furniture? I'm getting unrealistic.
murder scene?)))) I just want to renovate and need ideas)
the words "only on" with more weight would help 🙂 but sadly the models have no parameter logic for this inside
theres a model that used to be forbidden to speak about
is dreamshaper related to it
im using dreamshaper with no vae
the one with a "baked vae" gave me deep fried colors
a forbidden model? cool
"A glowing cyber-frog standing on a tropical leaf in a forest filled with neon lights"
1.4/1.5/2.0/2.1/SDXL
got distracted lol
SDXL continues to win
Fallen Angel in Hell - just for the Lolz
XL?
My SD shat itself again
Making images
I feel bad for my 1650
"Vector art of a cabin in a forest next to a lake at night"
1.4/1.5/2.0/2.1/SDXL
1.4 and 1.4 STOLE that shit lmao
wtf
we can't use it outside of these low res gens yet
lmao
the two first images are pretty much what copilot does but for art
yea back to back comparison with other models shows
Are you doing just 1 gen for each of them, no cherry picking?
A whale playing in tropical beach. Sunset golden hour. In the forest there are Panthera Onca. Ultra-realistic, cinematic, chromatic aberration, incredibly detailed and intricate, FKAA, TXAA, RTX, CGI, VFX, 8K
sd 1.5 RPG-4 Model
Angel with fire wings
@smoky oak see those? points up there
"a realistic clay sculpture of a mans head"
1.4/1.5/2.0/2.1/SDXL
number 2 is fucked up lmao
thatface lmao
I guess all are accurate, just depends on what you imagine the talent level of each sculptor to be lol
anyone happen to remember when 1.5 was release?
"A whale playing in tropical beach. Sunset golden hour. In the forest there are Panthera Onca. Ultra-realistic, cinematic, chromatic aberration, incredibly detailed and intricate, FKAA, TXAA, RTX, CGI, VFX, 8K"
Sorry?
people were posting prompt requests for xl
like salgado and matt
hey its the experiment that is fun to see
few somewhat decent gens on woman wearing cloak, robe, gloves, holding sword in a hand, upper body, close up
,SDXL
Hands looks better at average, holding and sword isn't, unfortunately 😦
yeah
oh well, still is progress
"A whale playing in tropical beach. Sunset golden hour. In the forest there are Panthera Onca. Ultra-realistic, cinematic, chromatic aberration, incredibly detailed and intricate, FKAA, TXAA, RTX, CGI, VFX, 8K"
1.4/1.5/2.0/2.1/SDXL@hazy moth
just quick overview non cherry picked
This is not a good prompt at all for these models IMO. They aren't the same we are used to
so how do you like it from what youve seen so far sytan
you see some good potential or do we have another 2.x on our hands haha
"Clip art of a simplistic pine forest"
1.4/1.5/2.0/2.1/SDXL
Its got massive potential... But we have to hope it doesn't have the same trash text encoder as 2.0-2.1
this base model is by far the biggest improvement we have ever seen, and I am not even showing off its insanely good text
"A wooden sign with the word "beware" on it"
1.4/1.5/2.0/2.1/SDXL
SDXL steamrolls the others on text
damn
didnt know it did text
how many words can it do before it stops making sense?
a full sentence?
"A street sign with the word "caution" on it"
1.4/1.5/2.0/2.1/SDXL
it does text good, and reliably as well
you are using the new dream studio, I am using old dream studio for more control
they are using different SDXL models, IIRC
there are 3 different betas for SDXL, not sure which one we are on comparatively
There is SDXL beta, SDXL Tile Beta, and SDXL 2.2.2 Beta
they are very different from each other
same thing on old DS
If we need just an email and password we can just make infinite amounts of accounts
I switched over to my other account I just made, it has 25 credits
I'll play around a little more and then call it there
a haunted, screaming demonic pig charging towards you in a scene from the movie Mr. Pig's Apocalypse
I am not doing comparisons anymore, just flexing what SDXL can do, cause we have clearly seen its way better than the others lol
yeah, seems to be
Latte art of a cat
Geometric water color art of a rainbow wolf head looking at the viewer on a white background
I'm a sinner.
XL?
Dragon, sleeping inside of a dark cave
1.5, 2.0, 2.1, xl
A photograph of Jesus dancing in a club
Lmfaooo
oops, didn't send the image
Sadly no, I am not there yet.
I can only imagine how hard this will be to train with it knowing text as well as it does.
That's a surprise, 1.5 didn't know flags.
(SDXL)
Pencil sketch art of a man smoking a cigar
My standard SD prompt was, "full body portrait, painting, christian art 1500, happy Jesus sitting in bed dressed in a fluffy fleece pink onesie-pyjamas, looking at the sky, holding cup of steaming hot coffee, landscape with cross in the background".
An anthropomorphic donut
this one is way too cute haha
i hope they release XL public this week, like tomorrow even lol 🙏
I really hope so as well
but I hope even more that its not SD 2.0/2.1 levels of disappointing
although seems like 2.1 knew flags too
A gauche painting of a golden retriever dog
seems great from what ive seen here at least
but i guess we'll see how it does once we get it on auto or whatever
I had a stroke so my brain sometime block things I think I should know, it have happen again, so: What was XL and how do I test?
SDXL is the new version of stable diffusion with around 3x parameters
I believe 2.6 billion instead of 849 million
and you can beta test it on dreamstudio
pls chat : give me some underwater pics with sharks and a treasure chest, sunken pirate sailing ship in background.
So far, its blowing the other ones out of the water. It listens way more to specific things like poses, angles, and distinct colors, and it can also do text with no problem
Ty all. I was almost on right page, it was just that half way thangs was forgotten 🙂
haha yeah it can be a little tough to find kenny.
oh, SDXL can also gen images at night, something that 1.4-2.1 all suck ass at
^1.5 couldn't do that too properly, maybe like 1 out of 20 gens
at first i thought you mean in reality haha
i was like damn, what time is it? too late for image? :checks watch: ⌚
you can see, it can do night time, instead of twilight
something the other modles have always been trash at
I have a better example
although it does look studio lightings
"A photograph of a husky puppy sitting on the beach at night"
1.4/1.5/2.0/2.1/SDXL
coffee cant be a sin 🙂
You can see SDXL got the puppy, its at night, there is a beach, its sitting, and its a husky
all the others said fuck you lol
SDXL has been thoroughly impressing me
The text capabilities also help a lot
you guys ever drink coffee at night?
I do all the time cause I have the gene that makes caffeine useless to me lol
proper night on 1.5 based model
(not base tho)
got it from my mom
nice soulless
yeah, illuminati and other noise offset models helped, but SDXL can already do it massively better than base 1.5 with nothing else. So hyped
maybe stable will get its mojo back lol
wrong one
"polygon art of a frog"
1.4/1.5/2.0/2.1/SDXL
Only one that made it polygon art, and it also has the best frog consistency and color selection
nice. very nice.
SDXL is showing a higher level of understanding over all others
A charcoal portrait of an old man smiling
The fact that the base version is getting results this good is very very very exciting
some of these base SDXL results are matching 1.5 fine tuned results
An oil painting of a small forest town at night
will sd xl a normal version u can isnstall and train own models? or only web-based ?
it will be released for our use soon
Right now it is in beta as they are finishing it up
preview right now for credits on their officials ite
Woah, it can do double xposures out of the box OMG
can someone make a sticky that there is no picture bot here XD^^
well sytan was doing some images for xl so people were asking maybe idk
a geometric double exposure photograph of a city and a mans head
nice. very nice.
It can do double exposure with no TI or LoRA :D
This doesn't look like they are that hungry for money if they just allow us to make new accounts using only email, it takes like what, 15 seconds to register new one with firefox relay
i think the trump pics have reached a lot of people that will be now aware of picture ai's
was funny^^ realistic good job
A rough painting of a frog smoking a cigar in a casino
oh!
one other massive thing about SDXL
it is REALLY good with angles
just a sec
liberty
Oops
from below
above
at night
being demolished
it knows this shit so well
I messed taht up, oh well lol
very impressive
you can expect wayyyy more when I can run this on my own hardware
Same prompt, one made in DreamStudio with XL Beta, the other one Locally with Automatic1111 WebUi, can you see the difference?
the pink color seems not spread over the pic
yeah sd xl seems to haev this problem as always, but now it seems much better
"photo of a anthropomorphic watermelon pose sitting on a forest path, rain"
maybe this is easier to say than did but they should add some of this face filter or beauty you see on MJ. that way when people share they images from XL, they will see pretty stuffs you know
free promotion for how good it can look idk
My favorite model in Automatic1111 manage to do the prompt rather well, "photo of a anthropomorphic watermelon pose sitting on a forest path, rain".
lol yes that is great
nice general. looks like darth vader sister maybe lol
smh
I forgot what I was doing and went back to being silly 🙂
Not what I expected, but it was rather interesting.
my queue is full but...a female Darth Vader XD nice idea 🙂
There he was, 5 generations away.
i swear a female darth vader in a puple Darth Vader Armor sitting in an abadoned rusty Deathstar XD
sadly my queus is full for the next hours XD
Where are you testing this? I've tried it in Dream Studio and 80% of the time it ignores the text entirely, the rest of the time it does about as well as existing models...bad -_- EDIT: nvm. it just needs to be on something other than the "Enhance" style. Photographic works great
ok, the female part is the hard point , to figurte it out^^ but nice anyways XD
seems funny
pink light saber and blue pulsating lightning maybe
but thats to much, just for the lolz this one
hehe ^^ this is nice-the sitting position XD
Darth Vandra
like s(he) comes for a job interview XD
Oh yeah, enhance is to enhance an image you submit haha
a female Darth Vader sitting in an abandoned rusty Deathstar, pink lightsaber blue light
Last one, "watercolor painting of a female Darth Vader sitting in an abandoned rusty Deathstar, pink lightsaber blue light, by Carl Larsson"
very nice
good one
how much for a print?
the light saber seems ähhm ...yeah 😉
I have to do one more.
made for a female it seems
maybe printer sales will shoot up again lol. people can print out they creations on ai now
Seems to have problems on anything larger than 1 word though. "3D Model" and "Photographic" definitely do the best. They usually get 2 words ok, struggle with 3+ but can sometimes get 4 words and drop 2 if you prompt for 6. a million times better than existing, but definitely still not perfect
DeepFloyd seems to be the same from what i have seen
I am not sure I get what you mean
This one was rendered but not upscaled,
he means the more words you use, the less accuracy it has
kenny this is XL?
those line detail are really cool to me
For example:
A wooden sign with the words "This is the story of a girl" on it
usually won't get that whole thing. Got a few that managed "This is the story" and a LOT that got some combination of those words mixed up and missing some portion of it, but nothing that got the whole phrase. EDIT: those seem to struggle even with just 2 words actually. Must have been getting lucky before. Does seem to manage 1 word most of the time though
No, Automatic1111 loacally
Oh yeah, I get what you're saying
oh my god...i expirienced that my modell knows exactly was a wastland raider is XD oh my goddness...i have to sleep but....XD....
I'm certain it will get way better as we learn how to properly prompt for it, as well as with the better fine-tuned models we will see once it releases
i wish writings would be possible on signs and so on
that's what we're discussing with SDXL. it is with one word and that model
We can with SDXL, but it seems to be limited to 1-2 words right now
SDXL can do really good text
lets see how it looks once it gets public release
for instance
seems like some cool things are possible you know
yeah, we need some fine tuned models to know first
but it seems to have more promise than 2.x did
now im hyped for xl...thats a great point for product Marketing
(i used niche because it's not a super common word and will almost never be found on a sign)
everyone seem to know it was going to be rough to work with right away
I found its good at replicating any word, as long as you prompt it right
Would be curious to see if it can do special characters
yeah. single words as i was saying. definitely cool. long way to go though
thats not something I thought to try
so i am not saying that xl is the boss now, just that it at least seem to have some promise for us you know. time and test will reveal the truths
yes
thats great
and adobe and whoever else is releasing theirs lol
based off of how much we have been able to get out of SD1.5, we can reasonably assume (if it doesn't have 2.0/2.1's terrible TE) that we can get monumentally better results out of SDXL
yes sytan
afte rall of my research, I can confidently say the jump between SD2.1 and SDXL is far greater than the jump from 1.4 all the way to 2.1
it does not. i also tried japanese, it doesn't do that well either, but it does try that at least. most special characters are completely ignored and $ and @ are converted to s and a EDIT: scratch that. was definitely lucky flukes again
The fact that they were able to train words suggests that getting fine tunes with other things in it should be easy, assuming the TE isn't trash
if the beta testers are all happy, im sure it will be a good version
a few models/months and maybe text will be easier for all these model who can say you know
I am ecstatic with what i am seeing, but there is the overwhelming sense of dread that SDXL is gonna have promise but be trash for training, just like 2.0/2.1
why?
why what?
why would there be that sense of dread?
text encoder is very important for training/fine tuning
something people have seen?
because it happened with 2,0 and 2.1. They should have been way better than SD 1.5, but because nobody can train them cause they have a horrible text encoder, they are slept on and often produce worse results than SD 1.5
left one was supposed to look more humanoid like
so much for the good mood lol
We just have to PRAY that SDXL is this massive level of power paired with a way to extend and finetune its knowledge
the one that that does give me hope is how much better SDXL listens... The ability for it to listen depends on the text encoder as well
And SDXL blows 2.0/2.1 out of the water with prompt understanding, so there could be a new TE at play here
i myself was a 2.0/2.1 convert for a while...but eventually 1.5 models (and embeddings, loras, locons) just worked better again. there was a -very- short window where i felt 2.0/2.1 was better
same vein as this one, concept art styled
I can train a LoRA for something in less than 10 minutes and have it look phenomenal in 1.5
sorry for repost
I just hope they release it public this week
like, my very early Na'vi (Avatar) LoRA was trained in 6 minutes, and I have results like these out of 1.5
not months lol
i HIGHLY doubt that will happen lol
how long you think custom
genuinely have no idea. but i wouldn't be surprised if months is more accurate
so get you dreamstudio credits if you want XL now lol
that's where all of mine have been going
Think about it this way, the text encoder is the brains here. Its what converts what you say into things the AI can understand and grab onto. Thats why Midjourney listens so damn good, cause it has a much better TE than SD
yeah look at the rose leslie aka snasa stark modell, it floods prompthero at the moment and of course arya stark too (and wierd mixes of them) sd 1.5 is amazing if it goes right with prompt and data
BUT
SDXL is showing leaps and bounds better prompt understanding compared to 1.4/1.5/2.0/2.1
Which suggests the potential of a massive TE improvement
which could also mean they fixed 2.0/2.1's shit ass training
you think we have a winner with this XL here?
I don't wanna get too hyped, but if they got SDXL to train as good as 1.5, we are not gonna be prepared for the sheer quality increase that is gonna come out of SD in tghe next few months
I mean FFS
1.5, the model that gives you this for "Polygon art of a frog" on its base
i think personnally 3 Monts after public sdxl release there will be hotfixes and trained optmized modells , than it would e better. the release VErsion will be good too, but the community makes the job after a time.
is the one I was able to train to do this
So if that happenedon 1.5 with it being that bad base. Imagine what SDXXL can do when its base gave me this
SDXL can do some things better on its base than even fine tuned 1.5 can
So if it has the same headroom for training, we aren't gonna BELIEVE the shit we will be able to do
Assuming they improve like SDXL did, cause 2 more papers from 1.5 landed us on 2.1, which was functionally a massive step back lol
lol true
I just hope stability has listened to the pleads of us all and fixed their trash TE
To tame the hype: it still can't do things that are extremely unusual well. I've tried a lot of ways to get it to output "A banana painted blue next to a pineapple painted red" to no avail.
sigh
reworded, tried using quotes, parentheses, negatives. nothing
oh well, i am just hoping always for some improvements you know
we will get the perfect model in some years yet
and sometimes it just gives weird results. "a banana painted blue" lol
like i get that it was using the "fantasy art" style...but it should still understand basic terms
similarly: "A blue banana" with photographic
bing image, mj5, adobe firefly, gpt4, and now stable XL all releasing around the same time. good time for some ais lol
so with sdxl i can wirte real words like "pls don't touch" on the shirt? YES! - Need that!
As we were saying before...it struggles but is a million times better already. some examples
thx^^
it cant do that yet
does better with the whole word "please"
but it shows clearly a big progress
Remember, 1.5 can't do a lot of shit that people have been able to do with the fine tunes
right
so think of all the stuff SDXL CAN do that 1.5 can't, like proper poses and shit, which SDXL was CRSUHING in my tests
SDXL is reallyyy good with things like "from above, from below, from the side, side view, from behind"
I tested it rigorously and it worked every single time for me
yeah. i'm aware. i'm still excited, but much more tempered until i see final results. right now it reminds me of a few loras i trained that ended up super baked because i trained too long. i could go back, but there's no telling where sdxl will end up at release...and we won't be able to choose an earlier version lol
also, SDXL has a built in way better understanding of styles, like pencil sketches, fresco paintings, etching, water color, vector/clip art, and much more
yeah we cannot judge XL on one night of some official site gens haha
yeah, true
but people are human so the rush to judge will continue of course lol
only good thing is we can directly compare it with the old versions, of which would have the same favoriting on said site
even when it is public release, we must remind others it is still early for this
My initial impressions are fucking impressed
but my expectations are pessimistic after the 2.0/2.1 fail
oh also, at least right now, SDXL is way more open to weapons, something that has been removed from the other models
you think we will see it have some success?
like I got SDXL to generate a fucking fantastic gun
on that i 100% agree. i'm definitely part of the camp that believes we will eventually have ai that can generate an entire movie, including sound visuals and story, before i die.
heh... My friend just sold an AI to Unity game engine that can model, texture, rig, and animate entire game scenes off of just a 10 word text prompt...
We are a lotttt closer than you think
he's buying the beer night time out sytan haha
25 sek vids are avaible, adobe and ms has shown those video ai's. nividia will help, Adobe will open its database... so u can be sure if there is no end of the world u will see those in the next 5-10 years
i'd like to believe that...but honestly i'mma call bs on it -_- only way something like that could even work is with an entire library for specific use cases
I worked on the project with hi, but I was a beta tester, not a dev
just a sec
I am trying to find the video
to be fair, i'll probably be dead a lot earlier than you're imagining lol. i'm not likley to be around in 10+ years
And with some poor hand painting.
I know this is not a demo video, but I got to work with this project just a week after it started, and it could already do 8k resolution textures with normal, color, AO, reflection, emission, transmission, roughness, metallic, and specular maps
https://twitter.com/unitygames/status/1638510865069531138?cxt=HHwWhICzkeu8lL0tAAAA
Unity AI. Here to bring you flying alien mushrooms 🍄
Curious? Apply to be a Unity AI Insider: https://t.co/CLHRYylqbP
1705
391
but adobe sucks hard , judges all softwares like SD says its for the creators bla bla and then a few months later they release an own software and says that the creators given all rights to adobe so it will be OK...
such a evil step
And my friend is the one that made it. Hell, I even did our promotional renders lol
hope sd and yes mj too will win in front of the judges
These were all made with it just 5 days after its conception
I uploaded wrong image... darn my dumb brain.
completely generated by AI from just 10 words each, with all 10 passes
maybe there will be some ai we do not even really know of. it release and we are all in shock from how it is the new king
its ok, the image is nice
that's impressive, if it works as you've said, but there's a huge difference between a major corporation/company having that kind of thing and the average consumer having access to it
All I know is he told me that when he was working with Unity, he can honestly say that the tools they were able to make together make that look like a fucking joke
he was just at GDC doing tech demos of it
It used to be called Barium AI, and I still have a life time subscription to it lol
Ty, but the idea was not mine and it some fast work. But I am still in training.
But from what i remember him saying, its completely free to use with unity
it uses your local GPU to do the calculations, so its free to run without having to use their computers or service
as i said, impressive, but i'll believe it when it comes to us...the huddled masses. i have no doubt in my mind that major corporations already have stuff that blows consumer level ai out of the water. i have major doubts that it will ever reach us in the next few years
the only sad thing is that if picture ai goes more and more public, no news can be known for its correctness-the old internet wisdom "show us a pic or its didnt happen" will not be true anymore cause a pic is avaible every time XD
Unity announced the release of it just a few days ago, but its not fully out yet
I remember when he streamed the announcement to me
Unity gave him a free A6000 to use for his dev process lol
so jealous

if it makes it to us before some company snatches it, that'd be awesome.
there's a lot out there that we could've had by now if previously open source projects didn't close up
did one of u see the latest epic show? it was with much ai stuff, especially the level creation is awesome i have to say
no i didnt but that sounds awesome
Trying to figure out whether or not to use a VAE for dalcefo painting model, sometimes colors get really washed out and im not sure whether or not there's a non-anime vae for that or if its even necessary
What? it was snatched up lol
by unity
look for it, yes its game-related but the possibilities, video ai is crazy if u have nvidia supporting you 🙂 realtime rendering and animate a 3d Modell synchron to speakin out of the box at home...
Unity bought it from him
I don't think anybody is gonna be buying the entire unity game engine any time soon lol
cause the new tool is built into unity
so u can make a video as ur boss and saying that you(he) quits the job XD
unreal engine 5.2 has the first advantages with epic,nvidia and adobe partnership
theire ai buddy right now
unity is open, they could easily sell to someone that isn't
the whole ai topic is in his first few steps, in 10 years-i cant imagine what happens to get the correct information
all u see can be a fake
^^
it's great to be optimistic, and i genuinely hope it all keeps progressing at the consumer level, but it could easily hit snags on the way. i still believe it'll get to us within the next 10 years, and i hope it's half that or better, but i'm not gonna hold my breath until it happens
I guess we just have to remain optimistic
Now I will call it a day, so ...
have you tried the 840000 vae?
you run into some guy wearing that armor in the forest and you just say, oh shit, i guess i never was the main character, and get ready to die
lenin and trotsky longingly look at a chessboard because they forgot the pieces
I made this devious thing
how did you get text to work
I PUT THE NEW FORGIES ON THE JEEP

Can I post this on r/wordington
(don't go there)
No problem...but i am gonna go there
Test
/imagin
How to prompt to generate images?
Human Commander Zavala... Made using Hassanblend
I still can't find the token key/word for that model nor f222.
Took me 8 hours to get that result...it started out like this
aye
damn
Yes, earlier on Sunday
I couldn't find it though
which model?
Joe Rogan and an alien
CivitAI
They are finally dealing with their porn problem
cause JFC, I wasn't expecting to get flashed when I signed up FFS
it won't be the real version I'll publish, I'm not happy with the current tokens but I'm still putting it on my "experiements" page currently if you want
and good morning to you too 🙂
(it's the mix version of the PoW models)
on prompthero ?
have dozen of new favorites since i investigate ai picture software
lots of people do have links in their profiles too
and I may have spammed notifications for you on civit this week end then with all the updated models :p
gz!
I was at 1k2 when I came back to discord/AI some weeks ago, so yeah, it feels motivating
I started getting comments asking for specific tokens to be added, so they could merge it
i have to learn much more to trai nown modells, but its a lil bit hard cause im not an english native, im from germany
it's a complicated thing to learn
a good friend of mine is from germany, and started doing models too around here
got some good vibe to surf at some point
and got a real job making models out of it
i have to look for a good explanation in german
but it seems sdxl will be a good starting point for that
WTF^^ nice one
it's my english guide through google translate
ill give it a try- cause u have a heart for cats XD
thats ok
don't feel like everything is necessary in a first try
i have to learn to swim before i try deep diving
exactly
and it feels like a jumping board
you can prepare for that dive for very long
and feel like it's gonna be the best of dives
the reality is it won't
so prepare to flop, and learn each trick to fix what missed
but don't stay too long waiting to jump
(yeah I'm not english either, I don't think it's "jumping board")
and this will grow, wait ..one year the whole world will be on picture ai generation, the trump pics have shown many people whats possible
did u see them, they're great :-9
I saw a lot yeah, maybe not those ones, I'm not very american politics fan
and chatgpt as a woldwide known peice of ai will get an image generating part aswell
I just had a jumpscare lol
my phone told me I should be at the dentist
x)
good joke phone
so everyone could use it in edge browser in nex tmonths, beta is open
yep, chatgpt and the likes are about to really transfigure a lot of things
google nividia and adobe comes aswell with their own ai s
little training finished just now, for @white pivot . let's test and show a little the results, no idea if it worked yet
it will soon require AI assistance just to follow up on the AI news
look here Projekt M from Epic & Nvidia & Adobe
Learn more about the future of Unreal Engine and more in this latest State of Unreal that was showcased during GDC 2023.
#unrealengine5 #ue5 #gaming
Timestamps:
Intro: 0:00
Unreal Engine 5.2 Demo - 3:51
Fortnite Chapter 4 - 11:32
Cubit Studios - 18:00
King Arthur Legends Rise - 21:28
Lords of the Fallen Technical Showcase - 23:40
Project M (N...
thats crazy runs on 4090 like a charm, so in 2-3 years everyone will do that
New models are available locally?? i missed something !
watch the metahuman part aswell
depends on the version u have
just updated atm
so there are 1.5 and 2.1 , but 1.5 is better in my opinion
sdxl will come soon than it will be the king hopefully
Hmm interesting
@flat pike Thoses models need scripts yaml to works with A1111?
ok which sd version do u use?
Automatic1111
no which version
??
1.5 or 2.1?
Both, have lot of models.
then look at the modell download page, the modell uve postet seems to be for 2.1
so its not trained for 1.5
I know, and i know models 2.x on A1111 needs yaml scripts
forget any scripts. install Stable Diffusion and load a modell - thats all for the start u need
automatic1111
if u installed it u see ur base model, its numbered so u know which model will be supported
if ure unsure just download a 1.5 and a 2.1 and test both...^^
gradio broke some things
was an update and almost all the trouble tickets since is due to gradio
there is a mem leak too
hit me and hit this ticket where he has an A6000 48 gig vram
When i try to load sd21-unclip-h.ckpt on a1111 :
did you update auto1111 in the last two days?
just 5 min ago
do u have other models in ur model directory?
but yeah maybe an update broke stuff, then u have to roll back
Saul Goodman from better call saul/breaking bad
love the Movie "Nobody" from this actor
with good 'ol doc Brown actor from back in time^^
I haven't had time to watch that yet. Where can I watch it?
Nice pictures, but your model needs a vae file for color correction (same goes for your profile pic). Which model did you used ?
anything 4.0
and yeah, my second day using stable diffusion
Ah no problem, on the site of AnythingV.4 there you find also the .vae.pt file,
this file you need to put into the models/Stable-diffusion folder
I can link it if you want.
Stable diffusion
let's do pizzas
Yeah
webUI loading
then make pizza irl
multicolored stones
For most of the Anime models you need one. Most realistic models have the vae already included
I don't have an oven
photo of a delicious multi cheese pizza, tomato sauce, cheese topping, blue cheese topping, crusty crust, very detailed, RAW photo, taken on iphone6, 70mm, 8k, highest quality, MasterChef
let's do one in HD
I'm not, I'm a little scared to update right now, automatic1111 seems broken for too many people, and I don't want to go and bug fix myself x)
1024x1024
the difference in performance is ~10-20% for the better
I have no errors in the program
thanks guys...now im hungry 🥘
the results will be the same
I need to update for sure at one time, but I'm mostly trying to transit more of my process into ComfyUI
yep I know, I was mostly joking on that one ^^ I'm currious about the exact tool you use though, there are like 15 different main "Stable diffusion" going around currently
Discord : https://bit.ly/SECoursesDiscord. Torch 2 / Pytorch 2 is now supported along with new DreamBooth Automatic1111 Web UI extension. If I have been of assistance to you and you would like to show your support for my work, please consider becoming a patron on 🥰 https://www.patreon.com/SECourses
In this video, I am explaining how you can man...
ok so yep, it's AUTOMATIC1111 🙂 we use both the same
I'll try to do an update then, but I'll copy the venv first I think
do u have some with crickets and grasshoppers^^?
PIZZAAAAASSSS
yes, it is desirable to do this
mhhhh i smell a food model coming 'XD
I plan to buy a 2060 with 12 gigabytes, because the 1050ti is already too old for such things
yeah it starts to be a little old for this
I'm on a bigger card so the performance boost there could be nice, but I'm already under 1 second per 512x512
my 512x512 image generation takes 40 seconds
harder to pull out of the model, the insect topings
maybe ill try that one day, im open to try it
I'm really not

cough No thank you. I'm good.
its allowed in food in germany since last year XD but i didnt tryed it yet
I have eaten crickets before
this one is horible
don't click that "spoiler" button guys
this is only for balrogdx
is this one a fake (2.0) ? https://huggingface.co/SG161222/Realistic_Vision_V2.0
i try it at the moment but it has so bad results compared to 1.4
bad is the wrong word, its dumb to understand stuff und geve lower quality strangly
possibly, the person has only 4 models, all versions of that Realistic Vision, but their account is very new, so hard to say
but if they had a beef with CivitAI, they may have switched website
ah ok, thx 4 info
