#💬|general-chat
openclip prompts sooo much differently and prefers natural language over token words
I really appreciate your help. this is the last picture, I just wanna make sure my SD is working as expected. Sometimes I still get something like this, sometimes it's a little better. https://imgur.com/a/bNB0Icg
yeah. that's 2.1 for ya. the bad cropping was a big part of it.
2.1 hates people. prompt for cats and scenery or food instead 😉
it's 1.5
ah yes. base 1.5 would've had the cropped data set too. been so long since i've used those old base models.
ok, got it, thanks
negative prompts are really powerful. i have one i use on everything: dead, deformed, disfigured, demented, jpeg low quality, compression artifacts. i used to use "cropped" too but i'm really not sure that was effective. Refined models work great. i like dreamshaper for 1.5
or protogen
I'm dead myself, idk about anyone else
lol negatives don't have to be exact. i think of them more like the latent vectors we're trying to steer away from. dead bodies often have flat or deformed features
sd 1.5 has a base resolution of 512x but i always push it to 640 myself. another thing people do is a hires pass. it'll use the first image it makes to help guide a larger image
often i just set this to 1.5 rather than the full x2 multiplier
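A quick sketch of the hires-pass arithmetic mentioned above: take the first-pass size, multiply by the chosen factor, and snap to a size the model can work with. The function name `hires_target` is made up for illustration, and snapping down to a multiple of 8 is an assumption based on the usual SD latent downscale factor, not something stated in the chat.

```python
def hires_target(width, height, scale=1.5, multiple=8):
    """Return the second-pass size for a hires pass.

    Each side is multiplied by `scale` and snapped down to a multiple of
    `multiple` (8 here, an assumption based on the SD latent downscale).
    """
    def snap(value):
        return int(value * scale) // multiple * multiple

    return snap(width), snap(height)


print(hires_target(512, 512))       # (768, 768)
print(hires_target(640, 640))       # (960, 960)
print(hires_target(640, 640, 2.0))  # (1280, 1280)
```

The x1.5 setting keeps the second pass at a bit over twice the pixel count of the first, versus four times for x2, which is presumably part of why it's the gentler default here.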
1.5 is, in theory, a 1024 model now that we have resadapter, but the extensions for comfy and a111 aren't out yet, it should take a week from what they said
kohya hires extension helped that a lot too before res adapter
kohya deep shrink or hires fix is very good from what I saw, but at least for me it doesn't work with controlnet on comfy
comfy requires you visually code it into the workflow a lot more tediously. i just use it in a1111.
#🏞|general-with-images message your prompt in dreamshaper 8 with kohya hires , rendered at 1280x1280 @stoic nexus
i keep hearing comfy is better. and while i like node graphs for specific productions, they're often very specialized to one purpose and you need wholly new node graphs for every project. Useful in a lot of cases, but ive never understood why people think it's inherently better. i love my UI's to be consistent myself
sometimes extensions come out for a111 first so comfy is not totally better, but definitely more flexible. depends on the workflow needed, sometimes a111 is more than enough
my big pet peeve with comfy you can't page up and down you've got to click drag the ui, zoom out, zoom in on it. misclick and you frig up your whole node graph. ooop didn't have it locked and frozen i guess thats just user error. despite the UI creating the situation to begin with.
node graphs are STILL programming. They're visual OOP. comfy requires you know how to build scripts and processes. I do, it's just, that's a whole different gear than my art gear
Another thing to learn about for great generations. 1.5 generations almost always require them for quality imo. Negative Embeddings. https://civitai.com/search/models?baseModel=SD 1.5&modelType=TextualInversion&sortBy=models_v6&query=negative tons here
If I'm using a LoRA and the faces are terrible with anything except closeups, the LoRA is probably just not trained well enough, correct?
Using a different LoRA, I can get faces that look very good even when reasonably far away
or, small in size
often loras have mostly closeup shots in the training data, meaning it doesn't know how to do the person at a distance so well. What i like to do is adetailer in those cases
What is adetailer?
an extension. detailer nodes exist on comfyui too. they mask the face, then inpaint it specifically with a specialized prompt.
hmm ok
well, I would like to make my own LoRA, I'm a photographer and I'm going to take a LOT of pictures of my face
near and far, wherever you are 🎵
I'm not sure where to start other than by taking the photos
Up until now I use SD in A1111 webui
and i knowww that my heart will go awwwwwww-onnnnn

you know how meatloaf's version of all coming back to me is better than celine's? i cry myself to sleep thinking about how he never made a version of my heart will go on
dang
Is it possible to make a LoRA using SD in A1111 webui, or do I need something else?
His name is Robert Paulson
lora training requires something else. there are extensions for a1, but they're bad. i prefer to use bmaltais' gui for kohya-ss
i had a dream last night i was infected with the protomolecule
love that show. the private eye was my favorite
i still haven't finished the last episode cuz i was so bummed it was ending
i'm hoping three body problem has a similar hard scifi vibe, but its made by game of thrones producers so i mean, i'm not getting my hopes too high
that is a show that should still be running, best sci fi ever
i didn't feel like they were even close to out of material
ok ty I will look into this
i like the level of scifi coming out now a days. love foundation.
I do miss the expanse a bunch
i haven't seen that one
it was so refreshing to actually see real physics in the expanse
apple tv has a couple good ones i've seen. foundation and for all mankind
no standing on the holodeck as you pivot 90 degrees at the speed of light while drinking tea and not spilling a drop
for all mankind is real hard science fiction that way
startrek i consider to be the softest of science fictions. it's a story telling vessel. the big thing it doesn't do is special relativity. even with warp engines.
their sub space faster than light communications would break all causality
yep pretty much
dune is kind of hard science fiction in a weird "if magic was real" sorta way
won't spoil it for anyone here but even the protomolecule, the basic concept of how it spreads and evolves, wasn't that far fetched
yeah
why is it whenever I join vc, Everybody is just dead silent
||von neumann probe basically isn't it?|| maybe not. similar idea
@quaint scarab sorry. distracted. meant to give you this link https://github.com/bmaltais/kohya_ss
yeah true that very similar conceptually
got it from his youtube video ty!
from an evolutionary standpoint it does just make sense
space travel is hard for meat
|testing|
||no like this||
but you gotta do \|\| to make them show
||the basic idea of life spreading from one system to another requires adapting to diff ecosystems and diff biology for it to be effective... so it all makes sense. the weird stuff with the physics and the ring? maybe not. but the concept is rock solid imo||
and you gotta do \\|\\| to make the \s show
||the ring just eating all inertia was pretty magical||
yep
Is 8gb worth it for Stable diffusion if you just wanna dink around and have fun? or do you feel limited with 8gb to the point its annoying?
Just try it
If you like it get a better card lol
There are plenty of capable cards for around 200$
definitely enough
Hello! I was wondering is Stable Diffusion 3 out? I haven't updated my Stable Diffusion since like October or earlier haha
Ah, okay. So a couple more months then?
Great! Thank you! I appreciate that update!
My card has 8gb vram and its fun to use to make photos you want but anything above 1000x1000 is likely going to fail
i usually do like 544x700 and use a 1.3-1.4 hires upscale on a seed where I'm satisfied with the lower-quality image's output
my fans be goin crazy
i actually managed to powerkill my gpu once by accident
my PC wasnt accepting GPU at all and didnt recognize it for like a whole day
i killed it mid generation 😭 iwas losing my sh*t
Likely less, as we’re already seeing images
You can go much higher with 8gb (assuming you're using an nvidia gpu). Not that it would make any sense because you'd start getting duplication / visual artifact.
following up with an image in #🏞|general-with-images
When is sdxl going to be back
Beep boop. Never fear, SDXL is here!
another noob question. Can't find in community manual. What do you call the panels that you can use to denote groups in ComfyUI? like the ones shown in the ComfyUI examples - https://comfyanonymous.github.io/ComfyUI_examples/2_pass_txt2img/ txt2img, Hires Fix etc
Guys any ETA of SD3 launch?
yes
now
just found out an exact time
count to 1 billion
it will be out by then
why is the chat so dry
Because it is a very serious time of the day 😛
02:15, and i am sitting here in the UK going through SD examples I could easily do later today. but hey, I'm in the zone
discord version?
#1047610792226340935
Any good free online video upscalers??
Beep boop. Requests only in #🏞|general-with-images . Thank you.
ok ty
Is there an app for iPhone ?
probably the “NO ALCOHOL / DRUGS / HOT WINGS” sign at the entrance
I wonder how hard sd3 will be to train
Hey, just started using Stable Diffusion and Inpainting.. I was wondering if there was an option to extract only the changed inpaint texture, instead of the whole image in Stable diffuse?
the delay i heard is because they are aiming to make it easy
hey
can anybody explain the difference between stable diffusion ai and unstable diffusion ai
ones unstable, the others not
What is this https://ella-diffusion.github.io/
unstable diffusion is a UI that uses Stable Diffusion
...............
...................
1/ it's only been a couple of minutes,
2/ Rule 4 #✍🏼|rules-and-tos so I'll just say that any local tool is capable of anything
@still glacier Cheers for reminding me about #✍🏼|rules-and-tos. Very informative.
I'd also like to point out the link #1080946152318443610 which is a goldmine of information for us Newbies. I was deficient in looking there in the past week and I cut my newbie teeth on this subject matter. Being a tech geek, and not a creative, a lot of the terms etc are alien to me.
Any tips for getting more vivid colors on generations? No matter which model I choose I always seem to get this more "pastel" vibe with soft colors. Works great for some things, not so much for others
Thats a VAE issue
You need to download a VAE for color fix
Realistic models work good with vae-ft-mse-840000
And Anime models work good with kl-f8-anime2 or clearvae
Sdxl needs sdxl VAE or fp16-fix
Do I need to do something to activate it? Cause I'm using Pony Diffusion XL and it already recommended downloading SDXL VAE and putting it on the VAE folder, which I did
VAE quick guide:
1/ What is a VAE?
It's the part of the Stable Diffusion pipeline that encodes images into latent space and decodes latents back into pixels. Aka it transforms the math into pictures.
2/ Where do I put my VAE?
- VAE with `.vae.pt`, `.vae.ckpt`, `.vae.safetensors` extensions go into the models\Stable-diffusion folder
- VAE with `.pt`, `.ckpt`, `.safetensors` go into models\VAE
3/ How do I use my VAE? Three possibilities:
- Either you name it similarly to one of your models (eg: Anything-V3.0.safetensors + Anything-V3.0.vae.pt); by doing that it should automatically load the VAE when you load the associated model.
- You manually load your VAE by going to Settings -> Stable-Diffusion -> sd_vae and selecting your VAE
- You add an easily accessible VAE dropdown at the top of your page to quickly switch VAE by adding `sd_vae` to your Settings -> User Interface -> Quicksettings list
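The folder rule from step 2 of the guide can be sketched as a small lookup. The helper name `vae_folder` is hypothetical; the suffixes and folder names are copied from the guide. The only subtlety is that a `.vae.pt` file also ends in `.pt`, so the longer suffixes have to be checked first.

```python
# Suffix lists and folder names taken from the quick guide above.
BUNDLED_SUFFIXES = (".vae.pt", ".vae.ckpt", ".vae.safetensors")
STANDALONE_SUFFIXES = (".pt", ".ckpt", ".safetensors")


def vae_folder(filename):
    """Return the folder the guide says a VAE file belongs in."""
    name = filename.lower()
    # Check the longer ".vae.*" suffixes first: they also match the short ones.
    if name.endswith(BUNDLED_SUFFIXES):
        return r"models\Stable-diffusion"
    if name.endswith(STANDALONE_SUFFIXES):
        return r"models\VAE"
    raise ValueError(f"not a recognised VAE file extension: {filename}")


print(vae_folder("Anything-V3.0.vae.pt"))   # models\Stable-diffusion
print(vae_folder("sdxl_vae.safetensors"))   # models\VAE
```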
For step 2, if the format is one that appears on both lists, do I need to put it on both folders?
what
there's no crossover between the two sets
what's your VAE's file extension ?
Oh sorry it's 7 AM and I got some commas and dots mixed up 😅
Anyone know what the best base model to use would be for photorealistic images that don't mess up the models faces when the subject is further away?
I want to train a LoRA but I'm using PhotorealV2.1 and my reference photos always do this
insert picture of a very screwed up face here
seems to be an issue with image models in general, your best bet is probably to just inpaint there
could even crop face out, upscale it and regenerate, then resize it down and cut it into the original
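The geometry behind that crop / upscale / regenerate / paste-back trick can be sketched without any image library. Everything here is an illustrative assumption: the helper name `face_crop_plan`, the 25% padding around the face, and the 512 px working size are not from the chat, and the face box itself would come from whatever detector or manual selection you use.

```python
def face_crop_plan(face_box, image_size, pad=0.25, work_side=512):
    """Plan a face fix: which region to cut out and what size to work at.

    face_box   -- (left, top, right, bottom) of the face in pixels
    image_size -- (width, height) of the full image
    pad        -- extra margin around the face, as a fraction of its size
    work_side  -- longest side of the upscaled crop sent to img2img/inpaint
    Returns (crop_box, work_size). After regenerating, resize the result
    back to the crop_box size and paste it at crop_box's top-left corner.
    """
    left, top, right, bottom = face_box
    img_w, img_h = image_size
    pad_x = int((right - left) * pad)
    pad_y = int((bottom - top) * pad)
    # Grow the box by the padding, clamped to the image bounds.
    crop_box = (
        max(0, left - pad_x),
        max(0, top - pad_y),
        min(img_w, right + pad_x),
        min(img_h, bottom + pad_y),
    )
    crop_w = crop_box[2] - crop_box[0]
    crop_h = crop_box[3] - crop_box[1]
    scale = work_side / max(crop_w, crop_h)
    work_size = (round(crop_w * scale), round(crop_h * scale))
    return crop_box, work_size


crop_box, work_size = face_crop_plan((300, 100, 364, 164), (768, 768))
print(crop_box)   # (284, 84, 380, 180)
print(work_size)  # (512, 512)
```

A 64 px face becomes a 512 px working image, so the model gets far more pixels to draw the features with before the result is shrunk back down.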
pretty neat trick
no smoke
i recommend to anyone doing text2image, do prompt as negative: Lack on layers.
🤔 what does it do?
Does controlnet tile somehow make it so that large changes aren't made, only details changed?
Because that's what I constantly want.
elon musk
it set the body parts and their paths right and correct a lot of other things.
Like the 3D environment
Someone can just delete the square base of my statue generated with Stable Diffusion? I can send the file.
Just select it in inpaint and use fill, maybe change the prompt to influence what is filled with
This distorts my statue!
Hello, I wonder how can I avoid a anime character always looking front even if i'm prompting for dynamic poses
Is there a community of artists (not prompt artists) were I can chat or ask art related questions.
It shouldn't if you only select the base
Can you delete it for me? This distorts the feet.
Even if I wanted to, I'm on mobile currently, but I'm not interested in modifying someone else's work, sorry
Ok np!
Does anyone have any idea when the bot will be back up again?
Probably after or during the testing of Stable Diffusion 3
no concrete proof though
another tuesday another no stable diffusion 3
Lol yall getting hyped up just to get disappointed
when does sd3 come out?
i doubt it
when it's ready
no images allowed here, chief
looks like youre a community guide, must be new role haha
i'm always disappointed lately. it's american election season. watching my neighbors cook crack is overshadowing everything
the biggest disappointment, T will probably win it. not hyped at all.
ouch, not even typing his whole name, that cant be good lol
i think biden will win again
I think you're gonna lose either way tbh :p
the union is in its last days and i don't think it'll progress as the USA post 2030. Falling like the USSR
lets not get too negative guys
Nah that I don't see happenning.
i do. it's happening
ussr collapse because its economy was horrible., america is still richest nation on earth
so was russia if you count the oligarchs' money
I'm not gonna pollute #💬|general-chat with politic tho.
russia was depressed when it collapsed. it was no small secret. reagan thinks he asked them to tear down this wall but it had little to do with it
its just so disappointing. the whole world
yeah lets be real, nothing good comes from that
stable diffusion 3 not being out is just another drop
well take heart, people been predicting the fall of civilization since it began
not civilization. just the union.
think of how you would have felt after the civil war
"sure wish i had electricity"
or when segregation was finally smashed
end of the day most people are not about to disrupt their comforts for breaking of any union. talk tough online sure, thats nothing new
segregation is hardly smashed. it's still practiced low key.
of course but when the civil rights act was passed the south thought it was the end of all things and many said another civil war is gonna happen
i think things have actually started turning back towards the 60s after the bams. the swing back. the american pendulum. progressive then VERY regressive. and i don't think it'll survive this next cycle
barely half a century out of the civil rights movement. the people who were mad are still alive and raised a generation
well emad said it'll be coming in the next few days on Sunday (March 10)
but uhhh, even I start to lose trust in Emad at this point
yeah that's my wish
yes the discord server bot testing
I have a whole prompt list ready 
hmm I think I should write them into CogVLM compatible type of prompts
cause it works better with human-like descriptions
I hope everyone will know this. I don't want to see people get disappointed when they write "1girl, beautiful, in luxury hotel" and not get exactly what they imagined.
The reddit exploding with negative comments (besides censorship, that's inevitable and somewhat understandable)
people are gonna complain about the lack of chatgpt prompt editor also
Some people really crave for a "scan my brain cause even I don't know myself what I want" button.
there will 100% be a wave of people prompting nai style booru tag prompts and declaring that the model can't do anime
I might even be one of them to some extent :p, it's faster to type that kind of prompt
I learned the way, it's done.
I can speak AI.
I will personally love this method of prompting, even if it will take more time.
Explaining what I want exactly is something I have been craving for a long time.
For sure it will be more effective.
Though maybe a small language model (1B-3B or less that modifies or expands your prompt) running alongside a quantized version of T5 would be magnificent. I just don't know if both could fit in VRAM and RAM.
might even get rid of regional prompting altogether
it has the same clip layers as sdxl PLUS t5, so prompting will have a level of familiarity. I think that was 2's big optics failure. People learned how to speak sd15's clip layer and 2 was like "what are you even talking about?"
Yeah the dataset is half CogVLM (detailed descriptions) and half "Tags" I guess, as in how we used to prompt.
and SD 2's different clip lmao
yup for sure, people didn't take time to learn how to speak sd2
i don't think t5 will do away with loras in regions. while the issue is mitigated a lot there will still be uses for it
They will release controlnets alongside SD3 iirc
honestly if I do rough paintings of what I want and use img2img I will get exactly what I want in the exact colour palette I feel like the image should be in.
Probably yes, but for a lot of people it might work "well enough" out of the box.
When trying to imitate Tekken 3 with AI (I was inspired by someone else). It has issues with multiple characters which was annoying.
yup! prompt game is about to change a lot. but not today
multiple characters is why regional prompting is a big deal
I'm excited about DALLE3 level prompting but without that weird painterly look to everything.
I'm too lazy to set it up in Comfy, I bet it's probably easier in Auto1111/Forge
Its stupid cause I have to set up exact pixel regions
And color bleeding, it'll take a color description and apply it everywhere, that'd be huge to see that fixed
you need a very specific workflow to do it in comfy. just need to open the extension's options to use it in a1.
i love node editors, but people really over hype comfy. It slows me down like speed bumps every 5m
I use comfy cause I have simple workflows that don't utilize stuff that would be annoying without GUIs
prompt for red eyes and the whole image will be crimson
and it gets fast updates
there's StableSwarm though, which I'm happy about
Day 1 SD3 support, uses Comfyui as a backend but it has a GUI on top of it that is more beginner friendly
and you can go to the node workflow at any time if you want to modify the workflow
i often am only loading comfy up to try the newest stuff since it gets it faster. thats about all.
It's great for prototyping but i don't find it conducive to flow state very much
I like the comfyUI node philosophy but personally I can't work without a "model/lora/etc viewer"
With SD3, my workflows are gonna get smaller and smaller
yeah i'll likely be using sd3 in swarm on release day
People still do 1.5 word salads on XL, I expect to be amused when we start seeing people's prompts
I just hope that img2img will be as effective as I imagine
swarm's a healthy compromise. Especially now with the multi controlnets
I still don't get this, does SDXL work somewhat better with natural language (human-like prompting)
like I heard this from someone in here
people often prompt to the clip layer that expects word salads. you can specifically prompt to the openclip layer too, but no one's using that
the open clip layer loves natural language style prompts. word salads too but it has a whole different buffet of words it prefers. "ornate" works better with openclip than og clip
oooooh
right now i think most workflows and ui's just fire one positive prompt into both encoders
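For anyone who wants to try prompting the two encoders separately: the diffusers SDXL pipeline takes `prompt` (routed to the original CLIP encoder) and an optional `prompt_2` (routed to the OpenCLIP encoder), and reuses `prompt` for both when `prompt_2` is omitted. The helper below only builds the keyword arguments, so it runs without diffusers installed; the name `sdxl_prompt_kwargs` is made up, and the parameter names should be double-checked against the diffusers version you have.

```python
def sdxl_prompt_kwargs(tag_prompt, natural_prompt=None):
    """Build keyword arguments for an SDXL pipeline call.

    tag_prompt     -- keyword / "word salad" style text for the first encoder
    natural_prompt -- optional natural-language text for the second
                      (OpenCLIP) encoder; when omitted, the pipeline is
                      expected to reuse tag_prompt for both encoders
    """
    kwargs = {"prompt": tag_prompt}
    if natural_prompt is not None:
        kwargs["prompt_2"] = natural_prompt
    return kwargs


kwargs = sdxl_prompt_kwargs(
    "ornate throne room, cinematic lighting",
    "An ornate throne room lit by shafts of light from tall windows.",
)
print(sorted(kwargs))  # ['prompt', 'prompt_2']
# Hypothetical use, assuming `pipe` is an already-loaded SDXL pipeline:
# image = pipe(**kwargs).images[0]
```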
I've been using word salads for SD2.X and didn't even notice
it still gets the job done
hmm
I guess the same will be with SD3
it will get the job done
but this time you get rewarded more for using natural language I guess
its the community refinements that add a lot of word salad capability too. since everyone is training with tokened captions
ah
That's a really good point
we're starting to see trends where people use llm to generate large natural language descriptions instead of keyword tokens from wd. there's a hard limit with wd captioning and that kind of captioning style.
i plan to use both in future captioning. a csv list of tokens and one large natural language description blob
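A minimal sketch of that combined caption idea: a comma-separated tag list followed by one natural-language description. The layout (tags first, then the description, joined by a comma) and the helper name `build_caption` are assumptions for illustration; whether this ordering actually trains better is exactly the open question in the message above.

```python
def build_caption(tags, description):
    """Join de-duplicated tags and a natural-language blob into one caption."""
    seen = set()
    unique_tags = []
    for tag in tags:
        cleaned = tag.strip()
        # Skip empty entries and case-insensitive duplicates.
        if cleaned and cleaned.lower() not in seen:
            seen.add(cleaned.lower())
            unique_tags.append(cleaned)
    # Collapse newlines and repeated spaces in the description.
    blob = " ".join(description.split())
    parts = [", ".join(unique_tags), blob]
    return ", ".join(part for part in parts if part)


print(build_caption(
    ["1girl", "red eyes", "1girl "],
    "A woman with red eyes\nstands in a hotel lobby.",
))
# 1girl, red eyes, A woman with red eyes stands in a hotel lobby.
```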
we'll see how that goes
Oh damn, bot still down?
Probably in preparation for SD3 testing.
Let's get these devs some coffee and hugs
I'll be sure to post positive things on the subreddit. I will have much to praise the devs for.
As long as prompt adherence is as remarkable as promised.
Hmm some of the prompts that I have written exceed the 77 token limit. I wonder if T5 will help, because it has a 512 token limit, though I heard from someone that T5 is also limited to 77 tokens despite the default 512 token limit.
512 tokens would be plenty 
I wonder if it will be able to do sprite sheets well? Like being able to describe each frame of the animation on a sheet.
that sounds extremely difficult
yeah probably won't happen any time soon
Wait for their Sora alternative lmao
Where did you read that we are still limited to 77 tokens? That's the opposite of "long natural prompts"...
lemme look it up
I know you can do some pretty good stuff with controlnet but I'd like a Text to 2D animation lol
huh... in a graph it shows "77 + 77 tokens"
I need to look more deeply into this
hmmm
Also I need to know how Clip Tokenization works
isn't it more on a word-to-word basis instead of gpt2 tokenization?
Because in either cases, the hippo test exceeds the limit both cases if it's either 77 or 77+77
it consists of 242 words
Yeah... their tests used very long prompts... then add your typical style prompts & negative prompts to the mix and you'd hit the end of the road pretty quickly.
if T5's token amount is not limited to 77 (so it works with 512 and can modify the conditioning) and works similarly to Gpt2 tokenization then the hippo test is below the 512 token limit.
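The budget reasoning above can be roughed out with arithmetic. CLIP's 77-token context includes the start and end tokens, so a 242-word prompt is over both 77 and 77+77 even at one token per word. The 1.3 tokens-per-word ratio below is only a rule-of-thumb assumption; a real count needs the actual CLIP or T5 tokenizer, and the helper name `estimate_fit` is made up.

```python
import math

CLIP_CONTEXT = 77        # includes the start and end tokens
T5_LIMIT = 512           # the default T5 limit mentioned in the chat
TOKENS_PER_WORD = 1.3    # rough heuristic, NOT a measured value


def estimate_fit(word_count, limit, special_tokens=0,
                 tokens_per_word=TOKENS_PER_WORD):
    """Return (estimated_tokens, fits) for a prompt of `word_count` words."""
    estimated = math.ceil(word_count * tokens_per_word) + special_tokens
    return estimated, estimated <= limit


hippo_words = 242
print(estimate_fit(hippo_words, CLIP_CONTEXT, special_tokens=2))      # (317, False)
print(estimate_fit(hippo_words, 2 * CLIP_CONTEXT, special_tokens=4))  # (319, False)
print(estimate_fit(hippo_words, T5_LIMIT, special_tokens=1))          # (316, True)
```

Under this estimate the hippo prompt only fits if T5 really gets its full 512-token window, which matches the conclusion in the message above.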
this is a big headache
When training a LoRA with Kohya, do your training images need to be the same resolution as your regularisation images?
When SD3?
I'm using a base model trained in SD 1.5
when it comes out
Okay so I just saw that male nipples are a thing in SD3
This might sound odd but this is good news
The censorship may be SDXL level
heck yes.
maybe this is why Anatomy is fine and not SD2.X level
not just a portrait with SD3
Meaning it can likely be trained in, fine with that
I hope so
With the prompt coherence you'll be able to generate one green nipple and one blue one at the same time
hell yeah SD3 is the best 😂
What a time to be alive lol
but yeah this means they weren't as restrictive with NSFW as with something like SD2.X and Cascade
I haven't used Cascade much yet. I have it installed locally but haven't had time to really test it out
I saw Emad is speaking at a seminar this weekend. Maybe he'll release it after that.
Well actually its Monday
i heard cascade was terrible
😭
i haven’t even gotten access to SD3 🤷♀️
How to use?
Where is the guide?
Help pls
"the censorship" is the dumbest meta. can't wait until it's irrelevant
biggest reason it has any traction is unstability ai's disinfo campaigns. The people who took crowd funding and never did anything with it.
i'm honestly surprised that stability hasn't hit them hard with tm infringement
Already???
bruhh
Yessir holy crap!
Emad:
#SD3 preview really is almost ready to go, 8b Turbo model just done.
We are tuning parameters and picking candidates for the preview so we can get the right feedback for the tuning.
Also team has been cleaning up inference code for partners.
Phew.
This is hilarious, they trained a turbo model whilst also preparing for preview and stuff
but I'm rather impressed
Another congratulation to the dev team 👏
Also working on a program for tuners & more, want full ecosystem supported for the main launch let's make it awesome together.
This is amazing
they want everything to work on Day 1 or very early on
This is such a good decision
day 10000 of waiting for sd3
So it may come this week at last
Turbo models can render at extremely low step counts
do many people use sdxl turbo?
it's good for realtime usage
to me it's a novelty as it's low resolution and blurry, but there must be usecases
you have to retrain for it
it would be useful if the same seed gave you same results
so u can quickly preview renders
I use Lightning though
its kind of OP
I'm sure the turbo models are good too, the examples look great on DreamShaperXL
I should experiment with more of them.. I dunno.. I can render 4 high res images at a time in a minute with regular models and that feels really fast to me
That's really good
if I could generate 1-2 highresfix images under a minute with regular SD3 I'd be fulfilled
I just don't know how far optimizations can go for SD3
at the expense of...?
I started with Disco Diffusion and 3D fractal programs that took waaaaay longer
detail?
yeah detail
can you pump out like 100 images in turbo model
yeah sure
ohhhh
almost too good to be true
but look, turbo looks good by default on this page https://civitai.com/models/112902?modelVersionId=351306
I use lightning myself and it looks perfect for portraits
It does however I'm a sucker for detail
Even if it's placebo
I'll give it a shot though
How much faster is it
it's just simply less steps iirc
ah
I don't know if you have to use a lower resolution for Turbo like with the base SDXL turbo
with lightning you can use base resolution
I don't exactly know tbh
Both uploaded same time but seems turbo has much better reviews
interesting
i like lcm loras. can do images in a few steps with full models
lightning > lcm
needs the sgm_uniform scheduler, but if you using forge or comfy, lightning definitely the way to go
What about turbo
different beast.. turbo generally requires a model trained on turbo to work really well (i.e Dreamshaper or the base model), whereas lightning gets the same full quality and speed just by use of the lora
Thank you
😠 when you get a picture you want then use it with controlnet and then the lighting for NO REASON goes absolutely whack. Frustrating
I can't wait for SD3
Hopefully they start giving out invites soon
Because the images they showed were quite swanky if I may say so myself
i don't mean to offend, but if i'm being completely honest, swanky is quite the understatement kind sir.
Cherry-picked swanky, but swanky all the same
does anyone know if the images shown for sd3 were done with basis input (ie- no loras, refiners, optimal settings of any kind)?
not sure about cherry picking. i've seen a wide variety of styles and quality
also what type of loras will be needed to use with sd3? Will the same base models/loras etc be usable in sd3 to create more advanced images or will we need to wait for a whole new set of those things to make use of sd3?
@astral goblet
sweet, ive just seen the twitter ones and havent looked further. I suppose my view is blinkered
I think there'll probably be an x-adapter at some point, but we'll probably need new loras
sd3 is a whole new base architecture. old loras won't work
it's also a transformer-diffusion model rather than just a diffusion model. so xadapter models might need some retooling to train properly
didnt think the would, wonder how long it will be before suitable models get created to make good use of sd3🤔
apparently training code will be released with it
what type of hardware will we need to run sd3 and will we need to download a whole new version of foocus to run it?
I am guessing I will be able to run SD3 and SDXL on the same box as long as the installations are in their own paths and I fire it up with the respective gpu.bat file
lots of scaling options for sd3
wouldn't be surprised if people get it running on 4gb
Interesting, puts my 4090 24GB, and my 192GB of system RAM way out there
Do you know if we will be able to use it on the current fooocus install by any chance or will we need to download a whole new version of fooocus?
💪
how many dimms you got for 192?
@opal hedge thank you dont know about x-adapters..got do some reading on that sounds like
well you got to futureproof yourself. for at least a couple of years. 4 x 48GB corsair
Comfy/a1111/forge/fooocus will all need updates to run sd3
you'll want new procs before you dent that pool
@nova zodiac ahh thank you..im sure they will be tons of tutorials on what to do when it releases, so probably dont need to worry myself too much to figure it out now
you'll be able to cache t5 and maybe a couple other llm's as well as use the full 8b param model on your card
comfy is being used internally for sd3 generation, so they're getting it working that way
i really hope they implement things like consistent character and make it a bit easier than it currently is. I have figured out how to do it pretty well with inpaint/image prompt mixing and image prompt/upscaling but i havent got it perfect and it is a bit of work and time consuming. Would be nice to kill Midjourney's new update and do a better method on SD3
Nice, I have been learning Comfy for the past two weeks. slowly getting to grips with it
does anyone know if something like this is in the pipeline for the near future of SD3
its already possible today. midjourney is just launching their implementation of ip-adapter face swaps today
they would've had to train their own model instead of using insightface
I was on Midjourney for a good part of the day and they got some issues/complaints with the cref and other option they mention for consistent character output (im sure it will get better) but you can only keep your images private for a fee of 60$ with stealth. very expensive. (you could do it the long way with saving images and gen-id and deleting images) but it isnt the best workflow and i dont know if it is foolproof.
mj's playing catch up today. there's even instant-id for sdxl
So, i am hoping SD3 will create a better method to create consistent character in the upcoming releases. Do you know if that is in the pipeline for the near future for SD3 @astral goblet
where is that instant-id for sdxl? Is this process something that solves (consistent character output) @astral goblet
we've also been able to use LoRA on model weights this entire time too.
SD3 will have all this too since it's open weights
if you had MJ weights, you would've been able to do all of this last year too
in the earlier images some stability staff said that it was only the best of 4 (if at all, the bottle one was nailed all the time)
but then again that was a way inferior quality version
it was half baked
being able to pick so many different ones though? you know?
you're kind of dealing with a cherry orchard in season at some point
yeah
I don't really care much for the image quality with sd3, that can be fixed. I'm more excited about the immense prompt comprehension
not the same thing tho
one you have to pay to use
dont need sd3 for that
exactly
How so? Do you do img2img?
see ella paper
this is an interesting paper
They didn't even release the code for ella so it's basically worthless to me
models next week! ||heh||

yes they did say weights are out next week
it looks great, but also will apply to sd3 too
t5 isn't an llm, but encoding text embeds from an llm into t5.. woweee
not sure if it will help sd3 since its already a transformer
SD3 has T5 so I'm not sure how much it'll help
a llm? or an llm? hmm
Sorry i know you're answering a lot of people at once, me being one of them. But I am hoping you could clear up 3 questions for me.. 1.) you mentioned there is ip-adapter face swap for MidJourney. Is that different than the cref and the other method they mentioned from this very last update? 2.) you mentioned SDXL had instant-id (where can I find that)? 3.) if i am incorrect in understanding you: is img2img and consistent character generation planned for SD3 in the near future? @astral goblet
i'm thinking it'll help with that feature chatgpt has, where it'll do a story book sort of prompting. but i'm really not knowledgable enough to say for certain. just a speculation i have
I mean, if ella helps I'd be super happy I just don't think it'll be really necessary for SD3 users. XL and 1.5 users will rightfully be hyped though
There'll still be a transition period before sd3 becomes useful so to speak
wouldn't SD3 replace XL though
unless they can't figure out how to run SD3 with T5 on 8-12GB vram
-
- the new feature is basically the same thing as ip-adapters we use in the open weight world. they aren't using the insight-face model though. I like instant-id the best. Forge webui has it built in iirc. you might need to download the weights yourself.
- sd3 will almost certainly do image to image. being open weights, it's easy to implement. consistent characters may be very possible through a variety of methods. Can't wait to dive in myself!
sdxl has legs since it has momentum.
Makes sense
its still better without t5
Though SD3 will have controlnets and stuff on arrival
Ella i think is infant stages. The potential of hooking all manner of LLMs up to diffusion models? ugngn.
and I am afraid if finetunes will actually hurt prompt adherence or not
one rando idea is a discord bot that just turns the conversation ongoing into highlight images
I think it might if you go the ponyxl route and basically nuke the model into behaving
lmao
the encoding layers are frozen/static. What we will see though is the same kind of destruction of latent knowledge that current refines do. The base models KNOW that belly buttons don't appear over clothing. The community refinements think that's how it works
i hit that know for no reason. ohwell
In regards to the next week release, was that for ella or sd3
ella
we'll get the preview soon. sd 3 release in disguise. transformers
I wish we'd get a proper release date timeline for sd3
I wonder if we'll get the weights next month 🤔
The sooner they release weights, the sooner they get a surge of membership subscriptions right?
lets goooooo
@astral goblet thank you for the info. I'm looking into sdxl instant-id. I see a download on civitai, and I wonder if it is what I'm looking for to create a consistent faceswap from a base image. I don't see any comments or much talk about it on civitai (for the model I found), nor have I found many tutorials on it for fooocus, but there seem to be some for A1111 and comfyui
My conspiracy brain guess is that Emad thinks chatgpt will release a new model soon (March 19th?), and releasing sd3 now will only get overshadowed in the news.
Gotta get that venture capital money
god damn corporate conspiracies /tableflip
takin ma sd3s
They're taking our jerbs
TOK ER JOERRBS
speaking of jorbs. you see the new homestar runner?
over shadowed by ai releases too
I haven't even heard of it
Must have been all the sora sd3 hype
https://www.youtube.com/watch?v=2-QQpL1YjeU just like the classics, but higher tech
Hi all, I have a question which might be very dumb but wanted to run it past everyone anyway. I'm an artist currently studying my masters and was wondering whether the terms of use for Stable Diffusion includes artistic work as "commercial use"? I'm unsure whether creating a work within stable diffusion and then selling that work as an artwork would count or whether it would come under copyright unless I was to alter it to a certain extent. Anyway, any help would be appreciated or if anyone knows any information which could help. Cheers xox
Yep stable diffusion allows commercial usage, just make sure the content you generate isn’t copyrighted already, like popular characters or celebrities
Does anyone know how to properly download instant-id for fooocus? I can't find a tutorial for fooocus, just comfy and A1111. (I see there is "ip-adapter instant id sdxl" and "control instant id sdxl" on civitai, not sure if this is the one I need or not), but if so there's no tutorial on how to download it properly. I really want to get consistent faces sorted and I'm hoping this will make my current process a bit more effective. ANY help would be great
yah, seemed to go by quickly
dont think fooocus has support for instant id yet. the guy has been busy with putting forge together.
but fooocus does have a ipadapter face swap built in already
you should be able to find a few how-tos on how to turn that on. my big pro tip for it is to have the face swap start a ways into the diffusion: set start to 0.3, or to 0.5 to kick in halfway through the steps
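To make the "start" tip concrete: the start value is a fraction of the sampling schedule, so the step where the face swap begins can be worked out with simple arithmetic. This is only a sketch. `first_active_step` is a made-up helper, and the way any particular UI rounds the value is an assumption.

```python
def first_active_step(start: float, total_steps: int) -> int:
    """Index of the first sampling step where guidance that begins at
    fraction `start` of the schedule would be active.

    Assumption: the UI floors the product; real UIs may round differently.
    """
    if not 0.0 <= start <= 1.0:
        raise ValueError("start must be between 0 and 1")
    # Small epsilon guards against float error such as 0.3 * 30 landing just below 9.
    return int(start * total_steps + 1e-9)


if __name__ == "__main__":
    for start in (0.3, 0.5):
        print(f"start={start}: face swap begins at step {first_active_step(start, 30)} of 30")
```

The idea is that the early steps lay out composition freely and the face reference only takes over once the overall layout is settled.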
https://twitter.com/Lykon4072/status/1767681747490116053 current weights aren't democratized yet
wtf? weren't they just saying the 8b ran fine with 24gb vram?
¯_(ツ)_/¯
i'm sure theres some rejiggering to be done first
earlier versions had that done
@astral goblet I have been searching high and low for a vid on a proper faceswap outside of inpaint, mixing/inpaint, and mixing image prompt/upscale etc. I haven't found anything for fooocus, and so far nothing I can find with ipadapter faceswap either. (I do see an upload on civitai as I mentioned) but I'm not sure what to do with it exactly, since I have only messed with loras and checkpoints (maybe you know which folder to put it in?). I did do a quick look again for vids and posts since you mentioned it, but so far nothing for fooocus as of yet
Soooo we should all volunteer for bigtime models
fooocus has a bunch of advanced features to it
https://github.com/lllyasviel/Fooocus listed here
sometimes i take it for granted. i ALWAYS go directly to the readme file
I'm trying to send a print screen or upload of the civit page I was talking about, but for some reason this channel won't let me upload it
trying to figure something out is basically a hunt for the readme
i dont think you can add controlnet stuff to fooocus. its all built in tight
hmm
i mean it shows it is compatible with SDXL
you could click this link if you want to see it
https://github.com/lllyasviel/Fooocus?tab=readme-ov-file#moving-from-midjourney-to-fooocus what you need to read specifically
those would upload into forge or webui's controlnet models folder
fooocus takes care of all that automatically to make it simpler
i dont use forge (unless it is built into fooocus and i dont know it) but I do see this webui folder (found inside my fooocus folder)
I was trying to post that here also, but it looks like I just can't upload or print-screen anything in this channel
are you saying I can download the ipadapter shown in my civitai link into my webui's folder?
I see the readme files for the download found on civitai. I noticed it says to download it via huggingface or download the model in a python script... so it seems that my webui folder (found in fooocus) is where python source files go. SO I assume downloading it from civitai and placing the download inside that webui folder makes sense :/
no no. if you use the automatic1111 webui, that's what the controlnet is for. Fooocus is a lot tighter. you don't need to copy files into it to use its face swap. did you look at the readme link i sent?
fooocus had the face features long before midjourney did
yeah I read it, it basically says it has all the things like faceswap etc. I use the built-in features with Fooocus already. But I know you mentioned you liked faceid, and so when I went on the hunt for it I came across civitai, which mentions it has it for sdxl, so I figured it's an additional add-on that can be implemented into fooocus to make better faceswaps... but clearly I'm unsure
@astral goblet
instant-id. it's trained on sdxl. fooocus is just one of the UIs for using stable diffusion. you can't bring new controlnet/ip-adapter models into it. it's more basic so people don't have to worry about that stuff as much.
automatic1111 has a controlnet extension that's more advanced. exposes all the options. lets you copy new models in. comfyui is visual coding of your own workflows. there are other UIs for stable diffusion that offer much different capability
yeah, I thought based on this msg that maybe it was something else that could be added on to fooocus (I thought the civitai link was possibly that 'thing' I was looking for)
nope. fooocus doesn't have support for instant-id. the author hasn't got to that at all since he's been busy with forge ui.
he maintains both.
sure he is very busy.. but I saw instant id (sdxl) on civitai and figured that I may have been onto something by finding that
🤷♂️
did you look at that link from civit by any chance?
yup. those are the files you'd use in other UIs
plenty of others yeah. those are the 2 big shows
yeah those are the ones I look into, but I stay away from comfy atm
gotta learn one at a time, but overall and so far I like how fooocus runs (I want other things added to make it better). it has some good things and it has elements of A1111 in there, so I stick with that for now
and ok, well I was hoping I could have gotten some better results for faceswap... what did you use to make the isaac newton pic?
fooocus has a lot of cool little enhancements. i find it gets great variety. the fooocus v2 style enhances prompts with gpt2. its good stuff
was instant-id but i can get equal results from ip-adapter face swaps too
yeah I do like it and have been learning bit by bit how to work it.. if I had a stronger gpu I could do a bit more, but overall it's pretty good. but which web UI did you use the ip adapter on, I mean?
fooocus is a great experience, i gotta chime in there
i use forge mainly. automatic1111 when i want to do animatediff stuff. it has better compatibility there. and i use comfy for new cutting edge stuff like stable cascade or playground 2.5
yeee @karmic cedar I just hope we get better img2img and maybe make the process a bit more intuitive with all the little controls. Or at least get very clear documentation on it. So far I have to look for all the details on a forum here or a forum there or in a vid, etc. (would be nice to have it all cohesive)
if i want to show a friend the new toy , i load fooocus
it has great img2img functions for the most part
but only within the prebaked parameters really
and when i say i load comfyui, i mean i load stableswarm ui and it has comfyui as the backend
^
it isn't bad really.. but something a bit easier would be nice.. you do need to mess with the inpaint/mixing with prompt and img prompt with upscale/enhance to get it relatively good
but it is a real process of trial and error until you nail it.
for sure. I enjoy the subtle variations it can do
and then nailing it for one type of pic doesn't nail it for the other pic you try the same process with, so you need to tweak again
yeah this is true
i feel like the guidance scale makes a much bigger difference in fooocus than it does in a lot of other workflows
https://twitter.com/PurzBeats/status/1767253827688730889 comfy can do crazy workflows like this if you fire it up well. ip adapters in different regions driving animatediff. uhgn
ooo
getting there! at the same time, I’ve seen some comfy workflows where people are clearly trying to out-diffuse each other, National Lampoon’s Christmas Vacation-style
the process just needs to be smoother (by smooth, maybe some additional features that can really nail a faceswap perfectly).. easier means of creating various postures, stances etc, visual references for aspect ratios perhaps... things like this.. good documentation, and of course just better skin texture etc (but that is more for the loras/checkpoints etc)
i get that reference
i only really care about comfyui workflows when i can do stuff with them i can't manage with a static unchanging ui
yep, there are pros and cons to the undying UI
lol
anyone here watch Two Minute Papers?
haha prompt worms yeah. i haven't seen his vid on it yet though but i heard of those
hey yall. just saying goodnight
yall take care
thanks @astral goblet for all the feedback and help today
🏆
i can't think of really anything comfy isn't the best at
even inpainting it's the best if you know the right way to do it
what qualifies that
Is it just my imagination or was SD3 announced to have video capabilities
All I found was: "SD3 will be able to accept a range of inputs including video and image."
using differential diffusion, unsampling/resampling, automated cutting of masked regions and inpainting those, etc
yes, Emad said something about it when Sora came out. Probably SD3 or the architecture is capable of managing 3d or video, but I don't know
you can make inpainting models of checkpoints within the workflow
idk why all of a sudden my stable diffusion won't work and the console says
"The future belongs to a different loop than the one specified as the loop argument"
reminds me of emacs vs vi, endless debates that never went anywhere, both are still around decades later, use the tool that appeals to you, the end
there is no fate but what we make
it really sounds like some weird saying like that
thats how it feels to me. i don't get how any of those convoluted node networks actually improves anything.
can not wait to test SD3
https://twitter.com/Lykon4072/status/1767359656853189025 love this flex
Wow, it took an 8 paragraph prompt like a champ
Now that is a flex
I want to create paintings anyway so this is pretty epic to see.
stable diffusion dont come back?
idk man it seems kind of like SD3 is bouta kill midjourney (never thought I'd say that for open source anytime soon 🥲)
my language is weird tonight sorry sorry
at a certain point, stable diffusion just has too many upsides and control
people will still hate python. mj will be fine. if anything, mj will just start using sd3
Yeah I think he said the architecture could eventually do what Sora does but would need a lot more compute.
would be cool if they do another one of those streams where they all just geek out about the roles they've had in building the model, like they did for xl
you literally asked how to make an "anti-china covid flag" and have a slur in your bio- i think you can do the math
Hey fruit, any timeline for the SD3 bots you can share with us?
Cuz thats what discord mods do 😂. Get triggered over anything
yea nothing to do with the stupid stuff he posts here
Honestly who cares. Ignore him and move on
nah its better to time-out otherwise they wouldnt learn
🤷♂️ welp imma follow my own advice and mind my biz
does anyone have the timeline or leaked information about the release of SD3?
I heard SD3 was gonna... was gonna... improve image gen
It already has, coz we can run SD1.5 locally, midjourney is discord only
¯_(ツ)_/¯
yeah but the midjourney discord has 19 million and is still pretty active
yeah because it's for ppl who know nothing about ai, the type of ppl who just wanna type a word and get a pic
WORD
literally
EXCEL
I mean
if you don't feel like installing loras, models, UIs and tinkering a tiny bit to get it to work midjourney is fine for getting somewhat specific characters from the Internet
but you lose a lot of control because of that
I find I can get really detailed images based on the prompt, hires fix, and set resolution
ya
You kinda become a prompt programmer in a way too
Pretty much changing words till it works and debugging
but if you wanted, for example, Pikachu riding a bicycle in an anime style you have to go through a lot of models and digging if you're completely new to AI
I still haven’t learnt how to use those things that people do like (good:1) weird things like that
those are like a mini CFG scale from a specific phrase
Oh, how does one know what words to use?
Or are the words whatever like the prompt?
um, I don't know lol. I look at the example prompts on the model read me to get a general understanding but if you ever used something like danbooru you can use tags like that
every fine-tune interprets words slightly differently, just try a few that feel right and see how it affects the image
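For anyone puzzled by the `(good:1)` style: in A1111-style prompts, `(phrase:1.3)` tells the UI to scale the attention given to that phrase by 1.3, and unmarked text sits at 1.0. The sketch below only shows how such a prompt splits into (text, weight) chunks. It is a simplified illustration, not the UI's actual parser: `parse_weights` is a made-up name, and it ignores nesting, bare `(phrase)`, `[phrase]`, and escapes.

```python
import re

# Matches only the explicit "(phrase:number)" form, with no nesting.
_WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")


def parse_weights(prompt: str) -> list[tuple[str, float]]:
    """Split a prompt into (text, weight) chunks; unweighted text gets 1.0."""
    chunks = []
    pos = 0
    for match in _WEIGHTED.finditer(prompt):
        if match.start() > pos:
            # Plain text between weighted phrases keeps the default weight.
            chunks.append((prompt[pos:match.start()], 1.0))
        chunks.append((match.group(1), float(match.group(2))))
        pos = match.end()
    if pos < len(prompt):
        chunks.append((prompt[pos:], 1.0))
    return chunks


if __name__ == "__main__":
    for text, weight in parse_weights("oil painting of (pikachu:1.3) riding a (bicycle:0.8)"):
        print(f"{weight:>4} | {text!r}")
```

A weight of 1 therefore changes nothing; values a bit above 1 push the phrase harder and values below 1 tone it down.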
want me to do my favorite images and geek out?
This
gotta be pikachu
uhhhh
the what.
haaaadouken
😄
after SD3, MJ retaliates by introducing a controlnet
actually I think the reason they haven't already is because it would make it easy to find loopholes around their censoring
which would get them in trouble because they are very public facing
guys any idea sd3 release when?
Emad tweeted it is on the way, but what do you think when will we get hands on experience?
like even if it's a HuggingFace space that would do for me...I just wanna try SD3 as I'm super hyped for it
SD3, SD3, SD3.....
The people from the waitlist will slowly be invited to try it once its ready it seems. Weights probably wont be released until its actually done. The early testing is there to further finetune the final model from my understanding. TLDR: 
Which doesn´t really say anthing new tbh 😄 😉
Yeah cause I dont have any information, but the question keeps rising every day 
Here as well 😄 Would at least be interesting when the early invited ones will be able
like at least an estimate
Those few demo images I could find so far just aren´t really doing it in terms of deeper insights 🙂
I think its fine they dont give a date, gives them more time to focus on the important things and they dont have to rush it
I guess, it´s simply about being able seeing more footage so to speak. Would even suffice if there were additional example images available
I think people just became really impatient with AI cause were used to the light speed progress now
Just follow Lykon’s twitter - they coming nearly daily on there
I think its cause we are all dopamine addicts looking for a bigger better hit
If you get your daily dose of dopamine from SD you should rethink your life 
Same happened with phones
No biggy, just saying it would be pleasant having a little more to look at by now 🙂
Ah, interesting, thank you
Lets be honest, you get the same effect from hitting the generate button as a gambler does pulling the lever on the one armed bandit. Both people are hoping to get the result they want
Yeah obviously this can lead to addiction, I dont wanna say "touch some grass" to people generating stuff all day, but hey 
young Chinese girl in glasses typing on a computer in 3d digital illustration 2, in the style of quirky character designs, warm color palette, unicorncore, vray, study, sharp & vivid colors, studyblr --ar 4:5
We're still waiting for it. But I can confirm that it will be released before Sagrada Familia is finished
ok now what is the best stable difusion image generator right now?
Depends on what you want to do. I'm a fan of sdxl turbo
Cool
what's the difference between Lora and Textual Embeddings? i can train faces in both, train certain outfits (like traditional JP outfits, Chinese outfits) etc., certain styles. i don't see what the difference is. is one better than the other?
Embeddings is old tech by now, it has its ups.... But it's overall less effective than Loras. cf https://www.youtube.com/watch?v=dVjMiJsuR5o
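One rough way to see the difference: a textual embedding only learns a few new token vectors for the text encoder and leaves the model weights frozen, while a LoRA learns small low-rank matrices that are added onto the model's own weight matrices. A back-of-envelope count of trainable numbers, as a sketch: the 768 width matches the SD 1.5 text encoder, but the vector count, rank, and layer shape below are just example values, and both function names are made up.

```python
def textual_inversion_params(n_vectors: int, embed_dim: int) -> int:
    """Trainable values for an embedding: one vector per learned token."""
    return n_vectors * embed_dim


def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Trainable values for ONE adapted linear layer: two low-rank factors,
    of shapes (rank x d_in) and (d_out x rank)."""
    return rank * (d_in + d_out)


if __name__ == "__main__":
    print("embedding, 4 vectors of width 768:", textual_inversion_params(4, 768))
    print("LoRA, rank 8 on one 768x768 layer:", lora_params(768, 768, 8))
```

A LoRA touches many layers, so it has far more room to change how the model draws things, whereas an embedding can only point at something the model can already produce.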
guys
i am getting OSError: stabilityai/sdxl-turbo does not appear to have a file named config.json error
Where are you getting that error
code:
import transformers
from huggingface_hub import login
login("myloginkey")
# Load the model directly from the Hugging Face Hub
model = transformers.AutoModelForSequenceClassification.from_pretrained("stabilityai/sdxl-turbo")
@sudden ruin
Im not sure what youre even doing, please use #🤝|tech-support
And check out common UIs, because I dont think youre doing the right thing right now
k
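For context on that error: `stabilityai/sdxl-turbo` is a diffusers pipeline repo, which is described at its root by `model_index.json`, while `transformers` expects a root `config.json` and a text-classification model. A minimal sketch of the diffusers route follows, based on the usage shown on the model card. It assumes `diffusers` and `torch` are installed and a CUDA GPU is available; `looks_like_diffusers_repo` and `load_sdxl_turbo` are made-up helper names.

```python
def looks_like_diffusers_repo(root_files: list[str]) -> bool:
    """Heuristic: pipeline repos have model_index.json at the root, not config.json."""
    return "model_index.json" in root_files and "config.json" not in root_files


def load_sdxl_turbo(device: str = "cuda"):
    """Load SDXL Turbo as a text-to-image pipeline (downloads several GB on first use)."""
    # Imported lazily so the helper above works without these packages installed.
    import torch
    from diffusers import AutoPipelineForText2Image

    pipe = AutoPipelineForText2Image.from_pretrained(
        "stabilityai/sdxl-turbo", torch_dtype=torch.float16, variant="fp16"
    )
    return pipe.to(device)


if __name__ == "__main__":
    pipe = load_sdxl_turbo()
    # Turbo is meant for very few steps with guidance turned off.
    image = pipe(
        prompt="a cat eating crepes", num_inference_steps=1, guidance_scale=0.0
    ).images[0]
    image.save("cat.png")
```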
。。。
shout out to @bleak matrix and @sudden ruin ! how u doing frens?
I'M ALIVE!
Was just makin some crepes with apples
How are you?
ohh i really cant wait to try it out! thanks for the info
All good here! i see you are having a nice breakfast as always : )
Doing great, hbu
Why do I write what photo I need and he doesn’t take it or send it?
I've been in conversation with the author of the XL Realistic Tile controlnet model (bdsqlsz_controlllite_xl_tile_realistic [12b261fe]), and they seem to suggest that it is not actually ideal for tiled upscaling. Rather, it improves results for HR Fix / high res value Img2Img - but not for tiled diffusion methods. I'm putting this out there for anyone else who, like me, may have assumed otherwise.
https://huggingface.co/bdsqlsz/qinglong_controlnet-lllite/discussions/7#65ec7831d7d63c2ed0a90a49
That's good info, thanks for sharing!
there was a bit of excitement when this tile model was released, but I was a bit puzzled by personal results, as it didn't seem to work the same as prior SD1.5 tile models. I had yet to see anyone come out and recommend a workflow with it, so decided to go question the author
I'm really getting into that cooking thing! I found that using this yogurt really vibes with the crepes!
Why is there no esrgan folder inside my models folder? Can anyone explain?
pretty impressive, any one of those scenes would be days of work
The part at 1:14 was the only thing that looked pretty wonky (would upload image if this were images channel)
Help plz, How generated images?
english please.... And the folder is not created if there's no need for it. You haven't tried to use any GAN upscalers => there's no ESRGAN folder
my sd just randomly stops. it doesn't give me an error or anything, it just stops in the middle of generating and doesn't do anything
when I run out of vram it just gives an error; here it just stops
an inconclusive observation
How long until the bot is back?
lol, they really should allow memes in here
granted that would drown out the actual conversation
but would drowning out "when will sd3 be released" be all that bad
I wonder what people try to achieve by asking this here, like we know any more than they do or some dev drops in and tells you the exact date? 
ah I just noticed that impressionist paintings look similar to the paintings I made in SD2.X
I loved those paintings and I could not replicate them in SDXL that well
I love big visible brush strokes
I have a lot more faith in running SD3 on 8-12GB now
since Comfy offloads so efficiently
I suppose this is why I could run Stable Cascade
I load the b and c models separately into VRAM when they are generating
then it gets offloaded to RAM when its unused
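The load-then-offload pattern described here can be sketched as a context manager. This is not ComfyUI's actual model management, only an illustration of the idea. `on_device` is a hypothetical helper that works with anything exposing a PyTorch-style `.to(device)` method, and `FakeStage` is a stand-in so the sketch runs without a GPU.

```python
from contextlib import contextmanager


@contextmanager
def on_device(model, device="cuda", offload="cpu"):
    """Keep `model` on `device` only while the block runs, then park it on `offload`."""
    model.to(device)
    try:
        yield model
    finally:
        model.to(offload)  # runs even if generation raises, so VRAM is freed


class FakeStage:
    """Stand-in for a torch.nn.Module; only records which device it is on."""

    def __init__(self, name):
        self.name = name
        self.device = "cpu"

    def to(self, device):
        self.device = device
        return self


if __name__ == "__main__":
    # Only one stage occupies VRAM at a time; the other waits in system RAM.
    for stage in (FakeStage("stage_c"), FakeStage("stage_b")):
        with on_device(stage) as active:
            print(active.name, "running on", active.device)
        print(stage.name, "parked on", stage.device)
```

The trade-off is time spent copying weights back and forth in exchange for a lower peak VRAM requirement.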
@charred mesa what you looking forward to the most about sd3?
prompt adherence, possibly expanded knowledge about things (such as games, genres, etc)
as long as I can generate highresfix images with SD3 at like uhh, 1-2 images per minute I'd be the happiest man on earth
it looks so good with highresfix just like SDXL if not way better
you are one of the few who isn't looking for booba
yeah
idc about censorship unless it f*cks up anatomy or takes away my blood and guns (I want zombies and soldiers)
i just want controlnet to actually work in sd3
I do hope that people will be able to train corn back in, as I can only scratch the surface of what could be possible with such prompt adherence
sdxl controlnets don't compare to sd1.5
they are going to launch it with controlnets iirc
yeah its sad
I barely use controlnets, but when I tried them with SDXL they sucked iirc
only depth was okay at like very low strength
that's good, cause controlnets solve a lot of problems for me. I use them a lot for setting up positioning and poses
my god with SD3 I could make rough colour blobs and use them with like 90% denoising img2img and get exactly what I want in the perfect colour palette
its the dream
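On what "90% denoising" means in img2img: the input image is noised most of the way and the sampler then runs only the matching tail of the schedule, so the rough colour blobs survive mainly as layout and palette. The arithmetic below follows how diffusers-style img2img pipelines pick the starting step, as far as I understand it. Other UIs may count steps differently, and `img2img_steps` is a made-up name.

```python
def img2img_steps(num_inference_steps: int, strength: float) -> tuple[int, int]:
    """Return (skipped_steps, steps_actually_run) for a given denoising strength."""
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be between 0 and 1")
    steps_run = min(int(num_inference_steps * strength), num_inference_steps)
    skipped = max(num_inference_steps - steps_run, 0)
    return skipped, steps_run


if __name__ == "__main__":
    for strength in (0.3, 0.6, 0.9):
        skipped, run = img2img_steps(30, strength)
        print(f"strength={strength}: skip {skipped} steps, denoise for {run}")
```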
i also hope sd3 2b model will generate the same images (lower quality) as the 8b for the same seed and sampler
and no controlnets that could fail upon me
2B models will get good finetunes im sure
it might eventually look almost as good as 8B base
but then 8B finetunes are going to go even further than everything
this is such a bright future for offline image generation
yeah, i am gonna upgrade for sure if 8b is really leagues better than 2b
oh my god, imagine using SD3 to immerse yourself into LLM RolePlays
did you see that photo lykon posted of the black/white man?
wait lemme look
what you think about it? for me i really like it, the skin doesn't look as reflective and rubbery as sdxl skin. Really hopeful for the photorealistic scene of sd3
yeah it looks more like 1.5/2.1 skin, less rubbery
I really love the black rocky person with white hair and lightning around him
I tried to recreate it in SDXL and Playground and I couldn't get it right
nipple man looks crazy
and yeah, nipples, what a surprise
and atleast the eyes don't look like sdxl base
it seems they didn't lobotomize this model after all 🙏 (not that I believed in that)
sdxl at launch, fucked up all the eyes
lmao
I thought the prompts I made for my SD3 preview prompt list were too long but they are way below the 512 context limit 
it might have some falloff towards the end of that 512 limit but it's a huge improvement over 77
have you run them through ideogram?
no
I bet ideogram would have gotten it right or very close
Ideogram is like if SD3 was a heavily fine-tuned paid online service 
do you think we have reached point of diminishing returns on image models?
Emad thinks so
I personally think we can still push further in prompt adherence and possibly knowledge
and most importantly, optimizations
that still needs to be reached for me to think we're at "the point of diminishing returns"
for sure, what would be "it" for prompt adherence for you? like what prompt would you want to run with a model that will 100% adhere to your input?
hmm maybe 90%
actually...
for simpler prompts it already gets like 99% accuracy lol
with the coloured bottles for example, when it was just a half-baked model at the time
who knows, it might get even better when the model has finished training
wait will they release a new paper with the finetune model?
maybe not a full new paper but redo the comparisons between other models
finetune model?
also with the current images I have seen it gets to about +80% prompt adherence which is nice to see
especially with that beach demo
the model they release weights for, cause the paper came out with the half baked model
it only missed out on the palm trees
beach demo?
nai v3 still solos
mj doesn't get the palm tree also
the dune grass is subjective cause it can be both green and yellow
oh right
plus i think inpaint would be crazy on sd3
hmmmm
i'm okay with sd3 not generating images perfect one-shot if i can fix them easily with inpaint
eh this is why I'll be experimenting with img2img colour blobs
so subjects will be exactly where I'd want them to be in almost the exact colours
but inpainting might help a lot with fingers
you just reminded me, i hope sd3 has a tile controlnet, i wanna remake all my old images
Can you change the size of the tiles
cause I might wanna do higher denoising than what would work with images that have bokeh
well the tile controlnet was just a way to do image recreation, you put in an input image and the controlnet will force the model to create something close to that
a lot of ppl used it for upscaling
yeah for upscaling
I'm looking forward to the "tuner program"... let's see if it'll be "just" early Kohya support or something else.
doing too much denoising might make some parts of the image more focused than the others
i wonder if SAI have a secret 12b model for commercial use that they won't opensource
you can use threestudio, but it is tricky to run on Windows
you will also need a lot of VRAM
oh so it takes more than stable diffusion turbo too?
I just have 6 gb vram so not enough?
finally it starts! 🙂
hope i get invited soon :D
WHO GOT IT
none here 😦
i sure didn't, oh wait i did not even apply for the testing Lol
imagine now we see all pros and cons of this model, getting really bored of Lykon boring lame SD3 images from twitter 😄
I always forget what do I have to put in the Discord ID field when applying for these things, is it just the name or a code?
It's the code
i don't think this is the community for you if images bore you
i guess lykon's doesn't have porn. bit of a vapid opinion but it's your opinion to have
yeah guess i'm not going to get an invite. i sent my name in. just going to toss all the prompts i had ready for it then
guess who got it 🙂
his SD3 images were mostly just lame portrait stuff that SDXL can easily do, this is BORING!
this is not how u showcase new model like SD3
i don't know why bullies always gotta flex on people who got more skills than they do
insecurities must suck to carry
just my opinion on what i saw))
Not really a code, your number ID
yeah i put my discord name. guess i'm fucked
how do I get in the waitlist for sd3??
<3ily got it?
you say that, but instead of criticizing authentically, you just said something lame ass and void meaning like "lykon only posts lame pics"
Just checked and most of it I knew already. Do you know of someone doing stuff outside the standard anime, dragon, fantasy stuff?
yeah i did the same, and i was one of the first i think, but i only figured out i did it wrong when i read people had received a confirmation mail, which was days later 😦
it's an emotional opinion. kind of like ones bullies have while kicking down sandcastles
he kind of does, i'm not impressed with SD3 so far!
then don't use it
stick to sdxl
why be here waiting for it then? weird thing to do
otoh i'm enjoying cascade right now, so no hurries
already deleted all SDXL models 😄
if you're happy with the porn riddled sd15 go use that. its there already
the porn trolls need a good hammering tbh. why come in here and rip on creativity?
Does anyone here have sd3 yet?
Agree, at least to a certain extent, there is too much bokeh or blurry backgrounds in those generations. If we get access we need to push it to the limit
nah i don't care about corn, i just fing all this single person-portrait stuff SDXL era, you know where i'm getting?
i pinned you immediately. you're an unstable diffusion fan
Lykon stuck in SDXL era dude.
Yeah he just likes portraits
i can see that Lol
he made finetunes of SDXL such as DreamShaperXL
but it's his twitter account so he will post what he wants
and he mostly makes portraits of single subjects
anyone have experience with mov2mov? it seems to completely ignore stuff, and does its own thing
i don't see the point of ripping on better people's creativity and calling it lame. it's pretty obvious why you think they're lame.
your truth lies in the corn fields
:3 no invite for me yet
me too
I had more whole prompt list ready lmao
bless me! 🙂
When theres prob at least 5k+ its hard
yeah
what hurts the most is that if it's based on who applied faster then a lot of people got effed over
as on the first day it didn't send out emails to a lot of people including me
I dont think it is on who did it first
i hope they won't forget the ones who are here from (almost) the start
I applied within like an hour though soooooo
yeah I think it's like CS2 limited test, random and youtubers
maybe it's based off behavior in server
who knows
ohhh if we didn't get the mail then we're not in the wait list ??
possibly
thats lame. moderators can get personal grudges all the time.
dang
i applied within the hour and got an email back within 2 minutes that i was on the waitlist
so some personal grudge will prevent someone from getting a preview invite. super lame
not the case
so strange I also applied in the hour
yup
im guessing the waitlist is like 500 people
Ha probably not
lol
Satire
I had applied the same day that the waitlists were opened, only after a week I realized that nothing had arrived via email so I requested again and then I received the email that I had entered the waitlist 
official representatives aren't the typical source of satire
I mean anyone can do satire
alright I'll just apply again then, I just hope it's not on a "first arrived first served" (even if that'd be more fair)...
Well, go to work and show us the results 😃
?
I mean, the ones who got access
i hope so
its based on who has the best grammar in their prompts
even if I can't test stuff myself I want to see more unique and complex prompts either way
I hope everyone who got access has a lot of fun
I used up my invite luck with Tekken 8 CNT and CBT anyway
lol
@fervent thunder could i bother you to ask, when did you get your confirmation waitlist email?
9am central like 1-2 minutes after signing up
I REALLY REALLY hope that it's also not based upon countries, that would be annoying
I think I had access to Dalle for like a month before I realized it cause it was in my spam folder
i got mine at 6pm pst. ish.
lmao
yeah, art has no borders 😄
on comfy, is it possible to load/unload models without queuing a whole prompt?
it would make sense to balance different countries so it's 24-hour image voting/creation
Haha same here 😂
it does unfortunately. there are some countries which have extreme laws against particular art or creativity. won't get into naming which.
Not ripping on them, but tbf those images (even if portraits are the subject) are pretty standard motif-wise, not really showcasing SD3's creative possibilities alongside its quality 🙂
well Tekken did something with regions
like 5000-10000 invite each region
idk
I got super lucky so I'm not even mad for not getting SD3 access lmao
that's better than just outright calling all their images "lame". I wouldn't even say all their images are standard. people just see what they want to
One sec ima do the waitlist again but change region to some random island :3
lmao
And then I found MJ and a week later got access to Stable Diffusion so lost all interest for dalle
Dark was a good netflix show btw. just sayin
I only really use dalle3 to make some deltarune fanart
Tbh I think showing that SD3 can do some great-looking images with very simple prompts has its value, given that previous versions weren't great at that
It's really good at following directions
Not as good as Google's other image model though
I dunno if you have found it yet but damn it's so much better than their Bard one
Not the best for understanding but great at characters
Thing i don't like about Dall-E. No knobbies. How do i increase step count? fiddle with cfg? scribble on images before denoising? HOW?
Sure sure i say "prompting is king" like it's a holy mantra, but the king needs subjects to knobbie
I think prompt control is absolutely necessary along with everything else
Sd3 looks like it has much better prompt control
not sure bout that, "looks at Lykon portraits" 😄
more vapid criticism
"I can't therefore i will tear down what those who can, do"
hope negative prompting will be supported during the test period
I was mainly mentioning it in terms of showcasing what SD3 is actually capable of. The images certainly look pleasant quality/shape-wise, yet at least the ones I found there stay within the realm of what is basically already known, just in better quality. I'm simply interested in SD3's potential when it comes to complex depictions that use the improved understanding. I know the girl at the beach demonstrates the understanding pretty well and the image looks pleasant overall, yet I would like to see something beyond the usual suspects 🙂