#🏞|general-with-images
1 messages · Page 57 of 1
deepfloyd is just overbloated with parameters, but SD 3.0 will be way more reinforced high quality and lesser parameters, based off what the devs said
tbh i might as well find myself an m2 while im on this site since i need a new storage space for my os anyway
an A100 runs games like sh*t
yeah, my friend has one lol
those high numbers of VRAM etc. mean nothing for gaming with those cards
he has two GPU's, one for gaming, one for AI
because they arent made for games
he has one thing in the first place: Money
or he took loan from bank
which is also an option
nope, got it for free lol
or he stole the money from a bank :-)
i dont know if i want to take a bank loan for that
He might be getting a H100 soon as well, but it doesn't seem to be very beneficial to AI right now strangely
But he got his 3090ti and A100 for free
Being the head of one of Unity's AI development teams has its perks lol
yo what..
He and I made a tool that unity bought from us, and they took him to be part of their team. I didn't know enough code to be taken
It was called Barium AI, and its now one of the core parts of their next level AI suite they are working on (of which he is overseeing)
damn lucky dude
Yeah, he got his own department and stuff, and hes mentoring me to be hired under him as a machine learning engineer
openjourney-v4 is interesting, i don't think it is useful to put in as a CTU base model
i love having to upscale my images by exactly 1.334 for high res fix :-)
1.33333333333333...
CTU
what if he wants to be addicted?
loool. then welcome to the club
dog this is lowkey awesome
you could generate 8bit splash screens for a game with a retro theme
you should try 16-bit next
noone said it was for gaming m8
https://www.youtube.com/watch?v=zBAxiQi2nPc this video pretty much sums it up
Get 50% Off the First Year of Bull Phish ID and 50% off setup at https://it.idagent.com/Linus
SmartDeploy: Claim your FREE IT software (worth $580!) at https://lmg.gg/Jpt4k
We've experienced a lot of crazy, top-of-the-line graphics cards on LinusTechTips, but we've been unable to get our hands on one famed card - the NVIDIA A100..... until now...
it doesnt even support directX so you physically cannot use it for gaming
the fastest gpu that sucks ass for anything other than server or ai related stuff
hey, by any chance does anyone know how much you can mine with a 3090 per hour in terms of USD
🤨
in ethereum
once all the conversions are made
so maybe 50c per day on a 3090?
because I have a few business ideas for setting up a mining rig somewhere where electricity is cheap, like in third world countries
my life goal is to never have to work again
👀
@bold cave bam, CTU'd
what have i done to spiderman...
I thought I was having a stroke there for a second
tyrone biggums
i haven't tried the GPT3.5 prompter til now on the fixed pipeline 
it looks like it gave spider-man toes
ok.. now this is where i want it to go

perfect 😈
dats tony stonks
yes but actually no
man even good images become better with CTU
@bold cave have you plugged DF into CTU yet?
deep floyd into ctu?
using that instead of the stage 3 upscaler
whats CTU im behind lol
dont think too hard lol
deep floyd
o
Im behind on that apparently
oh its for text
eh not interested although would be nice to learn
i love using high res fix :D
can someone help me settle whether or not its better to do base gen > CTU or if it's better to do base gen > hires fix > CTU
im too lazy to try it and im about to finish my first dreambooth on 1.7
interesting
because highres fix seems to do a world of good for gens
i doubt that its even going to work
really? huh..
the issue is that the smaller the starting image, the less faithful the upscale will be to the original
i am trying to make samurai spider-man now :)
there we go. this is CTU's version of upscaling the DeepFloyd bear.
is it img2img
yes
I really dont want to have to learn something new. I cant keep up with all these new tools
any new best photorealistic models since 2 days ago?
ai moves too fast
ultra
called zegovya or zegoya or some shit like that
i just got edge or real and cyber
is that the 16gb one?
is it on civit?
yh m8
cranking it up lmao 
that one guy who ran all those tests said it was that good, you found a noticeable difference over edge and cyber?
It was better than my own model which was already pretty damn good
its zovya
sytan tested edge and cyber and I tested ultra, ultra is unbeatable atm
i mean i do have the space, might as well try
if you mean this thing sen
yh if its 16gb then its the one
yep its 16gb
16.3 💀
i aint even got 7gb free
i really need to clean my hard drive
do my method
2x SSD m.2 x 2TB each
and 1x HDD 4tb
6tb I mean
that's 10tb space total for the best bang for ur buck
I moved 118gb of old models to my HDD
using your method of passing the same negative/positive prompt embeds from deepfloyd to CTU 👀
man this uses way less VRAM than the default DF setup
brooo that is dope
and it's just a simple python script
not even tangled up in discord code
lemme run the DF version to compare against
i still dont understand what makes DF better than good ol sd
it has a text embedder called T5 which on its own is essentially a LLM comparable to GPT3
you can add a 'head' to T5 and use it that way, but DeepFloyd is apparently also a possible 'head' to use it with
this allows more easily tuning the prompt engine eg. to add instruction based prompting "make this ball red" rather than "red ball (red) ((red))"
ohh I see
hi guys
i have problem
why my generates goes this?
in realtime render all looking good but in final...
ghosts
pass the prompt data
what are your settings?
you didn't get one
or --no-half-vae. I saw that problem a long while ago
ok well i just moved my sd 1.5 and 2.1 to my 2tb hard drive
now i have the space for that model
@bold cave i just realised CTU nuked the deepfloyd watermark ahaha
thats the same principle behind patching images
if theres a spot missing for example, u color it in photoshop or even paint and run it thru img2img and tadah. it makes it look real
i just realized i have 14gb worth of recordings ._.
atleast that makes more space for the model
anyway sen how good is that model?
https://gist.github.com/bghira/5cccb5220ec101fd48c4dd9329754945
here you go, if you'd like to try the DeepFloyd-to-CTU pipeline, this one will do both. you can set deepfloyd_stage2 = False to use CTU, etc
Using DeepFloyd for image generation and then CTU for Upscaling. - DeepFloyd-CTU.py
you'll have to do the login() jazz if you've never done that before - had to fix a line.
no-half calls to cpu render?
cuz i have runtime error
RuntimeError: [enforce fail at ..\c10\core\impl\alloc_cpu.cpp:72] data. DefaultCPUAllocator: not enough memory: you tried to allocate 58982400 bytes.
no thats a ram error
have u already modified ur pagefile size
I cant guarantee this is the fix tbh. im just guessing
u can also ask #🤝|tech-support
i have did nothing
i'm just add --no-half in start file
what kind of gpu you got?
💀
but now work only one 3060
bloody hell
@bold cave so CTU is a superior upscaler to DF's stage 2/3
unless the bear was intended to go through battle, idk
here's deepfloyd stage 3 and CTU side by side
i imagine you could play with the noise strength to improve either side's result
time for the caustics prompt 
remember that part in ace ventura, when the helicopter pilot guy is like "I wouldn't do that if you were you!" and Ace is like "well if you were me, i'd be you, and i'd use YOUR body to get to the top!" ?
does anyone know if this is outdated or makes a difference in db
yh its probably outdated shit
well.. i tried to train a lora but it didnt work
the image directory wouldnt work 😭
i was using a1111
loras are overrated af
can i use db on low vram?
how much vram u got
6gb
10 min 12 better 16 is great and Joe Penna DB 24
Yeah it is
I ran some batches and forgot to uncheck resize 
too low sorry
yeah which is why i wanted to try making a lora...
even the 3060ti is too low
model doesnt matter, vram my friend
What about the p40 or m40
thts all that matters in the world of ai. vram vram vram
loras use less vram
and thus are less powerful than db
yep, and the 3060ti is 8gb vs the 3060 of 12
12 can do DB training while the 8 is doa
this model im training has kyrsten sinema in it, and ima challenge the so called lora master sytan to do one better
the 16gb model seems terrible so far, but its probably a prompting style issue
ima guarantee you, with this model and an expert dreamboother, he's not gonna beat my sinema
yes it is
I really need to just buy a 4090 24gb
this is what the model produced
T swift
ye, upscaled, in detail, and thats without using a finetuned model
I just want to do portraits of myself smh
I already do
🤨
sadly they will give you insecurity issues bc of unrealistic beauty standards
Do you look like tailor swift
nope
I hair a thin beard and the model keeps giving me a butt chin
But I want to see smh
i kinda just realized its probably going to be a looong while before the model even ends up loading
bloody 16gb 💀
around 10s for me also
Spotted the European
but the results were terrible lol, id need to figure out the prompting for it
yeah well my sd install is on a crappy old hard drive
did u also download the vae
u gotta make sure ur using the right vae too
you need to download a vae too?
its not in the description, but ill try with the vae
i need to see what a vae even is
i dont see anything about a vae in the description 💀
yea its not there
vae-ft-mse-840000-ema-prune
pruned
jus google that joint n put it in the same folder as ultra
anything to enable it in the ui or once its in the folder it works its magic nnnnnnnnnnnb
LMFAO
ew
you guise like candy?
looks like jawbreakers
looks like a wiiild plate
right, yea i mean hollywood people do photoshop themselves beyond recognition
just used that VAE on digital diffusion
and it completely changed how my prompt worked
yea
for better?
it should do way better stuff
@bold cave sorry babe i didn't know you'd like it or i'd have CTU'd it in advance lol
oh god
ew...
i mean its a better looking person but looks nothing like miley
i didn't actually put miley into the prompt
that's just who it looks like to me 😄
holy shit snoop, way better
just finished filling out the concepts list for db lets hope this works
last time I used a concepts list it was bugged
please db gods just let me have this one
you also need to clone your reg image folder which is 2000 images each lmfao
just because of a db bug
looking good so far. 16000 reg images
awww crap it went poorly this time, don't look sen
i just think it's really interesting what happened to his mouth/face and how it hurts to look at
its 2000 images, but I have to clone the folder for each subject due to an a1111 db bug
at least he looks realistic
I do 20:1, and I was doing about 15:1 prior, Im confident they'll work fine
8:1 with that many images should be fine.
yaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaas it works
ok im gonna have to close discord to not risk having this shit crash
gonna take about 3 hours, will post results later. cya guys
3 hrs for 8 subjects is pretty good tho
You know auto done fucked up
I couldn't get prompt editing to work then remembered the setting I had to turn off which means now all my () no longer works.
prompt editing, and dephasis use the same bracketing scheme [] which is bad
soon as I unchecked it my prompt editing works again
switch one to {}s 
I tried that but no dice.
i mean in the sauce code
Cyber and edge of real are 98% as good as Zegoya, but Zegoya does win sometimes
reminds me my picture from dailies. Why you dont take a part?
yeah i wish i could unsee that
😦
pwned 😄
🙂
Kinda looks like a kiwi
me: "what does this number change?"
SD: "this is what hell looks like"
i have 64gb of system memory and 16gb of vram. will the 64 benefit me for any kind of ml stuff?
no
anything more than 32gb of ram is a waste of money although that's just my (generally agreed on) opinion.
unless ur doing what shrek is doing
which is rare
less rare now
my training is taking a whole 22gb of vram. hoooly fook
I can see why so few ppl like dreambooth now
also, the area where my pc is feels warm af
it just heated up its entire surrounding lool
now I know why my cat and dog like to lay down next to the pc.
its because you never feed them 
that was true until I splurged on an autofeeder
so they will let me sleep in peace :kek:
im scared, i dont know if i like this anymore
high-guidance CTU is weird
with my current prompts, edge seems to work best, training a model on it now to compare it with the rv2.0 model with the same dataset
anyone use xformers when training in dreambooth? I've installed it a couple ways according to different guides but it never shows up as an option for me in the memory attention setting
man, MSE is just so good on its own. this has had zero CTU applied
progress bars
ive been "generate forever"ing for about an hour
it randomly downlaods a json file
i didnt change anything
well it will go on forever
it does have an overall progress bar at the very bottom fwiw
no, my point is the json that it just randomly downloads
that shouldnt have occured to my knowledge
no idea
oh i take it back, i changed themes about an hour ago and completely forgot about it hahah. this was just the theme being downloaded. whewhh all clear.
what you guys think they're doing for their new model for this?
it has really good text whatever it is. seems like finetuned DF maybe
ive been wondering why tile wasnt working as great for me as everyone else
turns out there was newer updated version that came out after the initial 1.1 models were released. i still have the old one 😅
reddit SD threads are nothing but CTU praise now
this thing really took SD by storm
ughh my models ended up a tad undertrained. this is the plight of training. every model trains differently. looks like I wont see the good shit until tomorrow
you will see the massive difference tomorrow. I fink instead of increasing the learning rate I'm just gonna let it do twice as many steps
what is CTU?
controlnet tile upscaler
oh okay, is that an extension for A111?
yeh
Manga girl
I just thought of something genius. What if you deliberately overtrain your model so that when you generate at higher resolutions, it doesn't deform it as much yet you get much better base definition?
Ima try that shit o:
has anyone tried this extension?
original, 0.3, 1.0
it really wants to make the groundskeepers ashamed of their lawns
sigh why do I keep forgetting to check the enable box on controlnet
i think the original pic works just fine for basic needs, the .3 is probably the most appealing detail-wise and the 1.0 almost looks fucked up / dirty but it looks very HDR
the one above it is 1.0 and it somewhat looks like it fixed a major terrain issue behind the upper pup's head. it's still there, but more sensibly
@bold cave
r0fl
the 1.0 turned the grass incredibly aggressive
it hurts to look at it
the second one is just perfect tho. excellent balance
agreed but if you're trying to shame the local town council you use 1.0
"Papa, where's Riley? And why do these other dogs look and smell funny?"
the dreamlike models are resistive to making sensible and coherent results. it's fun
genda
I'm trying to make some desktop backgrounds, anyone got any checkpoints that are for a more "fantasy stylistic" feel? I'm currently using rev animated but Im open to anything
@bold cave well it's like discovering AI image gen again in a way. i am going to be connecting DeepFloyd up to CTU in a more solidified pipeline in my bot later this week. it's going to be tricky because it'll make me implement some other architectural stuff that'll make it easier to eg. unload models that are no longer needed, when a new one is going to be loaded. currently it's super easy to do that for my image gen models but i need to do that for TTS and LLMs now too
@lime lotus isometric dreams maybe
same model
prompts:
- space llamas
- space-bound traveller wandering through mossy PNW forest, looking up at spaceship looming overhead, misty foggy
can you run deepfloyd locally
cuz if u cant its garbage
yes
igave you my script for it earlier
https://gist.github.com/bghira/5cccb5220ec101fd48c4dd9329754945
it works on Apple Silicon now thanks to someone who reported the required changes
Using DeepFloyd for image generation and then CTU for Upscaling. - DeepFloyd-CTU.py
used about 24GB of VRAM on MPS 
good thing he has unified memory architecture and a 32gb laptop
this isn't a script you use in a1111 btw it's just a terminal script and you can set it up to go through CTU or DeepFloyd's later stages, it might use less vmem to do it through CTU.
I made my SD icon using SD... it essentially created itself.
there's something so sci-fi about that I can't describe the feeling loool
so no gui yet
eh... I'll put it on my to-do list
by the time I get to it tho, something better will have probably come out. Im just not a big fan of sending shit thru a cmd prompt
I swear torch2.0 somehow made my gens better...
I also need to stop simping swift and find a new muse 
I just think this jumbo-tittied hoe is way overhyped
aliens are staring at jumbo.
been trying James Bond's main enemy cat but cant get close with necklace and fur.
face is super cooked but i like how the hat says HD instead of FD lol happy accident
jesus christ that is overprocessed lmao
ye lol
gotta mess with the controlnet tile/upscale loop thing
ah yeah, i already figured out how to stop it from doing that :p
ye those are crisp lol
mind sharing 🤔 ig trial and error is fun lol trying it with just clothing rn, gonna start easy
I am still optimizing its use, so for now I am gonna keep it to myself
but I do have goals to share
what specifically do you usually change to make it seem less overprocessed
I only played with it a bit so far but in retrospect it all got pretty overprocessed by the 3rd upscale
@bold cave thank you for this model
it is very damn good
i didnt even have to use the face restoration thing
what model did you use to generate this
my own custom model which is based on Zegoya Ultra
Zovya
^
i think the model has been reduced in size
yeh, but I dont bother since bigger is always better
spidey got money now
@smoky oak should I add positive offset noise or negative if my gens are looking too shiny
this new option got me stumped
U were talking about noise offset earlier and all the explanations out there suck ass
I never talked about noise offset?
oh, several days ago lmao
yeh I hadnt seen this new feature until I updated torch
I would assume Noise offset for a model and for a CN are two different things
I am not sure how to use it or train it, I just know how to generate with it
mind telling me how you trained ur own model?
I have no idea, cause I have models that can go pure white or black
I merged checkpoints. I merged 80% Zovya with 20% Senblend
shit. I hate having to do things via trial and error.
alrigt... fingers crossed this doesnt ruin the model...
I am sold
nah G are u sure its the same model
yes its the same
but for now, I really need to sleep
as longs as its the ultra
oh, the website works now
or i think it is
oh i didnt see that
I will test the difference tomorrow
oh...
See if its worth 3x the size
its still 16gb 💀
TOLD YOU
Offset noise is to get darker images and more Contrast
anyways how do i make it do the ultra thing?
so positive offset = darker images and viceversa?
cuz the issue im having is that at cfg7 they already look overcooked and oversaturated
but it also works in the opposite direction, so I am not sure how people make models that can go pure white or pure black
I need to go down to cfg4 for them to look normal
sounds like an overdone VAE
its the same vae that im using for zegova
Yes the over8comer has also a lora for making image brighter
No I mean like
Noise offset models can go pure white or black
I am not sure how you train that
lmaoo let's just call it the zoidberg model
Here is the blog from the guys who made the offset noise:
https://crosslabs.org/blog/diffusion-with-offset-noise
I saw it, but its all stuff unrelated to dreambooth
I will have to read sometime, cause I am not sure how to use that slider to go brighter AND darker, like in models I use
this is why im such a square and never update, more stuff for me to learn with 0 documentation to back it up
cause it seems like negative would be darker, and positive would be brighter but like... how are there models that do both?
Its for dreambooth training
anyway another gen
Noise offset is a huge deal, it really makes a difference
yeh thats what I picked up from last time so I decided not to ignore it
it allows you to go way darker or brighter with generations
im gonna train at 0.1 and see what happens
Its model dependent, you need a noise offset model
I have a custom mix noise offset model, and man is it magical
hopefully no more pictures taken at the surface of the sun
if we can get a proper full quality Zozya photoreal with noise offset, then I would go as far as to say thats as good as it gets
0 clue
If I set it to 2, it will instead throw the second to last image as the result, right?
Have you guys ever actually seen just how amazing noise offset is?
it might also help with overcooked images?
ive seen examples
but if it works in my model you will see it first hand 😏
I LOVE ZOVYA
I don't have much faith, as I have never met somebody that knows how to train noise offset
yeah, but the problem with models trained that way is they will all be darker
gonna leave this training overnight and see what happens
I have used models trained at 0.15, and they are all naturally darker images
I hope so. at cfg7 they all look blindingly bright
which is not how its supposed to work
aight ima do 0.15
you're probably gonna end up with half assed noise offset unfortunately
zovya is making me extremely damn good gens
its what we've been telling u broh
i had to load the model overnight 💀
I am still using Cyber and edge of real over Zovya ATM
and then i learnt this morning that i needed the yaml file
I wish we had proper documentation on good noise offset
cause noise offset .15 is just gonna make all of the images a little darker, not actual noise offset
ima do 0.15 to see what the difference is
I have a feeling its a 3 step model merge process
main model, then merged with a second model with high noise offset (like .3) and images that are all very dark in the data set, tagged appropriately, then the same at -.3 with all bright images
then merge at 50% main, 25% dark and 25% light
any idea or is this chinese to you all too
0 clue
I wish I knew what every single contraption did
I really want a properly trained noise offset realism model
that will be end game for 1.5
ur about to get one 😏
its based on zoyva
noise offset .15 will not work
thats the best spidey chick ive ever seen
I keep saying that
so then what amount
its not a one number solution
but its a start right
either you have nutral images, dark images, or bright images with that slider
aight ima do 0.1
proper noise offset can do all 3. I have a model that can do all 3
and the difference is monumental
without
with
I wanted a dark and ominous rabbit at night
ima do 0.1 and see how that goes
thats the thing, with .15, they will all be darker, but the model I use only gets dark when using words like "underexposed" and "dark" which makes me think its 3 models in 1, at different noise offsets
cause it does this too
without
with
i want noise offset now 🥹
i want evil ominous bunnies
It is actually incredible
aw man that would make things way better
if my images are gonna be looking like that, ima have a heart attack
i was doing night time car things earlier
especially since I suspect torch 2.0 also is helping with the quality
and it wouldnt go dark 🥹
they won't ._.
we have had that for well over a month
my noise offset model I have is well over a month old
I wish I could train dreambooths just cause I am pretty sure I know how it would work
Noise offset is also why MJ gens look so much more moody than SD gens
I am also finally training with xformers now instead of flashattention o:
MJ has had noise offset for like a year
aight tomorrow's gonna be looking like a good day to run tests
training's gonna take 6hrs so perfect to run while I sleep
Don't be discouraged when it does create full noise offset, but it may help with your issue
Also, I have my own hacky noise offset Cyber Realistic model, and while the colors and tones look better, the realism is hurt
oi, does the scheduler u train with have any impact
cuz ddim has always been the scheduler of choice and I have no idea why
and when I tried euler a the training went 💩
the reddit post does say that it hurts the training
yo sen do you know if zovya can do you know images 
but I got a strategy to ensure I dont undertrain nor overtrain
im saving 8 checkpoints x 17gb = over 100gb of models lamo
It can

well, it can do women, no realism model can do men
but once I narrow down the good ones I will trash em and keep the goldilock ones
This is the difference between my non noise offset and noise offset versions of cyber realistic
no idea, and its not fixed
oh yeah it can definitely do the you know what gens 
yeah, if its women
it does everything broh I have no idea how that bastard managed it
magic and a 16gb model
he destroyed all models on civitai singlehandedly
I added 20% senblend out of pity
just hoping I can at least beat him in flexibility looolol
well apparently its what Im doing
or not according to sy
we will find out tomorrow ig
what you are doing is making images darker on average, which is a form of noise offset, but not the holy grail noise offset
but thts all I need. Imagine the extra contrast without having to lower cfg
anyways why the hell are the hands so damn good
the skin details will look real
ur a fool to believe 768 was any good
in certain cases it can be
I just merge the noise offset model into my merged models to get the nice Effect
Some models need 1.0 some need 0.5
yeah, only problem is they also shift in the direction of the noise offset model
so mixing a noise offset model with photoreal will degrade its realism
i only just realized that zovya is on front page of civit
bru it had like 20 5-star votes on the same day it was uploaded
I knew from when I saw it that it was gonna blow up
I didnt tried it with Photoreal models yet
That would be a good test
Yeah, it works good for stylized ones, but I have tried with several photo real models, and it suffers a lot
dude. torch2.0 + xformers is literally twice as fast in dreambooth
my last training took the same amount of time but with half the steps lol...
i have just done my first time using img2img properly
neutral exposure (SD signature)
Moody noise offset exposure (Midjourney signature)
That noise offset look is just so much more appealing IMO
I have an idea...
guys is it possible to change the brush size for img2img?
yes
how?
no worries, glad to help

how do you guys even generate images like this? mine gives really bad images
Good prompting, models, and 1000's of hours of experience
@bold cave I have made a monster...
43 seconds to load model
💀
ok at the start of this
i was very confused because it looked like a large mess
and now its going not messy
You need Community models for better Images, also good prompting, using negative prompts too
al alright, ima try different models, tysm!
through 16gb of pain :-)
._.
my drive is gonna be full soon
theres a 5gb version of the model
of wat model
zovya
where do I find models? huggingface?
looks like sfinx 🙂
@tight mural i like deliberate model and it is very small 2GB!
@hearty karma Hello cross breed 😄
my drive size isnt the main issue, the issue is cf storage is really slow
suggesting saving in jpg png disk is full very soon
🙂 yes it is challenging!
@tight mural looking forward to see your first image! 🙂
to your album 🙂 @hearty karma
how can i upscale or make something like this https://media.discordapp.net/attachments/1046079014156107896/1102781507904479322/nicequality.mp4
Mia Sara 1985
Not bad
crystal caves
@kind quartz original, CTU 1.0 strength to 1080p, and then 1080p re-CTU'd to 1.0 a 2nd time
.3 strength to 1080p adds more depth to her
whats that
FHD is 1920x1080
oh
would love to but huggingface started 404'ing for all the models
404 Client Error: Not Found for url: https://huggingface.co/api/models/Duskfallcrew/isometric-dreams-sd-1-5
@oak osprey thanks, must learn a lot about ctu!
np
https://www.reddit.com/r/StableDiffusion/comments/13bp1pb/here_are_some_stunning_highresolution_images/ thoughts on these? besides some flaws I think they look really good overall, any idea how these were created? model/workflow etc, Id like to recreate this but also train the face to be a consistent person
"loopback upscaler"
likely using MSE as the VAE and/or Karras sigmas as a sampler
yea was looking into that, but even the base render before any scaling, wonder how they got that
unless its a selfie, my renders come out very unclean for full body, would i need to upscale then inpaint the specific parts?
you're likely using VAE tiling if things start to go incoherent like that
i couldn't get the faces correct when doing full body shots but portraits went well
disabling VAE tiling eats more memory but it allows the MSE to do its thing better
i don't think i ever saw any tiling artifacts with the EMA version of the VAE, but it had the same garbage output when there was more going on in the shot. almost like it couldn't focus on the mountains/trees and the subject's face
some of these terms going over my head, with base a1111 i had this issue, not so much with models straight from civitai but when I train my own
I did get this vae recently https://huggingface.co/stabilityai/sd-vae-ft-mse-original/tree/main
ill have to explore the ui to see what this vae tiling is and how to disable it
it might be im running on 12gb vram so I do turn off ema settings for training otherwise it doesnt work
using ema versions of models produces better outputs for inference
ill have to retrain then, ill probably have to lower training steps per image to make room for ema
ill try to get an output I want from a base model before I worry about training a face
Overview: The Loopback Scaler is an Automatic1111 Python script that enhances image resolution and quality using an iterative process. The code tak...
Did you put her in a swamp since you are Shrek
Im trying to generate variations of this mascot logo. E.g:It wearing different clothes, different poses etc. Is there a way I can do that?
inpainting for different clothes for sure, different poses tho? i guess it could but not sure how well
id be interested in results if youre testing it
Is there a way to train my own model using multiple images. Lora maybe?
how much vram you got?
i personally use dreambooth, but lora for low vram yes
12gb
oh you should be fine then
happens when the cuda kernel hangs. watch your cpu use in resource mon. it will skyrocket
lmk how your training goes, im also on 12gb but i often have to compromise some settings so it doesnt run out of space
I'll try to learn dreambooth
now thats a hot lady
mixed feelings about my new model
I honestly think the noise offset thing is marring the quality of the model. the extra darkness looks fake af. but I havent tested it thoroughly yet
amen
I don't consider myself a religious person but she makes me feel like I could become one
for some reason i cant seem to get good skin on this model ;-;
which model G
because looks like it's pulling unreal engine 5 training data
the photoreal one
good lord something is getting fucked up with my training. now it cant even get tay right
yeah fellas please dont fuck with this noise offset shit
ima show you the atrocities it is creating
👀
wait it got fixed by loweing cfg 
let me do some more tests
cuz now she looks just right

do you see the extra darkness
it's there, but it doesnt look good
I might redoit but with something like 0.02
oh just learnt zovya has noise offsets in it
also I hate how this model gives everyone chickenpox scars
yeah I probably shouldnt fuck with it then
sytan is a lying sob
yea and thats hard to get consistent with, identifying features gonna be different in every render
how do i mess with it for my generations 😢
really?
yh
🥹
taylor swift after having a botched face surgery
^
btw sen this is a thing i did before
i fused some actresses
is that Emma Stwatson
nope, guess again
one of my dreamboothed girls. Im getting closer to making this shit look on point
sort of looks like a female mark hamil
I love the detail on her nose bridge
cara and idk who else
cara is first in row, correct ^^
ben shapiro is the second one?
i kind of see it
sen how did you get super detailed faces?
DDIM 90 steps, CFG 4
but these are just tests
I also think training multiple subjects in 1 model is somehow diminishing their quality, but it may be because Im naming them too similar to each other
naming?
i solve, its: cara delevingne, angelina jolie, christina ricci
yea cara is the overwhelming face here
howd you get it so consistently though damn
even with single subject training mines not as consistent
deliberate model
hmm, i always thought deliberate wasnt as realistic as others ive used
can you get full body renders out of this or is it just portraits
personally think the best model rn is Zovya
deliberate was among the best until Z came out
Zovya is godly
full body too 😉
i mean...
the thing is i havent even used restore faces on my Zovya gens
dont
and the faces are pretty damn perfect
restore faces destroys details
this is just straight from the model
used dreambooth?
emo gwen
i do not plan to use it
nah, local. bodies not as easy and good though
dreambooh is training not the website
yes it can noob
do what specifically?
peter ur outdated af
peter still thinks realistic vision 2.0 is the bomb
dont even bother. it doesnt matter what you show him, he'll find a way to naysay it
tbh realv has given me better results than most thatve been recommended over it, but its all prompts and setting adjustments
we've done all the testing here G
between me and sytan
there is only one clear winner
i think euler a might be the best sampler for zovya
we even did some blind tests here
but im only saying that without trying any others
Ok time to retrain my model without the noise offset and with even higher learning rate. Catch u guys in a bit
see you in a few hours
nah G torch 2.0 and xformers made it twice as fast
oh 🤔
what gpu you using
3090
paying for itself. the training uses up 22gb vram lolol
I might try doing a higher batch count too
for whatever reason i can never get the xformers option to show no matter what i install or what commands i put in batch file
whats the vram on that
24gb
wtf
whyd i get a 4070ti lol im dumb
oh nvm they expensive as shit no wonder
my trainings pretty limited with only 12gb i guess
what the hell am i looking at...

the bottom right character looks like ash ketchum mixed with mario
Yo I just noticed something
the final dreambooth models weigh only 5gb 😮
so I must be doing something thats causing it to lose data? it's gonna be a nightmare figuring this out
I think it may be cuz i've been extracting the ema weights 
is there a way to train in pieces so my 12gb can handle it
12gb can handle training
i often run into batch size error
doctor strange?
run batch size 8
i do
did
i stopped training
8 is too high
hmm. thats what I use. perhaps I should start experimenting with much higher values
I told u not to argue with peter
who is peter
theres so many variables to experiment with


