#🏞|general-with-images
Yes I tried but maybe I did it wrong. I'm lost...I see a LOT of information around the AI so maybe I missed something. For the moment, I write prompts, then I write negative prompts, I work with like 30-50 steps, I work with Euler A, PLMS or DDIM, I have the InkPunk.ckpt as a checkpoint, I understood the Seed, the CFG Scale and the Denoising Strength. I only work with Txt2img, helped by ControlNet.
I never trained anything (I don't even know what it is), I don't use LoRA (I don't even know what it is)...LOL
I have really bad result with Restore Face (I activated the CodeFormer model in the settings)
idk if there's a point in using restore face rn, since we have updated vae
vae-ft-mse-840000-ema-pruned
vae-ft-ema-560000-ema-pruned
If you have one of those , you shouldn't have face problems
(unless character somewhere far away from camera)
Yes I got this:
working with SD 1.5
@tropic shell when I tried adding the add on in blender it doesn't seem to be showing up
I'm a Blender user so check if the addon works on the Blender version you have. You'll need 3 or 5 different versions of Blender to be sure all of your addons work! 😅
Maybe it helped!
3.4
I do see Skinify plugin tho create a mesh from bones but idk if that it is
and that is only available in pose mode which idk anymore how I can get there
Sorry I really don't know what you're looking for but I allow myself to answer you because I'm a Blender user...I have BIG troubles with Stable Diffusion but if I can help with Blender...hehe! So the Pose Mode is like the Object Mode or the Edit Mode. You access it by pressing Ctrl+Tab
I don't see it there
If you selected the right element (a Bone) you'll be able to access the Pose Mode.
how do I add bones?
I haven't used Invoke in awhile. I do love what they have to offer! I only recently got Auto to work
how do I get Make Human to work?
Make Human is its own application! And you can import the model into Blender, then use the Blender SD plugin with it! Just google MakeHuman. You can also search for VRoid
You can export it as a fbx, or dae file
it doesn't show up in the blender add on...
vroid studio: https://vroid.com/en/studio
But you'll need the CATS plugin for Blender to import for Blender
Or again, MakeHuman
Once you have created your character, like so
hey!!! I remember now my teacher showed us this
but tbf I should stick with using it as a blender addon cuz I need it for school
You can then export this as a fbx
isn't there like a standalone thing?
Yes, it is standalone
Then you can import it into Blender
Ofc, you can do this with any 3D character, but since MakeHuman is free
I should unzip the zip after downloading it?
does it have like different hair styles etc as well?
Yes, although, I am actually working on some stuff for the community
Rn
To make it easier for ppl to use the plugin
woah nice
When I get the tutorials out, I'll post links
feel free to ping me with them if you remember
Anyway, once you have the .dae file, you can import. It will bring in a weird animation. It'll be distorted
Dont worry. Just reset the pose. Also, make sure the bones are connected
wait .dae from the downloaded zip? or when I exported the character from MakeHuman
ah so from makehum to blender?
yup
The eyes and eyelashes will be a little messed up but it's not a problem
You might also need to sculpt some adjustments
these are the files I got how do I open it?
Lol, I forgot the eyelashes. You'll need to adjust the alpha settings for the eye texture, etc
Anyway, I have to go now
I have an appointment
@tropic shell how do I open?
Did you make an icon on your desktop? Apparently, that file links to
think I found it ye it's on my desktop
euhmmmmm??
I got this now...It's better but it's really not what I'm looking for...it's rough, it's sloppy...
I have put it on alpha hashed but I still don't see the eyelashes?
How does it suck for you?
Because I see stuffs like that on the web...😟
alpaca is out for PCs. The best (30b) version can be run on 32gb RAM (not vram). Anyone know how it helps in prompts?
alpaca was trained in part from chat gpt3 but they're not the same. I just wondered if it could help with SD because that would be cool. It can help in writing projects though
well, for prompting I dunno I am on the side of the people that if you can't even think of a prompt that is pretty lazy and/or lame
No better than any other prompt generator really. Since it's trained with a different LLM, it might not align with the image prompt systems as well as we'd hope. Ideally, we want something that is trained with the same clip system to generate prompts, so that it uses tokens and phrases that affect the model the most
there is that as well
i think it'd be worth experimenting!
with the advent of SDXL everything is on hold for me
flow
see the above pic I posted?
i don't think it's so much as lazy as it is a move towards automation and batching. also, there's the bill gates quote i love. not sure if its real or apocryphal though. "I'd rather hire a lazy programmer because he'll find the easier way to do things and create the better tool"
that is some of what I am talking about that would be bad to train with due to the cold temp and glowing objects.
naw, he is too lazy to create anything let alone a tool
cutting corners is the bad kind of lazy, but there's also the good kind of lazy, like being smart enough to have a reaching stick instead of getting up out of your chair ! 😄
i see it
DB, and lora/locon will see it and bake it in
real, gen, or painted
then all faces come out distorted
I have no idea how I could change that in say photoshop
would describing the color grading of each pic in the caption help?
no, but cool_lighting, or warm, etc... warm_glow does but my lord the time involved for each pic to do that is immense
some I looked at and had no idea what it would be considered
i get that. i'm captioning pic sets of friends lately and it's exhausting. i really need to level up that game. learn some vi skills again
when it takes you hours to caption it then half an hour per try then go back and try to find the something you need to add it is hell
i get done captioning 20 pics and i'm just over the whole process
yeah, I had 81 and I wanted to commit murder. I was frustrated with some as I had no idea what the ai would consider it.
people training on 10k+ images astound me
yeah
base set is just unfathomable
75k-300k seems to be a big thing
i'm trying to get to a point where i'm happy about captioning 1000
thats my spring ML goal
See, why isn't there a tool that has AI and says "I see this in this image" now I mean the stuff that it sees when it trains YET we are not told about in the captioning models?
the lighting for instance. If it sees it to train on, and can be captioned away then where is the tools to make that caption for us?
How long until decent auto tagging comes you think?
I was thinking soon but now I am not sure
I mean this thing is screwing me over because it has cool lighting
or warm lighting
The interrogate function exists but idk how good that is for training
no model I have used even mentions lighting. glow yes but not lighting and its temps
text to image turns out is an easier problem to solve once cracked, than image to text description has been
GPT4 seems to have good capability for oddball instances , but it's trained on openAI's clip model not SD's openclip
I can't use interrogate as it is all OOM 😦
blip has stopped working for me in webui
the one in webui I can use but is pretty worthless the one in the extension they all downloaded but I just need 12gigs vram to run them
there's an extension? i didn't even look because the one in webui is there
Ideally we could get away from human language altogether. That's why I don't like to use text much. Ai is best without it
naw. we want to supplement natural language descriptions, not move completely away from it
what I wonder is for training I get the sense it really only trains style well if you have 5500Kelvin studio lighting
somethings are just much better described through text. if we start using emojis to describe ideas, then we just made another pictograph version of human language.
devolution is real
we are devo! d e v o!
yep
🎶
how would I colour gradient that pic above to work with training?
I tried normalizing histogram by hand for 81 pics and it baked in the glow around the gen subject that it saw in the master images from the normalization
colour grading? i'm not sure. in my experience (very little!) i'd assume that pic was fine for training! i wonder if you're running into the same problem that the illuminati diffusion guys trained against. The offset noise situation where it tends to average the luminosity of every generated image
i've got to learn about offset noise yet. i've seen sliders for it in the dreambooth extension
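For anyone reading along, here is a rough sketch of what offset noise does, as it is commonly described: the usual per-pixel Gaussian training noise gets one extra random shift per sample and channel, so the model also learns to move overall brightness instead of pulling every image toward average luminosity. The function name `offset_noise` and the default strength of 0.1 are assumptions for illustration. Real trainers do this on tensors, and the slider in the dreambooth extension may be implemented differently.

```python
import random


def offset_noise(batch, channels, height, width, offset_strength=0.1, rng=None):
    """Return Gaussian noise with one shared random shift per (sample, channel).

    Illustrative only: nested lists stand in for a latent tensor, and
    offset_strength=0.1 is an assumed value, not a recommendation.
    """
    rng = rng or random.Random()
    noise = []
    for _ in range(batch):
        sample = []
        for _ in range(channels):
            # One extra draw per channel, added to every pixel in that channel.
            shift = offset_strength * rng.gauss(0.0, 1.0)
            plane = [
                [rng.gauss(0.0, 1.0) + shift for _ in range(width)]
                for _ in range(height)
            ]
            sample.append(plane)
        noise.append(sample)
    return noise
```

With `offset_strength=0` this reduces to plain Gaussian noise, which makes it easy to compare a training run with and without the offset.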
no, because I had someone tell me, without even seeing my training data what it was. Sure enough they were spot on. They had 100 images and 2 had a green tint and from then on all people generated looked like they had vomit on their faces. He captioned the green light and POOF, perfect.
"Dark templar lighting" or whatever their protoss dark homeworld is called
for instance I have some wearing a halloween mask of a skull. Yepo, sure enough the people gen have a slightly elongated face but put skull_mask in the tags and train again gone to normal faces
"purple lighting" may not do it
@last pawn you answered some of my questions before about blender, what's this arrow called between the nodes? it's to create hair. (for those that are out of context, I am creating my SD generated character as a 3D model so I can make her do different poses etc and perhaps even train a mode on it)
from what i understand for captioning, you want to caption everything that you don't want to appear in every image otherwise the training generalizes that image concept as one that fits everything
I have heard it both ways but the tagger should see it. It sees most things but what it doesn't see is still a major issue
if i were smart about command line stuff still, i bet there is one command i could type with a regex in it, that would append lighting tags to a ton of files easy. for that process i would sort everything into folders of similar lighting, then run the command in each folder for each similar tag.
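The per-folder idea above can be sketched in a few lines of Python instead of a shell one-liner. This assumes the captions are comma-separated tags stored in `.txt` files next to the images, which is a common convention but not something the chat confirms. The function name `append_tag` and the folder names in the usage comment are made up for illustration.

```python
from pathlib import Path


def append_tag(folder, tag, pattern="*.txt"):
    """Append `tag` to every caption file in `folder` unless it is already there.

    Assumes each caption file holds comma-separated tags.
    Returns the number of files that were changed.
    """
    changed = 0
    for caption_file in sorted(Path(folder).glob(pattern)):
        text = caption_file.read_text(encoding="utf-8")
        tags = [t.strip() for t in text.split(",") if t.strip()]
        if tag in tags:
            continue  # already tagged, leave the file untouched
        tags.append(tag)
        caption_file.write_text(", ".join(tags) + "\n", encoding="utf-8")
        changed += 1
    return changed


# Hypothetical usage, one call per lighting folder:
# append_tag("train/warm", "warm_lighting")
# append_tag("train/cool", "cool_lighting")
```

Running it twice on the same folder is safe, since files that already contain the tag are skipped. It is worth trying on a copy of the dataset first.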
oh, I can do that already, sort of, but the problem is orange, warm, blue cold. what about the rest?
it only know warm and cold lighting so you have to figure out which is what
orange glow is warm_glow
red warm etc...
some I have no idea.
yeah. i would call that one "magical dark energy" or something. it's hard to intuitively choose a color grading for that
why not? lighting temperature only works because it's natural language and it has references to higher or lower temperature images
color grading is something different from color temperature
magical dark energy means exactly what to a blind person?
what would warm or cool lighting mean to a blind person?
would a blind person intuitively know hot means orange?
you'd be surprised
anyway, the point is training magical dark energy will not do as you expect
I lock onto dark and think of no light
they were told if they do. or they had sight once. we're talking intuitively. Anyways, we're here now. Arguing about the advice i'm being asked to offer. I told you how i feel about that image's lighting. I'd call it dark magic or dark templar energy since blizzard and gamesworkshop lean into that style a lot.
I didn't consider this arguing I just said it doesn't work like that in the real training world. Try it.
"go do it for me" another cliche people pull after asking for advice. Look, i'm not standing in your way at all brother. you asked how i would describe it. i did. "dark magic" has successful prompts with that sort of lighting too.
i'm not that experienced, but you came to me. I'm not doing your HW for you now because you're disagreeing and arguing with my idea.
What model is it? was looking for realistic model which can do magic
(kinda found one already, but wanna know other options)
yea, you can make things like that with right prompt and right model, but that's too big of a change to go from your image, it'll change completely...and well , it needs too...
So sorry, I fell asleep early last night
There is a video on how to do it, but it's by AItrepreneur
I know many people dislike him, myself included
Lmfao? Who on their right mind would dislike aitrepreneur
The guy is a saint. Or should I say the machine
If anyone should be getting hate is that bald fuck olivios l0l
i dont have anything against him personally, but i think he's a victim of youtube's profit cycle and he's been groomed to chase those view counts and click ratios.
His info is still legit for the most part
rushing content to release with hot takes that fire up the community for example
Ehh I wish he would focus more on technical aspects but I don't know any better alternatives
SEcourses does videos that are way too long
and others are just flat out annoying with their clickbait
Great, so I see Adobe just unveiled their new AI generator
Good thing it looks solidly behind Sd
UX though it's way ahead and they're only going to improve the model side
i think they keep theirs as SAAS
@stark vine @grizzled sage
My problem with AItrepreneur that I have come to notice is he gives really bad advice for a lot of things. Like his videos on training LoRA's/Textual Inversions are really damn bad. Not even just bad, but legitimately damaging. He spreads really bad information that damages people's understanding of how all of this works
I did catch him slipping one time I'm not gonna lie, but other than that pretty much everyone gives good feedback on his advice at the very least that's a good place to start
I was watching his lora videos and it was as if he was just reading from the first guide he googled and hadn't verified the methods at all himself. It's a trend of rushing content to release
What I'm trying to say is my main problem here is that so many people rush to defend him, especially in his comment sections, to where people who actually know what they're talking about get dogpiled under the guise of hating AI
It's happened to both I and @dense tapir
honestly i hate video guides since the youtuber makes it all about them generally. i just want a page to read through and study. I can pull in so much information so much faster than listening to a guy blab on about his week for 3min
His video on training LoRA's is truthfully terrible. He has no understanding of how it works, and he spreads techniques that actively damage the community
But if you try to comment on that, his comment section eats you alive
yeah a lot of youtube culture is its fan bases too. A lot of "celebrity" worship there and i gotta wonder why, they're just youtubers
not trying to diminish how hard of work it is to do, but wow. People get defensive and worshippy
I only click on his videos long enough to find what he's talking about, so I can research it from people who actually know what they're talking about lol
But yes, in the future as a little PSA, he does not know what he's talking about when it comes to training LoRA's, or TI's. Hell he probably has no idea what DreamBooth is either
Sure, the information from his video is what turned me on to LoRA's, which is cool. But the information in his video is also what took me so damn long to figure out how to do them, because his information is just objectively bad
and usually made very specific to whatever version was out at that time, because it's just rehashing online step by step guides instead of explaining what it is each setting does
I say just take everything with a grain of salt from him. He is far from a good source of information, let alone reliable information
I kind of disagree with that, it was because of him that I learned how to do dreambooth
And no, I saw his Lora videos and TIS and I honestly don't see what's wrong with them. Then again, I don't specialize on those because they just tend to be shittier than dreambooth
i like the longer videos when they're going into explanations about what a learning rate might affect, or why gradient accumulation helps, things like that. Instead of quick "do this step, just set batch and gradient to 1 because lowvram. no explanations here"
He probably doesn't know, but then again if you ask around here, everyone will give you a different explanation.
another misconception that comes out of youtube hype culture. "TI is shit", because they're trying to hype their DB videos for views.
they push these extreme hyperbolic positions and their audiences worship and adore them
He was also the first one to publicly call out what was wrong with dreambooth when it broke and how to get it fixed. I literally saw zero people give advice on how to get it fixed. So he gets credit for that
hyperboles become gospel and its just a wildfire of toxicity in the community. started because a youtuber wanted more views
Textual Inversions are less diverse than LoRA's, that is a fact, but TI's have their use
It's another tool and it has uses for sure. Hyperbole leaves zero room for nuance
I think the deluge of merges that peaked for a bit, where every new model was the same 5 merged a different way with a few differences, was because of youtube hype on merge tutorials
model merging is another great tool, but it got REALLY popular there for a minute
neat to watch this all go down though. Barely 6 month old field and all this stuff flares up often. I just wish there was some more "responsible journalism" surrounding coverage
( i hate alluding to that responsibility since gamer gate toxicity tainted the idea )
It got popular because it's literally the best way to generate images…
I would say anyone who's not using merges is a noob
we're all noobs, that's the thing.
how can you be an elitist when you've been at this a matter of months?
That's why I try to share as much as I can, cause I know how hard I work to forward my understanding
Oh, I saw that midjourney made a magazine
God I hate midjourney
Of course a scummy company like midjourney is everybody's first exposure to AI image generation
I yearn for the day someone leaks their model for the public
Their entire business model would collapse
I don't even want to use their shit, I just want them gone
R0fl
Greedy ass company
they're still very niche. soon it will be adobe
Luckily at least for now, adobes image generator is very far behind
What's that wallpaper app for cellphones ? it has ai generation now too
most in my local circle bring that up to me
zedge
Oh great, there goes my plans of putting my content on there
heh
God it annoys me
@tropic shell you know a faster way of doing the hair I want?
i dont blame them but yeah, it's annoying a lot of companies want to do what i'm wanting to do and monetize this
getting in my way
All these shitty half assed implementations of what can only be a watered down diet lite 0 calorie version of stable diffusion
hope you remember what I am talking about
At least I can confidently say that as of right now, stable diffusion is still the best AI image generation platform, and it's also free, which is a huge accomplishment we should be proud of, cause it's not SD that made it great, it's this community that did
Stable diffusion needs to learn that the only chance they have of continuing to stand out in this world of AI generation is to embrace their extremely generous community, rather than fight them every step of the way like they seem to want to
it's better than just free
it's open-source
For now, yeah
It's important to not smoke screen ourselves with what's now. Cause the truth is, a lot of people including myself feel that stability AI are just abusing this community to learn for free until they can steal it all back and monopolize it
wdym for now? Emad is big fan of open source and is CEO of Stability.AI he will never allow a paywall
or lack of transparency
hence it's the reason he hates OpenAI
Don't be so sure of that
With how shitty they have been to the community, I wouldn't wager too much on them as a company caring about us
flower, vibrant, vivid, intense, (masterpiece:1.2), colorful, vivid, (sharp details:1.2), extremely detailed, vibrant lighting, (highly detailed:1.3), (sharp focus:1.2), illustration, beautiful, (trending on artstation:1.2), cinematic, (8k:1.3)
Negative: (low quality, worst quality:1.4)
OpenJourney v4
look even if they go ahead and put it back to closed source what does it matter all the code has been released to the public we can just build further even when it's back closed sourced, so who cares if Emad would decide to put a paywall behind it XDDDDDDDDDDDD
Stable diffusion is definitely the best platform but definitely not the one with the best model.
Mj is SD but with heavy filters
That is not true
That's the indictment of capitalism. Those who have the most money get to hog the best product
Im sorry but v5 is insane
Meanwhile, SD gets no funding so who's going to spend all the resources in making a good competitor for V5?
I've been begging for a 1024 model for ages and I'm starting to think it's not gonna happen especially after the lawsuits started flying off
But since MJ gets to raking the money from all the subscribers each month, they can do whatever they want… It's a bummer
I really wanna see midjourney burn man
it is tho, MJ uses SD as base model
prove me wrong with sources
It's a lot more than that man
I can't deny the amount of work they've put into it
Midjourney is SD if stability actually gave a shot
*shit
Midjourney took the trash out of SD, but then limited it massively in other ways
No controlnet, no, Boobies no pussy, a really half assed inpainting
I'm so passionate about this, I just want stable diffusion to do good. But instead they choose to fight the community every step of the way, and they're making their shit worse because of it
Not to mention a discord-only interface lmAfo
Mj should only be used as training wheels for AI enthusiasts
If stability would swallow their pride and just admit that they are not the driving force of stable diffusion, but rather the community is, and they would come into the community and actively embrace feedback and resources from the rest of the community, stable diffusion would rapidly dominate
It's not that they don't care about the community. It's just that they don't wanna get sued out of existence.
It's the reason why the multi-billion dollar dalle 2 looks like dog shit compared to a standard stable diffusion model.
Blame the social justice warriors and all the snowflakes from twitter
No it's both of that, they actively choose to ignore the community
I sub him, but I am not being mean here but he is an idiot. He was telling people, flat out nonsense, and wasted so many people's time. I sort of went on a crusade to help them and I had to break it to them that his "special" numbers and saves were 100% BS. When I would bring anything up to him he refused to answer. Fact is I have yet to see him actually talk to anyone in his comment section. If he does it will probably be from someone licking his feet or something else.
That's ironic given that what you're doing right now is giving fake news.
He most certainly responds to all of my comments. In fact, he was on the Unstable Diffusion Discord not too long ago, answering questions, and talking to his fans.
I don't wanna stan him but also examples would be nice to see what he's doing wrong because I would say he's like 85% accurate on the stuff that he teaches
You would be about 60% inaccurate lol
I said I had not seen it. Fake news. Go fuck yourself you ...
I use his vids now just to see what is popping but far more creators out there, like koiboi, that go into GREAT detail about subjects not clicky shit. Sad thing is in the age of the tiktoker the clicky bait wins and the ones with very good info are "boring" and get little subs. Idiocracy.
I agree
@stark vine I would encourage you to go out and find information on these topics on your own, so you could see just how bad his info is, but you don't seem like the type of person to do your own research
Please, try and use his tutorials for TI's or LoRA's and let us know how it goes over, cause I can assure you it won't be well lol
He's not a bad person, I don't think, but he's far from accurate
wdym no controlnet and half assed inpainting??? and NSFW can be trained very easily with your own models with SD as base
I'm talking about MJ lol
ah XD
literally the only thing they have going for them is that badass model
I would suck dick to dreambooth that shit
just use SD model that is trained similarly as MJ and you're off
the base for that model is SD
nah G v5 is like base 768 or 1024 at the very least
also has more params
see some of their gens, the amount of detail cannot be replicated with any sd model sadly
the base is SD but of course they added more to it
but they didn't build from scratch
Not with that attitude at least lol. I'm pretty confident I could match it
let me grab some examples
just off the top of their showcase
the tiny details are really hard to emulate in sd
the style yes but the stuff like the tiny creases in her gloves are too intricate for SD for the most part
You can do inpainting n stuff but you cant get it done in one go
Maybe it's just me, but that doesn't look that impressive to me, IDK
Feel like I've seen plenty of SD subreddit s that look better
That looks like 100 other examples on civit
another example of tiny details
those are pretty damn hard to get in SD
not to mention the custom resolution
midjourney level images have been possible since before 2.x dropped. people just prompt poorly then blame the entire system
So by tiny details... You mean grain
Nah look at the bubbles and the water detail
Bro, if you think SD can't do that, then you're high or not doing things right lmfao
embeddings are total shit though so they won't help achieve these results right? even though midjourney is certainly using extra networks on their backend too
of course! they wont help get that detail at all! 🙄
There is your problem lol. Too limited, too slow, too clunky, and you inherit all issues of the base models
how do you even help someone who insists they know everything already?
r0fl. So ur saying TIs > DB
Dreambooth only makes sense for huge data sets
For specific things absolutely lmfao
embeddings are a useful tool in a lot of cases
Just sit and watch as they fail lol
Those youtubers who told you TI is dead because long live DB, well, they were just hyping you up for click bait
they lied bro
LoRA's too lol
I might have outdated info but embeddings only worked well on 2.x models
on 1.5 they were pretty poopy
What lmfao

You are a work of art man. And I mean like first version of SD work of art
first you think pixel upscalers are better than diffusive upscalers, which they aren't
Then you start spouting off nonsense about MJ v5 being insane, when its still really unimpressive, and now you are shilling dreambooths and saying embeddings and other networks are worse than dreambooth... Come on man
Even if you were a broken clock, you would have been right at least once by now
You can do that with SD on higher resolutions and better.
but yea, MJ*knows more words and easier to prompt things
I guess it depends on the case usage
Since I pretty much only do people, no Lora no TI nothing has even come close to me than the power of dreambooth
Dreambooth recreates people better than anything out there, I don't even bother combining db with a TI just because db gets it so right the first time
not to mention db has gotten buffed over and over again. I can train 2 subjects now with like 30 mins of processing
no idea about dreambooth
ur missing out
on what?
I don't even want to waste anymore time with somebody so closed minded
SD without db is like peanut butter without jelly
Something more exact?
Please don't fall for their over hyping of dreambooth
essentially training ur own subjects lol
nah dont listen to the haters
dreambooth is flat out magic
no, its not lmfao
idc if its 4gb a pop lol
worth the space
just imo tho. I have like 30 models I can vouch for
I don't do training anyway, my gpu won't train shit, so whatever
Dreambooth is only reasonable for huge datasets
I think simple stylized stuff is really good for AI regardless, especially when making in quantity. Less detail, way easier to edit in PS/CSP, etc.
do u have any loras or tis of specific subjects
Yeah, tons lmao
cuz if u wanna put ur money where ur mouth is I'm game
wanna see my WIP Na'vi LoRA?
Blue people from avatar
ooh
yeh share
no, I am not sharing anything with you
You are not worth reaping my time and efforts when you are so misguided and obliviously wrong about all of this
do something for yourself for once
I can literally prompt avatar high resolution and get that picture
let me see some custom shit
do it then, I'd love to see you try lmao
exactly, now keep your trap shut
😆
I don't think any of us want you here
maybe some cows, people like cows
please, just stop feeding people false info man
I have multiple cats, I am a cat, but I can still love cows. I don't see any incompatibility there
There should be a rule or something in this server against people spouting straight up nonsense, ISTG
then we wouldn't have to deal with people who mooch off others while feeding people misinformation
I am fine with people being wrong on their own cause its funny, but when it draws back others, it sucks
yeah I'll be honest there
So many of us spend dozens if not hundreds of hours to work on this shit, just to have a troll tell people its bad/useless, while also asking us for help? lmao
Making it really hard to want to give back to the community rather than gatekeep
I don't like false info, and I'm not sure we do have a rule about that specifically, mostly sure not.
But I also didn't look up the whole chat and I frankly don't want to start and say who's right and wrong
why not just stop there when you disagree that much and feel the other person isn't genuine ? just block them and move on ?
don't harm yourself in the process I would say mostly.
Like the famous cow story says :
Once upon a time, there was a false fact that claimed cows could predict the future. According to this false story, cows would line up in a specific order before a big storm, and their positioning would indicate the severity of the coming weather.
People started to believe in this false fact and would consult cows before making important decisions or planning outdoor events. However, as time passed, it became clear that the cow predictions were not always accurate, and people began to question the validity of the cow oracle.
Despite this, some people felt obligated to continue discussing the cow predictions, even though they didn't really believe in them. They would argue over the accuracy of the predictions and become increasingly heated and argumentative.
Eventually, the discussions turned into full-blown arguments, with people berating each other and refusing to listen to any opposing viewpoints. It became clear that the false cow prediction had caused more harm than good, and people decided to put an end to it once and for all.
and if you feel there is really dangerous info going around in there, I'm all for checking it on our side too, and in those cases, using the ⚠️ reaction or "report to staff" application lets us do exactly that
kill da cow, kill da cow
Because I would rather people receive help over hindrance. I spend HOURS working on this stuff, just to be questioned by somebody whos not even willing to work for their own stuff. Its depressing, and it brings down the collective of the server.
I feel very strongly that people who routinely give out bad/false information should receive some sort of role that indicates that they are not exactly reliable as a source of information
i'm more of a cat person than a dog person. dogs are needy and need that constant "what next boss?" whereas cats just bug you when they need something and figure the rest of their business out on their own
also, sometimes, i'll use dreamboothed models AND textual inversions at the same time 😮
how are we supposed to give back and support this community when people are actively trying to spread misinformation about important topics? It's frustrating and demoralizing
I'm not even talking about anybody specific anymore, I should clarify
I just mean in general now
Sorry if I get/got cross. I am passionate about helping people not hindering them
that's a very logical and well positioned argument.
About the first part, I reiterate, don't let yourself feel berated/diminished when you know your stuff. but yeah, easier said than done.
I'm not sure we should tag bad advice givers, but I get your point though...
and yeah, I feel you on the misinformation being so depressing. I'm fighting against scams and trolls half of the day, so I can see how much we get too.
I'm also quite tired and I can't say I'm in the best position to analyze correctly the complete situation, so I'll take the time I need on that and chat with the mod team too, I get that this is a real problem that does impact you and others.
and no, don't feel sorry for spitting your guts. but do preserve yourself, call on us if you feel you are getting in a position of danger too.
i'm reminded of how i was having problems and someone insisted i swap to use an older python for many reasons. 3.10.6 i think it was. insisted like they had good reason that it would cause instabilities with pytorch.
after i solved that initial problem for other reasons, i kept the older python and couldn't ever figure out why i had so many issues with NaN errors. a month later i updated python to a newer 3.10 and all my woes faded. once i started taking the initiative and finally learning more about python instead of just doing what i could based on advice, i came to find that there is no reason to recommend the older one other than it being in a wiki on one unupdated page. all the "it causes so many errors" was just made up to sound expert, i feel.
i've come to stop trusting advice that lacks explanation and beat my own path since then.
Thank you for remaining level headed, I appreciate it and I will try to as well
It's hard for me to keep my cool around people who so confidently spew out nonsense like it's a game. Somebody who complains about why their stuff isn't good/doesn't work, demands other people help them, shits on their work/advice cause they don't listen to it, and then from there on preaches gospel that it's all bad.
Like why would you ask how to do better and then write everything off as bad before shilling the things that are making things worse for you.
I give so much back to this community, and I just want it to be a place where people can learn, and ask stupid questions without being fed misinformation or be made fun of.
I wish I had some better form of way to share my/other peoples genuinely amazing information on here for people to see, rather than an echo chamber of people being hostile and downing each other, which I realize I am now a part of because of my actions, but it comes from a place of caring for this tool, community, and the betterment of AI as a whole
Sorry to bombard you with my great wall of text, I am just really feeling tired of people trying to make things actively worse for others while telling them to not listen to the people who are actually dedicating massive amounts of time to the betterment of this tool as a whole
Also, felt the need to post this in response to some false claims about MJ V5 outdoing SD in terms of detail. no inpainting, no editing, straight out of SD (cropped cause she got a little nippy lower down lol)
I think its pretty clear that stable diffusion is still winning
very very high quality images. but because of ||nitpicked subjective taste issues|| it's not as good
I'll make party cat come out
Party cat?
but yeah, hard to say A is better than B imo.
Its a mix of skill issue, lack of experience, and various other things
I can agree with colors, but the argument was made that there is no way SD could match the detail level of MJ V5, which is just... clearly false lol
i do agree the colors on MJV5 are better, but you can color grade the SD output as well
i certainly have some metaphors to describe this aspect of internet culture. It's nothing new of course. disrupting discourse has been something people have done for fun since usenet
color grade is different from model to model too, idk if mj is better there
I don't care for people in this community trying to put down... well, this community
People in here trying to discourage people saying competitors are doin better, and discouraging them from even trying
"you can't"
it's not a fair fight to say "SD" or "MJ" already. like what SD? the base brick, or the extended feature set with custom models and style randomization?
what MJ ? the black box that we imagine is just a model, or a more detailed pipeline with models, prompt interpreter, hypernetworks or embeddings ?
And also for what "better" result ? Can I get my very simple photo of a cat in MJ or will I be obligated to have lots of artsy artefact around it even if I don't want ?
Here take another party cat
I put maybe 5 minutes of work into that example. I could easily make something WAY better
haha, this one is dope lol
I think some people do want MJ though. A black box you just yell at and it does stuff. But then the best MJ users get into advanced API stuff with it instead. So it's a mixed bag still
there's just a lot of "you can't" all the time
One more Chester. Enhanced the prompt a little
we need more neon lights 
I agree, I am not saying MJ isn't good, cause it is, and it has its own use case and user base that it serves very well. But to actively put down a community you are a part of is just so toxic, OMG
thinking MJ is better is cool but i do think there's a bit of skill level difference with more advanced tools and a magical black box with presets
If you wanna just have a fast, simple, and fairly good looking output with no hassle or additional work, by all means, use MJ
for sure, lots and lots of people do want MJ. it's fun, it's good, it's effective.
Lots of people need the more versatile version that is SD though.
Like any professional that would need to train its product in their model.
exactly, for sure. A lot of us here are high level/advanced SD users, myself included. It's not unrealistic to expect that MJ will get better results for the average person over SD. But the part I hate is the idea that "SD could never"
when it quite literally does lol
Imagine if Stability.ai and midjourney worked together
Could buy that mall kiosk pocket knife or get a proper tool that Mick Dundee would carry
(this is #🏞|general-with-images people, don't forget your pictures if you want to make an argument, temporary rule)
mall knives are cool and useful too
Midjourney is too money hungry to ever consider doing something for the betterment of people around them lmao
maybe crocodile dundee is a bad analogy
Unfortunately
Stability acquires MJ, now they will
Still think SD is better thanks to community who makes extensions, models and modifications for it.
that party cat is hype
rip
Stable Diffusion is infinitely more powerful / diverse than Midjourney is, simply off the fact that Midjourney is so closed down and censored, and they share absolutely nothing for the betterment of AI for the masses
inpainting. boom. done. i've always said that.
they automate some common remix tricks too like masking out a subject, resizing or mirroring it, etc..
probably have a script that randomizes a few subject removal and positioning tricks
i suspect they have other tricks for the way they blend two images that might be more advanced and possibly novel
i'm thinking something about the way it feeds both images into the noise alternating. its a neat looking mode. not sure what they're doing with it tbh
here's the big example google pulls up
too lazy to gen more at higher resolution or fixing hands, but that's my contribution on that 😄
Light prompt too
i think now a days with controlnet and some jiggering between clipvision and color inputs, you could blend two images. i haven't played around on those fronts yet
See! It's not hard to mimic/surpass the detail level of MJ haha
His image is 900x2k or something, I mean...what kind of detail you expect
MJ just isn't worth what you pay for it. do the math and buy a SD capable machine instead of a subscription
my thing in 768x768
That's weird upscale, noisy
I asked before...what model are you using for it?
Was looking for realistic model which can do magic, already have one, but another option would be nice too
I use a couple but I mostly use the one @smoky oak uses the realistic 1.4
the image is not optimal because of the lyrics, probably. But I've seen/done much worse.
lyrics?
see EXIF on pict, and link on top of image
One of my favourites. ❤️🔥
oh...I mean video upscale is weird, I can see white/gray dots everywhere on 4k
ah, have lossless album on my comp, don't need listen it on YT, but yeah
it can only be an upscaling, making videos in 1080p at that time was a dream
yea
@wispy nest I use Realistic Vision 1.3/1.4 99.9% of the time
Still so glad the creator got off his high horse and gave back to the community
RealisticVision for the win
underwater water drops lol
Spongebob kind of thing
only now that he made the model available
I vibe haha
50 steps instead of 20
heh, the foot is backwards
the whole bottom leg too
yea
SD doesn't know right from left
theno ose
o o
o o
oh, the double horizon mirroring with the cloud bottom looks reallyyy nice
I have been meaning to see if I can train my first style LoRA
I would share on what style, but some people here 
like to steal ideas before people can make them
So I will keep you all updated with progress, assuming all things go well
thats totally me with the DF model i want to work towards. sorry. i'll get it done first though really! i swear
A stunningly beautiful girl. Cinematic in nature, this hyper-detailed scene is filled with insane details and beautifully color graded using Unreal Engine. The use of DOF, Super-Resolution, Megapixel, Cinematic Lightning, Anti-Aliasing, FKAA, TXAA, RTX, SSAO, Post Processing, Post Production, Tone Mapping, CGI, VFX, and SFX has created an insanely detailed and intricate world. The hyper maximalist approach and hyper realistic Volumetric and Photorealistic rendering bring out the ultra photoreal and ultra-detailed aspects of the scene. With 8K and super detailed visuals, this scene bursts with full color and Volumetric lightning, using HDR to create a realistic and breathtaking environment. Powered by Unreal Engine and rendered in 16K with sharp focus, the intricate details of this scene are truly mesmerizing.
@green plover @dense tapir Ok, so adobe firefly is awful lmfao
I take it as a challenge!
I am happy to see that Adobe Firefly is nowhere near as good as SD or Midjourney, and is in fact quite horrible lmao
And if it says Adobe on the box it probably comes with a price tag. Probably a monthly one
excuse me, how do i use my seed to recreate the same photo
i dont seem be able to enter the seed anywhere
Somebody just released a comparison of firefly vs SD, and its fucking HILARIOUS
Directly below CFG scale
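For anyone wondering why re-entering the seed brings the same picture back: a seeded random generator always produces the same starting values. Here is a toy sketch in plain Python. It is only an illustration, not how the web UI is implemented, and `make_noise` is a made-up helper name.

```python
import random


def make_noise(seed, n=4):
    """Return n pseudo-random values from a generator seeded with `seed`.

    Hypothetical helper: a stand-in for the initial noise a generation starts from.
    """
    rng = random.Random(seed)
    return [rng.random() for _ in range(n)]


if __name__ == "__main__":
    # Same seed twice gives identical values; a different seed does not.
    print(make_noise(1234) == make_noise(1234))
    print(make_noise(1234) == make_noise(1235))
```

Keep in mind the seed alone is not enough: prompt, sampler, steps, CFG scale, resolution and model all have to match too.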
@smoky oak you see what I've been doing with SD and iClone?
am i using a wrong version?
oh, this is not normal SD, best of luck
What interface is that?
yeah
ask in #1025467151206854736
thank you!
which url do you generate your prompts?
I use automatic1111. it's a local install
Is there a link for me to get it
one sec...
Thank you DF
Sorry for the delay. I'm baking a couple normal maps. Slows down my system
Thank you!
NP
This prompt breaks the record of max meaningless for AI words I've seen so far 😄
hm...
idk what happenned to gravestones there, but I still like that image 
LMFAO
oh...I think I know what happened lol...
I put "grave stone" into prompt as 2 separate words
although other gen was fine...hm
kinda, it added just "stones" too lol
Besides the classic photosh00p does anyone know an easy way to inpaint a specific object? I wanted a rusty wooden cross on the blank space to the left but inpainting even at max denoise doesnt work. I remember there being an easy way but I already forgor
Also, after further experimentation I can definitely vouch for the ultimate upscaler but only for far away subjects like my girl. I would say it is slightly superior to controlnetting the original image and regenerating at a higher resolution
Then inpaint the artifacts and badaboom, no more warped faces on far subjects, no need for facefix
For closeups, the artifacts become too much of a pain to sh00p out, classic upscaling is preferable until we get a dope 1024 model. That's my two cents on the extension
Which 1024
This one? https://civitai.com/models/20842/char
People have recommended it but it seems to be pretty inflexible
I'm currently genning with this one on 1024x768, but idk if it's actually trained on that high
never seen this one
did you get it from civitai?
I havent even tried newer models since RV1.3 lol
Just because the blend I made works so well, and db models arent transferrable between models yet sadly
yea
also does anyone know if training a lora with 1024x1024 would work on 24gb vram?
There were other models that said they were trained on 768 or 1024, I just don't even remember all the models I have...
It would work, but it would probably be useless
Unless they are all 100% 1024, then you're gonna have artifacts and distortions all over the place sadly
Yeh I figured, thats why I havent messed with em yet
That's why the 768 SDv2 sucks ass
It was trained on 512 most of the way, and 768 at the end
wut
It led to some very weird issues, like cars being super stretched and flat, and people having abnormally long necks
Parts of it are, but not consistently
768 is the 512 v2 model, further refined with 768
yeh idk why so many ppl advocate that crap model
There is a full 768 prune that somebody did, however the results out of it are terrible, just like all stable diffusion base models
even their custom models I dont touch with a 10 foot pole
better clip knowledge
The base models aren't meant to be good, they're just meant to be diverse. They give us the base models to train into several focused high quality models
All of the base models are basically raw data with no proper direction
i've seen some 1.5 models that were refined on 768 too. while it's not an ideal 768 generation result, these extended models do see improvements
Jack of all trades, Master of none as everybody likes to say
2.x is very diverse, however it uses a completely different text encoder, which makes it extremely hard to get specific ethnicities or poses out of it
if you cant make b👀bies with it it's garbage
yeah that philosophy is important. people keep treating the base model like it should be hyper stylized midjourney
Stable diffusion V2 should be significantly better than stable diffusion v 1.5, but sadly they introduced an absolutely abysmal text encoder that makes it nearly impossible to train, and then on top of that they decided to withhold information from the community
That is why you can get significantly better results out of well-trained stable diffusion 1.5 models
they also explicitly said they removed nsfw because of kids being in the model
so I know for a fact it's adulterated
They went back on that in Stable Diffusion 2.1, I believe
I'm pretty sure 2.0 had less celebrities and less NSFW, and then for 2.1 they brought it back after backlash
the question is, who will be the brave adventurer to train a native 1024 from scratch with as much variation as 1.5?
Don't quote me on that, I know a lot of things, but I'm not certain on that one lol
nah both stink
That would cost millions of dollars
2.1 i thought was extended off 2.0 and not a fresh start
However, let's just say that @green plover And I have some extremely lofty goals of retraining stable diffusion 1.5 all over again using completely free open source and non-intringing photographs, while also improving stylistic understanding and consistency
*non infringing
I don't want to explain the exact process here, as we are both slowly working out the details in private, and we don't want somebody else to mooch off of our work
any ideas why my auto1111 suddenly looks like this? I tried a full clean reinstall and all
sounds like a bad idea
Oh Jesus Christ
2.x from what a lot of people say is already trained on copyright-free images
That may win for the most cursed a1111 installation I've ever seen
and errubody knows what a steaming pile of pooh that turned out
but its not an installation yet since I can't get it to actually run
The problem with stable diffusion 2.x is not in its data set, it's in its text encoder
Stable diffusion 2.x should theoretically be better than stable diffusion 1.5 in every single way, however it's using a significantly worse text encoder, which makes it impossible to unlock the potential
And to put that into perspective, a lot of people like to bring up how much better Midjourney is at delivering on what you ask for than Stable Diffusion. That all comes down to their text encoder. It's better than Stable Diffusion's
2.0 should be but isn't, in their effort to make it PG13 they screwed something up
It has absolutely nothing to do with them making it PG-13
Just for experiment, have you tried other browser?
The failure of stable diffusion 2.x has nothing to do with the data set. It's entirely the encoder
This is a factual statement, that is the exact reason why ControlNet doesn't work on it
yeaeh I also cleared cache and cookies
In fact the developers of ControlNet themselves quite literally told Stability AI that they need to get their head out of their ass and fix it
yea, that was my thought...
inpaint that hole in the middle of the suit into heart shape please?
I should try to restore my Kaggle account...my Google Colab doesn't work anymore, genning on my gpu kinda sucks
I managed to add a custom ui after struggling and I think it partly fixed it
very partly
no actually not at all
maybe it's one of the extensions which adds something to the UI?
which colab does not work? lastbens'?
What GPU? I may be able to speed it up drastically depending
but I said I did a clean install
Define your clean install, please
I don't think anyone but me using it here lol
it's sagiodev's , at least it was broken few days ago, idk , maybe fixed now
as in what? The entire folder, the venv, the original cached files?
Doubt it, it's 1050ti lol
Haha, i could
did you update python recently?
no
oh god, you can even get it to start with that?
that is impressive lmao
2gb VRAM
wow
I am genuinely amazed it even starts!
unless it was through a venv but that would be localized
yea ,I have no problems with it, it's just super slow
it's 4gb vram
That is insane. I guess Xformers is working magic
Imma erase venv from this auto build and rerun it and see what it does
uno-momento
best of luck
you may need to delete the cached files as well
that way it directly downloads all of the components, in case one of the cached files is messed up
I am not sure sadly, though I might know who can help
yea I'm out of ideas on that one...except fixing css manually and saving it somewhere or using plugins which adds custom css to specific sites
I am very new to all of this venv stuff
Im just gonna do a whole new reinstall
@dense tapir Are you here by chance? If so, do you know how to delete the cached files used to make a venv?
but still need to know cache
Let me see if I can find that
ok, ok
so it looks like you can add --no-cache-dir
DO NOT QUOTE ME
that should stop it from checking for previously cached installations
why do I like weird images sometimes more than actually decent ones
I still want to delete them though cause they'd be using up a shitload of space
Its a common thing. I tend to like some AI wonkiness haha
alright, let me see if I can find info
no problem. Love learning new things, and I hope I can help too haha
@pastel mango Do you know what version of pip you are on?
pip 23.0.1 from G:\Programs\Python\lib\site-packages\pip (python 3.10)
ok, so it looks like this command should work
pip cache purge
it should get rid of all cached/wheel files
I love it how people have to chime in with whether or not someone else's project is a good idea. When you take on a project, you can decide on the viability of it
have a look for yourself if you would like
right
ERROR: pip cache commands can not function since cache is disabled. apparently
hmmm
I am in uncharted territory sadly
I know 0 about pip or python. I am just a very fast learner and I am good with google haha
so when you make a new VENV, does it say its using cached files?
I dont know
gonna run an update on my gpu drivers first, before I do anything else
safer to do it first
%LocalAppData%\pip\Cache
- pip cache location on windows
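If you want to see how much space that folder is eating before you purge it, here is a rough sketch. It assumes the default Windows location mentioned above and reads it from the `LOCALAPPDATA` environment variable; `default_pip_cache_dir` and `dir_size_bytes` are made-up helper names, and a custom cache location would not be picked up (running `pip cache dir` should print the real one).

```python
import os
from pathlib import Path


def default_pip_cache_dir(env=None):
    """Guess pip's default cache folder on Windows (%LocalAppData%\\pip\\Cache).

    Returns None when LOCALAPPDATA is not set (e.g. on Linux or macOS).
    """
    env = os.environ if env is None else env
    local_app_data = env.get("LOCALAPPDATA")
    if not local_app_data:
        return None
    return Path(local_app_data) / "pip" / "Cache"


def dir_size_bytes(path):
    """Sum the sizes of all files under `path`; a missing folder counts as 0."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            try:
                total += os.path.getsize(os.path.join(root, name))
            except OSError:
                pass  # file vanished or is unreadable, skip it
    return total


if __name__ == "__main__":
    cache_dir = default_pip_cache_dir()
    if cache_dir is None:
        print("LOCALAPPDATA is not set, so this does not look like Windows")
    else:
        size_gb = dir_size_bytes(cache_dir) / 1024 ** 3
        print(f"{cache_dir}: {size_gb:.2f} GB")
```

Run it once before and once after `pip cache purge` to see what you got back.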
yeah
doing an install with studio drivers for nvidia for more vram optimization
meanwhile I will make a comedy conversation by people who don't exist, with people who don't exist
that pip purge command worked for me 👍
had 7gb, now 2kb
relatable

weird how it doesnt have auto cleanup
its actually quite nice for some things
oh, that's nice
ah got most down now
do i need to download anything else to be able to use this?
The UI is normal again @smoky oak
well there wasnt really a mistake, it just sorta worked when I went to bed, then it didnt
it might have had to do with an extension or a python update
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Install-and-Run-on-NVidia-GPUs Here's the install instructions
oh dear... looks complicated for a non-tech person.. and ill probably need a good hardware
these kind of things can be run on laptops? or not powerful enough. im not going to do too crazy things
damn I lost what extensions I was using though
some toolkit, some additional network, and the rest I think I got handled
actually got additional now too
ok got model toolkit too, yay
Depends on what GPU you have.
👍
I'm sorry, I had to share
My mom and I (mostly her) spent a ton of time working on her spice collection, and I wanted to show the fruits of our effort
These are all of the ones that can fit inside the drawer. The rest are stashed in our garage pantry
I'm just glad that she uses them all. My mom cooks authentic Chinese, Japanese, Korean, Vietnamese, Cajun, southern soul, Indian, Mexican, Italian, you name it, she cooks it lol
I find your lack of cloves disturbing.
She goes to every extreme length she can to go to the most authentic places, and talk to people of native culture in order to get recipe straight from the source
Not all of the spices are in yet, we had to throw out our cloves because the original bottle broke in moving
All of the empty ones ended up shattering and we had to get replacements
Fair enough. Otherwise quite cool.
Though I would suggest grating/grinding nutmeg fresh. It is easy to do and has a much better flavor than pre-ground.
All the extra ones she doesn't normally use are either here or in the garage pantry lol
She typically does that as well, most of our herbs are freshly grown as well
She just has all the stuff for all the reasons lol
She takes pride in her collection haha. Most people try to give her shit for having tasteless food just because she has light skin lmao
Fresh herbs are the best. I need to set up an indoor garden for them.
She is actually out buying a whole new harvest of plants for the season. Now that we are in our new house we are going to be building our own corrugated metal freestanding planters with a water irrigation system in our backyard. We live in the middle of the desert, but this section of the desert is shaded quite well by a large mountain behind us
I also do aquarium keeping and aquarium plant growth, so all of my residual fast-growing plants can be decomposed into fertilizer for her plants. It really is quite self-sufficient
It would be pretty crazy to set up, but you should look at aquaponics. I saw a home setup a few months ago, and it was quite the thing. Especially if you like eating fish.
I love that herbs are easy to grow, though. Managed to keep some basil in just an apartment window and was having to figure out recipes to use it because it grew so fast.
BRO NO FUCKING WAAAAAAAAAAAAAAAAAY
@fiery lichen I just came up with the most insane method of producing high detailed images with the ultimate upscaler
with minimal photoshop involved. this is insane
check out the examples. sorry that I had to censor them lmao. Left is before, right is after.
Check out the before and check out the after in open browser. Night and day. No artifacts. Not just the face look at the right hand
Video TuT coming soon? 😏
my aquariums are all nano tanks (sub 20 gallons) so there is no fish eating for us, and probably not enough nutrients to grow plants. But my floating plants grow like weeds (I may or may not have 11 species of floating plants that I shouldn't lol)
And by like weeds, I mean more than double in volume every 5 days. So I can grow an insane amount and use them as compost to fertilize our garden. thats the goal at least
I'm officially addicted to this shit.
Good. Glad you are experimenting for yourself
Glad you decided to give it a shot after all
I think you owe @smoky oak a thank you
good work. a lil write up would be good, vid would be great.
Yeh I need to play with it even more but yes I'm glad you introduced me to this tool cuz I initially thought it was shit
If I can make it work even better than I did with that example ima drop a tut
it just takes a lot of gpu power and sadly the extension is glitched, it doesnt generate batches
I am assuming you did multiple upscales on the same image back to back?
As in you are batch upscaling, or what? I may be able to fix it for you @stark vine
No I just need several different upscales of the same image
If you can fix the code that would be sweet
otherwise I have a workaround. I might also dig in there to see if its an easy fix
I might even get greedy and find a way to use this with canny I aint even bullshittin dawg
nah it cant work with canny cuz it doesnt split the mask
if it did tho, it would work with higher denoising
cuz it would stop it from generating garbage
it may seem like it works at lower denoising because it doesnt change much
@stark vine Canny can work with it, I can prove it right now
you just have to do it differently
you have to save the canny image from the annotator and then feed it back in
You can see I saved this canny map
and you can see the hair in the water
it does work. Only issue is they need to be the same base res I believe, cause you can see mine is not lined up. Which shouldn't be too hard for the same image
gonna see if I can get a better result with it on the base image itself
aha, I see
so its hard cause the canny map res needs to be the same res as the final output
which I might be able to do, just a sec
ok, I was able to get canny to work is what I am saying
the upscaler works with the image in chunks
I used the canny of one image while generating a different one
yeah but that doesnt make sense
yes, and it chops up the canny map as well
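To picture the chunking: the output is cut into a grid of tiles and each tile is processed on its own, so any control map has to be cut with the same grid to line up. A minimal sketch of such a grid follows. This is an assumption about the general idea, not the extension's actual code (which also deals with padding and seams), and `tile_boxes` is a made-up name.

```python
def tile_boxes(width, height, tile):
    """Return (left, top, right, bottom) boxes covering a width x height image.

    Tiles are `tile` pixels square; tiles on the right and bottom edges are
    clipped to the image bounds.
    """
    boxes = []
    for top in range(0, height, tile):
        for left in range(0, width, tile):
            boxes.append((left, top, min(left + tile, width), min(top + tile, height)))
    return boxes


if __name__ == "__main__":
    # A 1024x1024 output with 512px tiles gives a 2x2 grid.
    for box in tile_boxes(1024, 1024, 512):
        print(box)
```

If the canny map has a different resolution than the output, the same boxes land on different parts of the picture, which is why the map and the final image need to match in size.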
lol let me try it
but thats how it works
im sure it doesnt
look above
or well rather, no it doesn't
sorry, I misspoke
you need the canny map to be the same res as the final output, then it lines up
let me try
ok, I was able to get it working and properly lined up with a high res output. I am trying it on a higher scale now
an actually good image, i mean
how do you get the canny map to be the upscaled resolution
you have to do math and trick the canny exporter
just regular upscale it as an image?
you figure out the res of the image you are upscaling, and then multiply the smallest side by the amount you are upscaling
then you take that result and make that the res of the annotator
then you preview it, save it, and feed it in with no pre process
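The math from the steps above, as a tiny sketch. It assumes the annotator resolution refers to the shortest side of the image, as described in the chat; `annotator_resolution` and `upscaled_size` are made-up helper names.

```python
def annotator_resolution(width, height, scale):
    """Shortest side of the base image multiplied by the upscale factor."""
    return int(round(min(width, height) * scale))


def upscaled_size(width, height, scale):
    """Final output size, which the saved canny map should match."""
    return int(round(width * scale)), int(round(height * scale))


if __name__ == "__main__":
    # Example: a 512x768 base image upscaled 2x.
    print(annotator_resolution(512, 768, 2))
    print(upscaled_size(512, 768, 2))
```

For a 512x768 image at 2x you would set the annotator res to 1024 and expect a 1024x1536 map.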
I am about to do it
ok, it seems like it may be detrimental to the process
I am about to see if thats the case
@stark vine Canny makes the results wayyy worse, OMG
it does work, just not beneficially
Low res base
normal 2x upscale
2x upscale with canny
Here is the canny map
every other setting is 100% identical
same seed, everything
so canny does clearly work here, but it is not beneficial to the process
the issue with the current implementation is that the canny pass has to run on an image that is 4x lower res than the canny map itself, so I am sure that makes a lot of noise
let me see what the results are
you just can't let it process the canny live
I... just showed them lmao
they are above
it makes it look worse
but that doesnt make sense, why would it
all the canny does is tell it not to color outside the lines l0l
damn still getting mad artifacts
7 min till the final thing is complete tho
I'll be the judge to see if there's an improvement
the reason it's not working is that it's having to draw a canny map at 4x the res of the actual image it's looking at
so its probably very noisy (I could also have guidance too high, we are in uncharted waters after all)
I have used depth with ultimate upscale as well
yeah I can already tell from the previews its way worse
well sadly I was right lol. sort of. I mean it can process it it just doesnt do what we expect it to do
Yeah, we were both right
it does "work" but it doesn't benefit
so it works, but it doesn't... work lol
its at 1.5 weight in mine
I am proving that it does "work" by putting one of the other canny maps ontop
it just still doesnt do what its supposed to
is the map supposed to be black lines with white bg or viceversa
yes
ok, its very hard to see here, but there is a different canny map ontop of this image
this is the canny on top
you can see the hair waves on her left shoulder
wait thats black bg white lines
here, so it is clearly applying, but it's not working how we wanted
I gotta try this at 2 weight, I just got greedy with a 4x upscale lolol
ugh I wanna see the final result already
oh, did you set the annotator res to 4x the base image res? if not, its not gonna line up
yeh I upscaled the canny map and set the rez to the same proportion
but Im still confused why it shows black bg white lines
oh, you upscaled it. Curious to see how that works
on the preview, yet in the output it's inverted
oh waiitttt
and I have no way of knowing if its reading it correctly cuz it looks like ass as it usually does
let me try it with inverted colors to see if it works
yeh I have no idea which colors its supposed to be
trying inverted, weight 2x
yaaas
I am seeing both online, so IDK
only one way to find out lol
fuck around and find out time!
My favorite heh
canny is white lines, scribble is black lines
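If a saved map comes out with the wrong polarity, flipping it is just `255 - value` per pixel. A toy sketch on a plain nested list standing in for an 8-bit grayscale image (`invert_map` is a made-up name; for real image files Pillow's `ImageOps.invert` does the same job):

```python
def invert_map(pixels):
    """Invert an 8-bit grayscale map given as rows of 0..255 values."""
    return [[255 - value for value in row] for row in pixels]


if __name__ == "__main__":
    # White lines (255) on black (0) become black lines on white.
    edges = [
        [0, 255, 0],
        [0, 255, 0],
    ]
    print(invert_map(edges))
```

Inverting twice gives back the original, so it is a cheap thing to try when unsure which polarity a preprocessor expects.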
ok, so canny IS supposed to be black with white lines?
well fuck it, I am trying it backwards lol
I already have one
we also need to try low guidance. It could be too strong
no the issue is that its not guiding it well enough
and that could be messing things up. Maybe it just needs a subtle push
if it did we wouldnt have bs artifacts
bro, inverted worked way better
I gotta try that
its still not the bestest, but its a lottt better
I think the weight is too high now
not inverted
inverted
she has teeth again, and her eyes are less fucked
alright, trying some weight variations
.5 first
oh man, if thats the case lmao
that could be the case lmao
however, the disabled version looks way different (identical settings) than the inverted one
this is the disabled one
nah its still ass for me
hmmmmmmmmmmm
let me try with my method but I aint optimistic cuz right off the bat I saw artifacts
ok, this is off vs 0.5
kinda sorta, I think
off vs 0.5 on
actually, they look really close
only difference seems to be in the hair
ok this is strange.
I can see the white lines on the output
looks like absolute dogshite of course, but its weird that those lines are there
let me try lower weight actually
nah still a bunch of horseshit artifacts. what a disappointment.
I guess if you think about it, even if we can get the proper map it still wouldn't work, let's say it's just working on a chunk of hair
what in the heeeeeeeeeeell happened here
the prompt still says "photo of xyz high definition face"
mine look worse, but not that bad, jeez
its gonna try to apply that prompt to the hair alone whether canny is there or not which is gonna look like bullshit lol
nah the solution is native 1024 model trolol
or my method 😏
I'll try to do a good prompt tomorrow with my method, if u can fix the code that would be pretty neato mosquito
im too lazy to try to fix it myself cuz i suspect its not an easy fix
when you upscale dont use any descriptive words for the subject. Use hi-res, masterpiece, etc
well im trying to replicate the exact same image
So it looks as similar to the original as possible
with more detail of course
weird, I tested that for a while and found it yielded way worse results
I do suppose I have a better setup now. I could try it
you'll waste your time
if the image doesnt match then piecing it together wont work either
I have time to waste in the pursuit of better SD stuff. How do you think I found all the tips I did to get SD upscale good enough to do a 6x upscale with no detail loss? lol
I am testing!
insanely better results with lower weight on canny
actually hold on
let me do without it next
that seems to be because its just paying less attention to it 😅
it seems to do way better when there is no canny, from what I can see lol
gen your image, regular dimensions. Use hi-res fix to make it a bit larger within vram you have. This fixes most problems. Then ultimate upscale.
