#🍥|anime
1 messages · Page 148 of 1
thanks
anyone know a model that would produce results like this?
With those nice colors and depth of field?
I’m thinking models from ctd might be able to achieve that
what is that haha 😅
If you want, you can try some of her models
Closertodeath
I think if you search that with huggingface in google, you can probably find her profile
ill try
But this also probably requires a lot of Lora
what's lora
Lora is a concept that can be loaded into other models to “teach” them
Ok, color me impressed with Bing's DALL-E 3 integration:
ive never seen that interface in my life
how would i go about doing that
LoRA stands for Low-Rank Adaptation. It allows you to use low-rank adaptation technology to quickly fine-tune diffusion models. To put it in simple terms, the LoRA training model makes it easier to train Stable Diffusion on different concepts, such as characters or a specific style. These trained models then can be exported and used by others in their own generations.
Stable Diffusion models have been gaining popularity in the field of machine learning for their ability to generate high-quality images and text. However, one major drawback of these models is their large file size, making it difficult for users to maintain a collection on their personal computers. This is where LoRA comes in as a training technique to fine-tune Stable Diffusion models while maintaining manageable file sizes.
LoRA models are small Stable Diffusion models that apply smaller changes to standard checkpoint models, resulting in a reduced file size of 2-500 MBs, much smaller than checkpoint files. LoRA offers a good trade-off between file size and training power, making them an attractive solution for users who have an extensive collection of models.
Basically if you want to generate a character, but the model you used don’t know the character, then you can download a Lora someone made of that character and use it
Thank you chatgpt
😄
i just downloaded automatic111
You can find Lora at civitai
Beware, there’s a lot of nsfw stuff there
LoRAs are network extensions for Stable Diffusion. They're a type of model that extends into a certain concept or idea that other people have trained. They help your existing model reach into more difficult to gen concepts, or styles.
Simply use <Lora:name of the lora you added to the Lora folder>
Okay, since everyone is around now to help, imma go back to bed
gnight Smol

Hello, and Goodbye/G'night Smol!
Have a nice rest
Hello and goodnight everyone! 
how do i download these models https://huggingface.co/closertodeath
Take it slowly, it’s not gonna go away
You are at her profile page, go into one of the ctdmixs
And there should be a lot of files
File ending with safetensors/ckpt are most likely model
Unless it’s really small, like less than 1gb, then it’s likely Lora
You almost always wanna use file with safetensor tho
You should understand what you're downloading first. I'd start with Civitai.com, as it's easier to download and see what you're getting. I'd just make sure your browsing mode at the top right corner of the screen is on safe, to avoid most of the... more unwholesome elements of the site.
except there's like 20 safetensors
Safetensor file will be “safe” from executable malware
Yup, so you can pick one to try it out
Huggingface really isn't the best place to start when you're still trying to figure out how everything works.
That’s true
Maybe save ctd profile as bookmark for now, then go to civitai and find models
Find something that you kinda like to test it out and get used to it
Yep. Start slow like that.
Before using huggingface and stuff
Experiment. See how things work with your existing workflow and when you change things. Always keep the same seed number when testing, so you can see the difference your changes are making.
@untold glacier if you are gonna stick around, teach about the vae as well
No idea tbh, I think most of her stuff are good, just need to get used to the prompting style
You need a VAE, like Smol was saying. Lemme pull one up.
This is already pretty good
where do i put the downloaded file?
In your VAE folder. Within your SD installation folder > Models.
for reference this is what im used to..
Then you'll need to select it within Auto's interface.
Well, you will be able to achieve something similar soon
hopefully lol
you guys seem so knowledgable, im curious to see what your generations look like
Abe...
In technical terms, there’s actually vae for all models. VAE is needed to convert those noise to pixel, or something like that, not sure the correct terms for that
You got this ❤️
If you post multiple images, can you please post them in a single post? We're trying to talk at the same time, lol.
Having knowledge doesn’t mean good generation 
They can vary, as many of us like to change, try, and experiment with new stuff all the time.
anyone know why im getting a grey thing now 
tf do you mean, your gens are great
What have you changed since last gen?
Did you interrupt it? 
i just added the vae in the folder and restarted it
nope.. i ran it twice to see if it was a bug or smt
This takes me back... I seem to remember this meant something early in my days of genning. Lemme try and remember.
I mean, it’s not bad but can be better if I’m not lazy
appreciate it tho 
do this
What’s your gpu?
3060
Then you don’t need medvram
whats that
You can just use xformers
Yeah, I agree with the commandlines above, aside from the --medvram and the --opt-split-attention.
^
--opt-split-attention should be on by default (not require a command line) for quite a few versions now.
thanks for helping me btw guys
Really needed is just xformers, other optimisation methods can be toggled on and off in settings when webui is up
Except for xformers
Somehow my eyes glazed over the no half vae. Lol
DON'T put no half vae.
so the same thing happened...
It's a private one, but I'm using some LoRAs tho
Yeah, there's a reason for it. Lemme try and find what I remember.
okk
I don’t really remember this issue so 
Ok, bad news. You're going to have to delete your venv folder and have it reinstall the guts by starting it again.
https://civitai.com/models/24584?modelVersionId=29405 (1.0)
https://tusiart.com/models/605586978534138789

great! i love that so much


Hahahaha... this is one time that I can feel the sarcasm through the internet.
wait, u need to register now?
Oh well... You need a chinese number now... so...

Guess I can't use tusiart anymore
Does it need to be a Chinese one?
Yes

It fucking sucks
Imma try with my aMeRiCaN one.
so many good shit in there locked behind a chinese number
?
ITS STILL GREY
Well... I'm waiting for a text... but I probably won't get one.
I DID THAT FOR NOTHING
It's reinstalled already?
yeah

I even tried using those public chinese numbers to see if I get lucky and nothing

Well, I'm not finding anything on the web. My next suggestion would be to start a new installation with a different folder, re-getting everything how you did before, whether redownloading or pulling from github.
It obviously worked for you at the start, so something changed after that.
https://tusiart.com/models/609957850803901222 I really wanted to download this 
使画面和构图偏向电影风格,且能在一定程度上增加画面细节和增强光影,作用于超宽画幅的风景图有很好的效果使用时请注意:权重推荐0.75左右可以尝试配合一些光影增强模型使用,会有更好的效果不推荐将权重设置到1模型推荐使用WhitepaperMIXC站链接:https://civitai.com/models/10140...
Isn't that this...? https://civitai.com/models/101402/movie-style-filter

It's even in the discord description of the link, lol.
all of them are on civit i believe ..not under the same name
Yeah I think the other one was too.
Not reading moment
That LoRA does look nice though.
I hope it doesn't look like shit when I try it
Hmm, judging from the preview images it seems a bit overtrained. But it might still be useful at lower weights.

Lol
Suspicions confirmed.
Probably need it at 0.6 or less to mitigate the overtraining. 
@untold glacier turns out it was a problem with the checkpoint
Fantastic news.
I recommend Aurora, if you know booru tags. Or DarkAlfa if you want a more natural language prompting.
let's take the gens one step further. do you know how to remove the grain there
Are you using the VAE?
yes
maybe he didn't select the vae?
Pretty sure that's the right VAE. Are you sure you're using it??
oh no it was js a bad model
And you for sure selected the VAE to use within the interface itself?
you had to do that?
Lol
yes
Yep
I customized mine, so I can't remember. @unreal ridge ? Can you point it out?
settings>VAE
same
mine is at the top
Darn lol
can't find on settings
Yeah, it's not there
It's on settings... somewhere
oh ok
Huh. So it just plain removes it from the settings if you put it at the top of the interface. 
Nice album cover.
im getting better and better generations
U using hires?
started from this
whats that
it downscales the generation and then upscales it again for better quality
how can i check if that's enabled
set steps higher than 0
I use it at 15 steps/0.45~0.52 denoising
Also, try starting with this in your negative prompt area. Any further negative prompting you want try adding it in the area before the break:
BREAK
(disfigured, unclear, indistinct:1.3), (bad anatomy:1.3), misplaced hand, misplaced foot, (text:1.3), (signature:1.3), (title:1.3)```
I don't remember where I downloaded my upscaler
alr let me try thta
Additionally (sorry, I know this is all a lot to start with) download this and put it in your Models > ESRGAN folder: https://openmodeldb.info/models/4x-Fatal-Anime
i always download directly in the ESRGAN folder 😄
how do i fix this
lower resolution used
i used 150 sampling steps 🙂
U don't need to go that high
i wanna see the max potential
Most of the samplers converge around 30.
Steps doesn't have anything to do with resolution, also, I don't recommend higher than 50 steps for anything, lol.
32 steps is max for most models 😄
I'd say 50, but it really depends.
A good starting sampler is DPM++ 2M Karras at around 20-50 steps.
Yeah any of the DPM samplers actually converge a bit earlier if I remember right.
25 steps is a good number for a DPM sampler.
Also lower your res or use CPU offloading.
If you still run into memory issues, then at that point I would put medvram in the commandline arguments, and then restart SD.
As long as you're not trying to hires upscale at absurd levels.
using vectorscope extension
I think he has the idea by now, lol.

Local AI image generation ..lots of experimentation and headaches .. but once you get it it just works
this still happens at 50 steps
Are you using hires fix?
yes
At what scale?
2x
Then, at that point I'd say: #🍥|anime message
weird solution but try enabling or disabling if ON gpu acceleration on your browser
Oh, and ensure you don't have anything else taking up your GPU memory. Firefox, other things, etc.
what's sd
Stable Diffusion
how od i check that
switch to Opera GX 😉 not having any issues with that on my gtx 1070
I'm ComfyUI Pilled
ive not used auto1111 in two months now
My brain hurts whenever I look at comfyUI
I liked the idea of Fooocus refiner technique but you can't use any LoRAs or anything else with it anyways.
Open Task Manager. Go to Details tab. Right click in an empty area where the sections are and click "Select Columns", then add the Dedicated GPU memory and Shared GPU memory columns.
there's no select collumns for me (im on win 11 btw)
its done
Erm, firstly close the Refiner tab. Then if you want to use latent upscaling, then use Latent (nearest-exact).
That's where it stretches the latent image (the image that it has in mind before generating it out) to fit the upscale. I'd recommend usually using an upscaling model like I linked earlier though.
I also recommend setting it to 0.4 or 0.5 denoise.
this is the progress so far:
Wow. Getting much better!
You have to make sure your task manager is showing you everything. I believe it's in some sort of simple mode by default.
Something's grabbing her hair though 
Man, can't think of any prompts
M I K U
grab/modify anything from https://docs.qq.com/doc/DWHl3am5Zb05QbGVs
These prompts are ridiculous and artless.
i got this again
i did that already
what settings are you using?
did you add --medvram to the webui-user.bat commandline args yet (and restarted the webui)
Moment
As Kuromi said, did you restart it?
yes
Well... then the only thing left to do is to get the Tiled Diffusion/VAE extension and set the settings lower.
Go to Extensions Tab > Available > and look for Multidiffusion Upscaler.
Fooocus might be a bit better to use (has better optimizations for 4GB and 6GB VRAM cards
Could be
I've not used it.
needs more passes 🙂
i have 16gb of ram though
I love Auto1111's interface and workflow the best though.
VRAM is what your GPU uses, RAM is seperate
how do i increase that then
Goofy legs
Just download more VRAM.
HAHAH
I'm kidding. You'll need a new GPU.
But you could just follow what I said above, with the extension.
That will split the gen into tiles for the upscale.
🙂
i have to restart sd after every gen to get rid of that
how do i clear the memory
Just so you know, this is what it looked like for me to gen something with similar settings to yours. It usually always takes 6gb to gen something at that size.
On Auto's build of SD anyway.
As far as I know, you clear the memory by restarting SD. 
I haven't heard of any other way.
https://github.com/IgorMundstein/WinMemoryCleaner/ this might help ..but not sure as it is not for VRAM
Uhhhh. Yeah I wouldn't use that.
If you want to move to something like ComfyUI or others, I won't be of much help to you. All of these are different builds for SD.
It's a lot more complicated to figure out, but here's ComfyUI: https://github.com/comfyanonymous/ComfyUI
That depends on the size of the gen.
13-14.
https://github.com/lllyasviel/Fooocus#download
one click install pretty much
I was looking at it at first and thinking, "Well that's not too bad..." before seeing that the "s" and "it" were switched around, lol.
@supple raptor You seen this? https://developer.nvidia.com/blog/unlock-faster-image-generation-in-stable-diffusion-web-ui-with-nvidia-tensorrt/

it is what is powering text to 3D scene experiments yea
4096 px frames in < 2ms on RTX 40xx and TensorRT cards
Oh damn, 640x640 is pretty good
i'm building engines for all resolutions i use xD
and because of that, this thing now exists

tasty
wtf did I just watch
xD
that's... why
WHY?!
what did humanity do to you, for you to punish us with... THAT
He is your creation. Do you not love him? 
i love all my children
with a shotgun to the face
ai based animation?
Pika Labs Ai image to video animation yea
this failed gen reminds me of system shock
at least she's covering up her uhh... private parts
kinda overcooked much
hot
or would that be cold in this case?
Lol
the good model finally downloaded
why is cat in Venezuela?
which one?
first one

well, i did specify post-apocalyptic... i suppose the model has some thoughts about what resembles it
what makes it venezuelan, actually?
Could not not understand why my upscaled pictures looked like someone washed them with soap, turned out custom VAE encoder kinda important sometimes X.x
it was just a joke cus the country isnt in a good place
would Chicago have been more accurate?
this feels like lyricsprompting
its not 😄
she made herself as an android and now they've fallen in love (narcissism)
better version ❤️
K Loki
wildcarded the animals 😄
thats hot
worked pretty well on jolteon... slowpoke however...
She's certainly on a mission.
tactical panty-flashbang
bit overcooked ...
her trash bin is under attack
pokemon!
zoom out and credits 🙂
oh cool
Difference between using TensorRT Unet and not. It was a pain to set up but seems to work now. Its limitations are many though, and I can't seem to build a Unet model that's large enough to use for a Hires. fix.
With and without Unet.
dis spearrow
unet?
the tensorrt engine stuff?
You use it with Hires. fix?
yeah, built 2x single models
using a mmodel that supports both is incredibly slow
(in my case)
but building the 2048 model took me all 24gb of vram i have
The RAM I used balooned to 40gb with python and the thing just froze when I tried to build a unet with it.
yeah, it's SLOW and takes up huge amounts of resources
And I was just trying for a 1024x1536 hires model.
How slow is slow though? Did it freeze on you? Use your RAM in large amounts?
it felt like freezing up... i think it was occupied for like 30 minutes
same, someone mentioned to try tiled diffusion but it seemed like even if the tiles should fit the TRT, it didnt' work
Yep.
Sure, brag it up 
what's the non-TRT time?
was the key just building a 2048x2048 TRT?
yeah, built a 1024x1024 model and a similar 2048x2048 trt
does it still switch TRTs partway thru?
switching models
TRTs you mean?
You use static or dynamic models?
oh the static one
dynamic -> else i can't make an engine that supports 75 to 450 tokens
but keep it on single res only
WORTH.
thats over 2x speedup
@untold glacier this one suicided it's run on your pc?
I only tried a static one of those, not dynamic, and only with 150 tokens.
I'm going to make a few small sizes for aurora first before switching to trying to make a larger BreakdomainAnime one.
do it like this

keep optimal to 75, and only change max prompt -> this way it'll support 75-whatever tokens
if you're on HDD -> perhaps double resolution models, but if on SSD -> try making a model for every size, i feel that's WAY faster in inference
(for sdxl)
Most definitely an SSD.
Yeah, it almost seems like it would make SDXL viable on auto's build.
i mean, 40 seconds for a 50 steps restart, 15 steps highres?
It's too bad I can't queue up these model builds, heh.
What GPU you running?

Lol
Maybe just a little
3080's have less Tensor cores to use, so I'll see less benefit from it. Still, it's quicker.
my pc has never had so many upgrades since sd/llm bullshit came around

this is 2048?
that's fro 1024x1536 only
but that makes sure you can run short and long prompts
so 512x768 for highresfix?
if you want to make it support both, you'd have to change it up a bit
both what?
both resolution
yeah
sd1.5 might benefit from both in the same engine
you'd think the tiles might fit into a smaller TRT
sd1.5 is fster, so loading a new engine wont'win out
I found the switching time of TRTs (even on SSD) to negate the perf gains
but for sdxl, a specific engine per engine feels a lot faster
what's the switching time? mine feels nearly instant

i mean... the full 2048 image is done in 40 seconds, including the engine switch
solution -> no more loras
Imagine making a model for all 2k+ LoRAs at multiple resolutions.
yeah... for lora abusers this tech is just totally useless
not just useless, garbage tier even
makes inspecting songs for songmaster material a lot easier tho
Why the hate 
prompting masterrace!!!!
moving all lora's,models,vae's to stability matrix so i can use all of it in Fooocus/comfyUI and Auto1111 without duplicate all over my drive
oh wait... that's literally just me 😢
symlinking everythign?
saw on reddit sd.next had some updates... anyone tested it?
hmmm... this feels like a pretty huge fail for songmaster... only a single interesting result
Aside from using LoRAs though, it's always fun to push the boundaries of what a model is capable of. I've done that for Aurora and DarkAlfa. Composable diffusion is an underrated force.
i think regional prompter supports lora's too
Also, why put the optimal tokens at 75 and not somewhere in-between?
because it changes the minimum tokens with it -> that's what it did after i tested it
Ah
and if your engine generation takes 30 minutes, it's a very costly bug to have
and most prompts are 75 or less tokens
ғᴏʀ ʏᴏᴜ
Most of mine are probably around 150 or less these days, since I use BREAK for everything.
they should be for you too -> unless you're staying close to what you already did
I still don't get BREAK
dude... i BREATHE the BREAK
we all need a BREAK sometime
brought to you by kit-kat
That increases the tokens though right? It's padding after each break to the next token limit.
yes, what break actually does is pad the current group to 75 tokens
so that prompt automatically is 225 tokens
how does this help me tho
seperate style from subject and composition
Oops. Sorry, got talking with Eface lol. Lemme get you what I said yesterday.
for me -> a HUGE thing. for anime model users, a lot less (i'm using a generic non-anime model)
tokens have stronger influence the closer to the beginning of a block of 75 tokens they are and their effect cascades down so token 1 will have effect on something like tokens 2-20, then 2 will effect 3-21, etc. They have minimal effect on each other accross separate blocks of tokens so token 1 will have little effect on tokens 76-150.
the word(token) blue on token position 1 in group 3 would like have a word with you... -> this is why BREAK is a good thing (hint: the image was... BLUE)
So the most important word should be the first out of 75 , gotcha
this is a good songmaster prompt!
Yep, the first 15 tokens or so are where all of the most important words should be and try to separate differently colored things throughout the block, usually with the ones that are inherently the strongest towards the end.
https://mojim.com/usy255151x1x1.htm try for this song 🙂
is generating ^^
You see that TensorRT Nvidia SD Unet model stuff yet?
It's a pain, but it's crazy.
all depending on model too -> if a model responds very strong to specific word, it's generally not smart putting them in the first token position, best fixed by delaying their introduction until later [tokens:0.2]
Yeah, I gave up on setting it up because it was just giving errors and I'm not going to store and switch between 10M different ONNX models for the different resolutions, models, and loras that I want to use.
wished windows file copy/move was as fast as most game installers ... why does a 140 GB game install faster than copying a 30 GB folder on the same drive
(it's not nipples)
Other people had the errors too, it was a not too difficult fix according to here: https://github.com/NVIDIA/Stable-Diffusion-WebUI-TensorRT/issues/12#issuecomment-1768564187
*Fixed my link
After I had the extension installed, I just cmd lined:
pip uninstall tensorrt
pip cache purge
pip install --pre --extra-index-url https://pypi.nvidia.com tensorrt==9.0.1.post11.dev4
pip uninstall -y nvidia-cudnn-cu11```
setting up tensorrt was a pain for real
I also have only 16GB RAM so even making the size that I would want would be a huge pain.
the extesion is pretty fuckn buggy to set up
this is the gens i got with those lyrics 😄
I usually gen at 544x960 with 2x hires fix so I'd need a 1088x1920 engine to run my hires fix.
Yeah, it froze up on me for a simple 1024x1536 hires fix model build earlier. But maybe I didn't let it run for long enough.
gonna remove the backseat part tho -> it's horrible for ALL generation that have it
THERE IS TWO
no wonder i got it all the time
i'm using my verser script
Tangled up in knots, feeling lost in this uncertainty. Frustrated, caught in twists of this life complexity
| Suffocating in the grip of my shattered dreams. Chasing rainbows, drowning in shattered moonbeams Tears streaming down, I'm searching for reasons why
| I'm breaking free from these chains. No matter how hard it gets I gotta keep on moving You can't take my life away I just wanna be free Please, set me free Let me be me
| Barely survive in this ocean of inconsistency. I'm the emotional storm, longing for simplicity Find more lyrics at
| Thrown myself in the backseat since forever. I'm drowning in the ocean of my own frustration In the darkness, I lost my way Those laughs keep echoing
| Set me free
| Tears streaming down, I'm searching for reasons why. No one to blame but me, myself and I
}```
Hey what starting res do you use for 16x9 with 1.x SD? I've alternated between a few, but since I'm trying to build a Unet model I need to narrow it down to one.
I'm leaning towards 800x448
ah
fixed lyrics 😮 they go into songmaster!
Ah, it looks like there are still stricter resolution limitations for TensorRT.
yep, needs multiples of 64 in both directions
Just wait till we get consumer 64 GB VRAM TensorRT GPU's 😄
yes. that consumer part was important
i mean..
544x960/1088x1920 is going to be the best option.
put the gpu outside because it will act like a house heater on its own
-> my house heating strategy for coming 'winter'
that's 1 small blessing
this looks like a fun prompt!
If it's a multiple of 64, it goes from 512 to 576. I'd have to find another set. 
oh right. I have the dumb 
No worries, I did the same thing lol.
512px tiling 🙂
that's probably why I got an error when I set it up the first time. setup went smoothly and then I tried to make that engine.
am i just that lazy clamping to multiples of 256 in ui.json?
That bottom right pic on the left side: in D&D when you cast Melf's Minute Meteors.
Ah, could be.
then after it broke my auto-complete extension, I uninstalled it and had to rebuild venv and I haven't been able to get it to install again since.
second run on this prompt. damn, this one can run solo!
I wish the clipboard handling in A1111 was more reliable
maybe it's cause I'm using webp that I can't copy-paste from txt2img to img2img
and you can't drag and drop from 1 website into A1111
this is whatever is on your clipboard indeed
copying a webp is not the same as copying a png or jpg
I imagine they just didn't implement a paste handler for webp?
because you can drag a webp from FS just fine
Alright. Wish me luck, I'm trying for a 576x1024 build.
you can though, at least with PNG. I do that all the time to grab PNG info from discord images.
good luck! i hope it succeeds
hm can you drag one of my imgs?
never got that to work
yep
the lower left tho... that one vibes with me hard
hm
I can't in firefox
are you dragging from discord app or discord website? I'm on website
same for me, on firefox
Gradio hates browsers that aren't Chrome
click image, open in browser, drag from that browser into webui.
🥳 SONGMASTER is over 500 lines!
i had this too -> it's going
Somehow, it's not really using my GPU like it should though...
it deferred some tstuff to ram
I should give TRT another chance
it's going to be slow from now on -> but it will keep running
because I can use lora in txt2img, then TRT + raw model in img2img to upscale it
you should... the speed is just insane... it just comes with so many restrictions 😦
I liked the speed but I was put off by lora and TRT switching
I think TRT engine building is handled on CPU because of the very high RAM requirements. Unrealistic for most consumers to use it if it had to happen in VRAM.
but my current upscaling for these recent images has been to img2img without lora so they don't blur the image as much
so if I can upscale those faster, win
it tries some of the things in vram, but falls back to ram if you run out mildly
before
this is a songmaster fail. DENIED
Well it's done. That was a lot quicker than I though it would be, even if it's only 576x1080.
Uh
What the heck did I just say? Lol 1024
got a nice upscale engine now?
I think it's firefox that hates clipboard
It was a starting resolution. 
good fucking luck on the final resolution
It's a minimum for 16x9.
if that one already crapped out your vram
Well... at least I have 64gb of RAM.
I've been able to run a 65b LLM, albeit slowly, heh.
i feel this is ajust a tiny bit unrealsitic, i mean, those are 40kg of pauldrons...
I need to get back into local llms
totally unrealistic
Gradio has lots of problems in non-Chrome browsers. I've even seen plenty of problems on chromium browsers.
haha
Haven't rly tried myself but I know that firefox clipboard doesn't include image data
wym? I regularly paste images from clipboard into discord
and from firefox -> r-click copy -> back into discord
I used hours in work project to get paste into firefox working but file data didn't exists anywhere
-> copies the actual png/jpg to your clipboard
yeah sometimes converts file types on me
lower left ❤️
full prompt for the variations, or just the lower left one? XD
Left bot
BREAK Where we go, what we find, what we will leave behind is. How we choose to define the journey within our mind Where we go, what we find, what we will leave behind is How we choose to define the journey within our mind```
And now it goes to never used prompts list
it certainly hit my songmmaster list ^^
songmaster goes very well with prompt alternation
[your actual prompt | songmaster bullshit]
gives some very nice flavour
and, i get to listen to new songs too!
songmaster fail, nice crow tho
one of these prompt parts might have a very strong effect...
the question is ... WHY is she blushing that hard...
she just realized I was following her
https://civitai.com/models/110244?modelVersionId=118849
It'll look pretty different on most other models. Probably my favorite LoRA.
tbh, I haven't bothered to set up adetailer in comfy so I'm not sure what it's supposed to look like.
this is fine,right
We love the hair for both the women. 😊
Photographed mid-blink
So how has the SD scene progressed while I was gone for a few months?
aside from new models and loras, there's a couple of new extensions that claim to improve quality (FreeU mainly), and TensorRT came out which can give a big speed boost if you don't mind dealing with a lot of downsides.
what downsides?
You have to build an "engine" which is an ONNX model that's only compatible with pre-specified resolutions and prompt lengths so you probably need several for each model, lora, resolution, and prompt length combination you want to use and realistically 2 for each of those combinations if you want to use Hires Fix. If you want to use LoRA you have to build them into the model at the weight you want to use them. The possible resolutions are much more restrictive than using Pytorch now days. It takes a lot of ram to build the "engines". Each "engine" is almost 2GB for 1.5 based models. It doesn't work with a lot of really useful extensions at all. And it's a pain to install.
But hey, you can get almost double the generation speed on a 4090 (increases are less the slower your card is)
aka ... first release SD hassles for next gen cards 😄
ah seems like a lot more hassle than a passing amateur like myself could deal with
thanks for the information
This extension is kinda neat though. https://github.com/ljleb/sd-webui-freeu.git
no idea, how to use it 😄
me neither I just turned it on and used some settings someone else shared for a while 
I'm rather curious about stable audio and any AI speech modulators/t2speech that are open source
though I guess having FL Studio and a midi keyboard of good quality I could just make my own sounds
and a ton of plugins of virtual synths
I just need to get autotune or whatever is the best one now
synplant or something is getting traction recently
50% Off Music Production Course With Code LSE - https://quickstartmusic.com
Check out Synplant2 Here: https://soniccharge.com/synplant
does that work on voices?
I mean I want to not only generate music, but I want to generate speech, or experiment with generating singing
so I need something that does speech
it is a more advanced Vocaloid synth so .. i guess it would do speech (singing)
as far as i understand it samples the audio input and breaks it all the way down to the tones that is needed to make that sound
free trial / $149
cause I was asking about open source
something that'd be free
https://www.bespokesynth.com maybe?
I think you're misunderstanding what I'm asking for here. I'm asking for human voice speech generation
in open source
cause the paid options are real expensive
not sure any are released publicly yet... meta was planning to i believe
hmmm...I thought there were at least one or two that I don't remember the name of. That deal in both text2speech and speech2speech
is kinda meh though
like RVC?
I dunno
just search rvc on youtube and you will see it can also do TTS
but can it do speech2speech?
rvc-v2 yea is pretty good
yea thats where its better
this webui does tts with rvc https://github.com/litagin02/rvc-tts-webui
can it train voices too somewhere?
yes it can train
nice
thanks
https://github.com/WadRex/RVCompact looking at this one now
Im guessing RVC V2 refers to the model
with the interface still being the interface
yes
https://github.com/Mangio621/Mangio-RVC-Fork this might be better
Mangio is decent yea
i use this mostly but my pc can barely handle it in realtime: https://github.com/w-okada/voice-changer
a 20xx+ graphics card definitely recommended
does it exist in English?
yea.is english too
cool, thanks again
from fire to water
the miku hatsune
g' night all
@untold glacier hello 
evenin'

gmorning

2 minutes render and have freeu, or 1 minute render, and not have freeu
i'm very inclined to say -> freeu quality upgrade way better than tensorrt speed
The first image is better. Is that with FreeU?
yeah
I just haven't seen a marked improvement with Anime models.
... it's not 😛
lets make it a bit more fair comparison, to compensate for the tensorrt speed, i'll halve the steps freeu gets
if it's still better quality, i'm sold
Let me rephrase. I haven't seen an improvement anything like the images you show up there, with the anime models that I've used.
i suppose freeu stabilized my stupid prompts a lot 😮
Have you tried it with different models?
just downloaded it, so taking it for a spin

have only been using brightprotonuke for like a month now
serves me for all purposes
Guess I should try with DarkAlfa to see if that's any different. I've only tried it on Booru-based anime models thus far. It works better on Aurora than BreakdomainAnime, but it's not much and it's not better most of the time. Or even half the time. 
your milage may vary ^^
eface 
tensorrt (no freeu) 1:20 VS freeu low steps 1:24 VS freeu high steps 2:14
i feel freeu gets a LOT of the smaller details right
Hmm. The implement I'm using to try FreeU doesn't have step options.
That could be the difference.
nah, just sampling steps
to make up for tensorrt's speed, i just halved the amount of steps to get comparable speed
Interestingly, there's a large difference in FreeU vs not with my gens. It's not subtle changes.
Some of them almost look like completely different seeds.
yeah, had that happen a couple of times too
the interpretation just being vastly different, however, better adhering to prompt imho
For instance: No FreeU
FreeU
With DarkAlfa.
Certainly harder to tell if it's better or not when the entire image changes, lol.
same seeds?
Yep. No variation.
wow, some are completely different

non vs freeu also different outputs
I dunno. Her torso didn't turn into a head covering up half the screen lol. She's still in the same position, head turned left with her eyes closed. Backless dress, etc. 
yeah, i suppose fair
And actual info here: https://github.com/ChenyangSi/FreeU
sampler extension that seems to improve output quality
That's what they claim. But I haven't seen it with any of my gens yet 😛
i see it tho
I know what you're saying. I've been looking for it myself.
I haven't been using any 
goooooooood
ahhh ok
It's PART OF THE TESTING. 
but i love whacky output nonetheless ❤️
Pon YOOOOOO
Lemme ask ya, what's your settings? Maybe that's my problem.
Maybe I'm crazy... but I'm already seeing an improvement from the recent gens.
I'm probably crazy.
nope, it's certainly improvement
Well, there are aspects of that pic that I like more in the first image, but I get what you're saying. I'm just wondering if it's a model-specific thing; where it won't really improve anime models. But I'm still testing.
cd tuner at 0 for details
it missed a part of the prompt in the first image, flat color, so i feel it adheres a lot better
Ah, I getcha.
and the shadow seems a bit better thought out
and again, better resolved details on the grass
So far after the adjustments to get it (I think) closer to your settings I'm liking things more. It's difficult to know if the settings were changed correctly though, as my interface is different, heh.
i wonder why
cd tuner at 0.5
i'm using this one https://github.com/ljleb/sd-webui-freeu
hmm not much diff
fixed pole in background, better spacing on wall left, hair ornament less blurry, same for arm, actual crease in the skirt
not a lot, but all these small details get resolved better
Lol
now if only tensorrt could work with extension 😢
tries it with adetailer , hi-res fix and few other setting change, cd tuner 1
I dunno man. Look at the presentation, the cohesion of the hands, and the less-baked nature:
No FreeU.
FreeU
It could be that the prompt is bad, and so the FreeU version is getting closer to a bad prompt. I'll have to try a different one.
grinz evilly and hits a button
and a few more, changes prompt.. and waits
blinks
blinks
BLINKS
@untold glacier what are some good anime models
one that makes good, not weirdly shaped bodies 
MOOO
If you want Booru-tag prompting, Aurora. If you want more natural language prompting, DarkAlfa.
https://civitai.com/models/40199/aurora
https://civitai.com/models/64843?modelVersionId=74900
ree
I don't have those
I can't download them for this service 
a LOT of regional prompting and dumb stuff going on (as always with my prompts) freeu hard winning this one
I'm giving you 16 comparisons at a time, and you're giving me one back lol.
I just don't see it yet.
because that takes 8 minutes
Fair
my pc is burning non-stop
She is strong.


straight from the source
MOO
oh I had the seed set. .DOH!
i wanna eat that peach

it's called addiction
When you plan for things to get darker, it's a bit better to work with. I'm still not convinced though... I'll have to give it time and letting the community figure out what settings are best so I can compare then and with more models.
I'm heading off for the night. Goodnight peeps! 
sleep well
I SHOULD be in bed but... sigh
@untold glacier here's a grid for ya ❤️
basically, i feel coherence and small details are improved pretty much
seed 8675309
Yep. A definite improvement with nearly all.
Might be the method of prompting. I'm using mostly exclusively booru-tags with mine.
booru tags are very restrictive tho
Restrictive, but focused with the proper booru-prompting anime models like Aurora.
i can't work with them, i need my models to be free 😛
I just use RANDOM Things 😄
Hence darkalfa. And things actually looked better using it with that. With the brief test I did anyway.
using booru tags on darkalfa? 😮
Shhhhh
NOOOOOOOO WHAT HAVE THEY DONE TO YOU MY BABY
umm--> bedtime hoodie, Yuki Miku, HUGW bobble head, unreal engine render , 8k, 3.5D
DOH!
where is it i dont see it 
Yep. Definitely better following the prompt and fixing issues with DarkAlfa, at least with the small amount of additional testing here.
Ok. I really am going to bed now. G'night peeps.
yeah, fixed up a lot of small things, doesn't it?
gnight @untold glacier

why does it look like she is sneaking?
She doesn't want anyone to notice that she is a fox
hre ya go
I find it funny how detailed SD makes armpits
x3
heh
AI secretly abbreviates to armpit illustrator
Scary
types armpit in the prompt
👍
arms behind head, sleeveless
(giant,massive,gigantic,gargantuan armpit:4.5)
umm.. wht?
@native halo rate this moon
i love that massive moon 
Very important pose
what a friendly creature he even created a bonfire so we can rest🤠
👍
beautiful round perfect circunference moon 
👈










