#Midwinter - All Purpose Anime Screencap Model / LoRA
386 messages · Page 1 of 1 (latest)
I've collected the Dataset, Tagged the Images etc.
Total Dataset Images are : 1795
Total Repeats : 10 - 20
4 Different Datasets are Included
which are :
- Ufotable Dataset
- Kyoto Dataset
- Jamex Hentai Stash
- Eufonuiz Style ( Similar to Screencap Style )
Settings I will use for the First Train are :
Batch Size : 6
Repeats : 10
Optimizer : DAdaptation
Learning Rate : 1
Scheduler : constant_with_warmup
warmup steps : 130
Max Train Steps : 3000
Base Model : Anime_Full_Pruned
Idk if its needed but I have an entire list of screencap artists if u need it
I mean I will include more style in here.
This is the first test run for it so I'm just testing the waters.
Just hit me up if u need material, im loaded 👍
for sure

@odd jetty would it really be a good idea to train a LoCon with like Thousands of Images ?
hrmm Im not too sure on that
Most I've gotten up to is 9
But same style. but with 9 characters
oh well
let's see how this goes

@odd jetty Also what would be better in this case
10 Epochs or Batch Size 6 ?
you asking locon, lora, or pur dreambooth
oh
Uhh depends on size
For my 9 charas (50 images each, aesthetic filtered and tagged with 5 repeats) I believe at epoch 4 it is OK.
Another locon I had used around 230 images for 1 style. that took 5 repeats and by the end of epoch 5 it is good enough.
TBH its really a toss up
I think with 200 images on 1 style with 5 repeats per epoch should fit the style in 4/5 epoches
each style has
gotta add Im using dadapt so YMMV.
400 images
hrmm maybe 3 repeats? save ckpt at each epoch for 5 total epoches.
Midwinter - All Purpose Anime Screencap Model / LoRA
that's about 26000 steps at Batch Size 1
Okay, so let's see
I'd prefer more epiches since you can test more frequently and select which best epoch. but bigger batch size probably fits faster (? Untested)
1795 Images * 2 Repeats = 3590.
Then Multiplied by 5 epochs
17950 total steps
then on Batch Size 2
9000 steps
🤔 seems about right
Will save an epoch at around 1800 steps
and since it's batch size 2 it should be fast
alright I guess I'll go with this for the first train.
I dont colab either lol
I try to keep my files local
Gonna send a few OOC Screenshots for your viewing pleasure.
Part 2:
Part 3:
That's All I think... @terse bone pick whicn one your like and I'll say the anime name
That's cool but what for ?
you have Datasets for all of these ?
So I just listed whichever ones look intresting

I have the anime as mkv's so I can wand my way to a dataset
like I said for now I'm mainly doing testing out settings.
but Thank you
Ah ok
will defo want these later down the line
np
popcorn time
oh
ive got 11576 anime screenshots all upscaled to 4k if you want
subtitle free
pm if interested
yikes
IMO keep anime at native res
or at least when screenshotted
like 1080p
Since upscaling introduces additional artifacts
your 3rd screenshot shows some rining issues coming from upscaling
I preseume it has been sharpened as well
did you also resample to like 60fps
only for panning scenes. looks like shit otherwise
Yikes
the lines are too thin and might not survive conversion back to 1080p or 1024px sooo
oh well. there if you want it. use them in my wallpaper rotation but they are just sitting there otherwise
although i do have 322 from this season that hve yet to be upscaled and are still at 1080p
buuuuuuuuuuuut then again i use a lot of custom filters for mpv
warpsharp, esrgan, etc
realcugan for upscaling, esrgan is... not a favorite
and i use glsl-shaders in mpv, not anything gan related for playback
also debanding, scale, cscale
the normal shit
is this not good enough? https://civitai.com/models/4982/anime-screencap-style-lora
it's not that it's not good, I'm just making a Multi Purpose LoRA with Lots of different Studios.
oh I see
Final Test version done @odd jetty
nice
...doesn't feel much diff
This is all Midwinter Model ( Custom Merge so Untrustworthy )
I'm gonna try with Anything V4.5
@odd jetty
It's having trouble doing the part of dataset that looks like actual screencaps even though that part has the most images.
Should I increase the DIM or the repeats in the next version to make it better ?
LMAO

Okay, I increased repeats and am now training a Model instead of a LoRA.
since I need a good Base Model for this.
I increase the Repeats from 2 repeats to 10 repeats
Me when i see duplicate loras made everyday
Its not enough to have one popular one that works generally well we gotta have 5 other people make their own
I just said that the purpose of this was to have a Single LoRA that has alot of different Styles. 
well, they're making a totally different thing. It's like my Makoto Shinkai multistyle lora
except with more styles
in this case yeah, im excluding this one from what im talking about
repeats
You should add a tag inside the caption telling what style it is
Got told by derrian that the folder name doesn't matter (except the repeat count)
@terse bone
Sorry I have Touhou AI server mited. Just peeked over.
yeah I've added those
For Ufotable I added "ufotable style, ufocoloring"
probably also do keep tokens
For Kyoani I did "kyoani style, kyoani coloring"
for the Hentai Stash I did "jamexx style, coomer style"
I did
keep tokens is at "2"
which is the amount of custom tokens I added.
but it wasn't getting the style in the Hentai / Eufonuiz Folder at all.
even though it got Kyoani and Ufotable almost perfectly.
I'm thinking I increase the repeat by 1 on each of those folders.
Balance the no of repeats with the no of file Ig
coomer style lmao
i have 3.1k images from takt op destiny lol
@terse bone btw
Recommendation for Tle is 0
Since it's an artstyle
it's currently at 11,500 steps for a LoCon.
Loha is better by the way
I don't think I can add more than double of what I already have.
For Artstyles in general
try maybe 3e-4 lr
tle 0
and dims 8/4 for network
for conv i need more knowledge
maybe 0.3/16?
3 batch size
prompt tags won't be effective, no?
you should probably make a ckpt
or X loras and supermerge them into a ckpt
I tried making a Ckpt.
but the Optimizer Args were not working for the ckpt Colab and I didn't know how to fix it so I got frustrated and quit.
I will do it eventually.
https://huggingface.co/OedoSoldier/animix
potential base model
You could also ask the Myne Factory guys. They're training a their own model. They also have high quality datasets.
Okay I took a break yesterday so my bad.
I just tested the checkpoint I had trained the day before yesterday.
it seems to work just fine.
Ufotable Style. ( No Anime Screencap Tags )
Gens are with Aurora
Will be more screencap like with other models.
Conclusion, This needs a CKPT to be at it's full potential.
a ckpt would also be very useful for mixes
I dont see much difference with jamexx and eufoniuz
Besides, eufoniuz just straight mimicks how the anime looks, at least facially speaking. So it wouldnt settle on any particular look
Yeah, that's the point
I want to make a base model that can capture that look
without having Influence from other models
Because right now, it barely looks like the images in the dataset.
I could also try retraining at 128 dim Idk
let me know your thoughts @clear sluice
for multiple styles I'd go 256 or higher. You can always reduce the size after training.
Anyway a ckpt will always work better because some styles require conv layers to be overlapped on the same latent space
lora latent space is simply too small to accomodate too many styles at dims lower than 128
I contributed the datesets i realized
I'll be including a lot more Datasets as well, I just need to find what works best.
so are you going to do checkpoint or a lora
Checkpoint
going to start training now.
Started Training
Total Steps = 14150
Reconfigured for Batch Size 1
Now total steps are :
28 300 steps
nicceee
Hopefully CKPT can capture Style Better.
and work as a base model for further training.
add makoto shinkai
rip
I summon thee @hasty jackal
I was using DAdaptation at first
I didn't know 
oh honestly I don't know if dadapation works with finetuning either
I was just curious since I've only seen people use it with loras so I wasn't sure if I was misunderstanding or not
hmm
Yeah, I switched to AdamW8bit and it's working fine now.
Although I am pretty sure DAdaptation would work, but Colab doesn't have enough VRAM to do so.

yeah that's true
if you end up having more issues you can turn on gradient checkpointing I suppose
(if that's an option)
not sure if colab has enough ram for gradient checkpointing or not 
16 GB
that's what colab has
Yeah, I'm just going to hope it turns out fine with AdamW8bit at default settings.
since it's a finetune, it shouldn't need much tweaking with the settings
if you need any more help let me know
ofc, thanks for input
💗
Wait wat memory issues does providing a sample prompt do
yeah it wasn't the problem
it was using DAdaptation.
it's training pretty well judging by the samples
bruh
let that sink in
idk maybe @violet fulcrum could train
nah
it was going well
at around 5000 - 8000 steps
will just make it less.
I'm just going to start training a 256 dim LoCon and while it trains I go to bed.
I do not have enough energy for this rn.

I have never trained a ckpt
Neither have I, but here I am.

honestly, this could take me a month
or less to finish completely.
with my current computing power
for locon loha
if only andite were still here
Breh
his ghost is still there
2nd Locon Retry Results :
LoCon was trained at 256 DIM using DAdaptation.
- Ufotable Style : Is good, represents Dataset well and is flexible,
- Kyoani Style : Is good, represents Dataset although anatomy may vary slightly.
- Eufonuiz Style : Is better compared to previous trains, represents coloring well and has somewhat flat shading.
- Jamex Style ( Hentai Dataset ) : Despite having the most data and repeats ( more than 500 images and 5 repeats ), results stilll do not match dataset and Models fail to represent the style.
Conclusion : Will train the style seperately from the other dataset to see if the results presist and if there is any improvement in representation of the dataset.
Previously trained CKPT represented Dataset much better than the LoRA / LoCon Method, will investigate further down the line and experiment more with finetuning in the future.
Just gonna drop this here in case anyone want's to test.
@odd jetty
SD 1.5 and any other model?
It's trained on NAI and it's pretty flexible
KyoAni and Ufotable are prominent when using.
what's the trigger word/words?
while Eufonuiz and Jamex are not
there are alot
wait
@odd jetty
- For Jamex Style : "jamex style, jamexx style, coomer style"
- For Eufonuiz : "eufonuiz style, coomerv2 style"
- For Kyoani : "kyoani style, kyoani coloring, kyoani screencap"
- For Ufotable : "ufotable style, ufocoloring, ufoscreencap"
@terse bone probably Silicon29 doesn't like it
Works better on AOM3 Though
I didn't request for glasses but got one anyway lol
oh
that's ptobably my fuckup tho
Wheres
The screenshottiness
its called mid for a reason
by the way
hires does difference
try AuroraONE
It's most flexible
How can you go THIS wrong ?
With Ufotable ?
You probably did something wrong.
Ufotable is the one of the style it does good
I'll get get to it tmrw.
Rn I sleep

Idk
I messed up
I switched it with a file with a very similar name
its still a mid style

I have exams this week
Any further progress is delayed as I focus on those.
Relatable
Okay I'm semi-somewhat back to working on this.
I've found the LoRAs trained by my method using Dadaptation don't translate well to other models when not using my own merges.
So I'm going to start testing again but with AdamW8bit and Lion settings this time
Also, going to reduce the amount of images in each dataset so to reduce the training times.
Because right now it's like 450 images per dataset.
Which is already too many.
Also, I'm going to be deleting the ( almost 1250 images ) of just straight up porn from the dataset.
So anyone who has some anime Datasets they would like to share instead would be appreciated.
💗
I will put you in credits if this ever gets finished.
Alright
You remind me
Let me get it
Wait
My trigun stampede dataset+700 images
My trigger studio dataset+1700 images
https://drive.google.com/file/d/17DlA1_J3jOFgh-0onG4Fe9GQzvnZkvoJ/view?usp=drivesdk
My takt op destiny dataset (madhouse & mappa with ufotable theme artstyle)
+3k images
@terse bone
If you want my NotSHAFT dataset, feel free to yoink
Actually let me pass it to you like later in the evening
Thanks ya'll 💗 , for now I'm only going to be trying out 200-250 images per dataset.
will cap at 150 ( quality picked ) images if the steps get too high.
Id contribute but all I have are names and trained loras of screencap artists
Well actually i might have one dataset lying around but others are already putting forth their stuff so its probably not needed
Style bleeding might be unavoidable when mixing too many styles. Not sure if text encoder conditioning is enough to bring one clean style out of the latent space
I mean something I could do is make Multiple Screencap Styles as different LoRA's.
but then it isn't
an AIO ( All in one ) LoRA.
Already doing that 😄
That was my plan
@sonic elbow
It isn't here
I never made any of the 15 Midwinter models public
OP left?
Nono some people like to leave and rejoin later
He should be back in a few months
Silvelter is another such person
oh damn, any update on this? i need a nice screencap style
@terse bone





