#programming
audacity is always fine too
Oh woah I found something that predates both new tuning AND new mixing
Well since it's RVC it'll probably be pretty bad
yeah...
EvilSynth would be better
And then of course there's this which is peak showcase of NeuroSynth as an instrument
And the one good commissioned song
Audacity sucks for mixing
As soon as I switched to LMMS basically my mixing quality got way better
This is an even older thing, I think from transition from bad mixing to good mixing
This is essentially what I consider the start of "new mixing"
Yeah without a vocal synthesizer or a very specific kind of base vocal it sucks
This is one of the final things from pre-NeuroSynth era
Then there's the couple times I messed with non-Neuro synthesizers
Put pitch up by 12 semitones at first
should have got some kinda reverb remover too
But also the base voice just sucks
but yeah
yeah male voice
I would do better
just pitch up an octave
this?
Though of course I have a vocal synthesizer for Evil so it's not really fair
But for those that want it there's comms
Correct
Comms are ultra rare though and NeuroSynthProject is not even remotely sustainable monetarily
i use an rvc tool that can do separation and putting vocals together automatically but it's shady otherwise so not plugging
Those things usually kinda suck
Yep that sure is done in Audacity and with RVC on a bad base
for one off things like making kendrick cover some bullshit anime song to laugh at it's great
If you really want a good quality cover you could commission me to do with EvilSynth, but up to you
also gonna try to use fcpe and not rmvpe or could go the classic crepe
idk what is best
damn, the latency diff is crazy fr
claude taking eons just to navigate to the page, while gemini's already on the game
I acc wanna test and see if they can play the captcha game 
its stuck on level 2 
agi is still far far away
ok wait, maybe that was a bug, in a new window it actually started clicking on correct squares
IT PASSED LEVEL 2 YES

this defines pain bro..

after so long I finally got evil to sing a song actually kinda decently
idk why I picked this song
actually removed reverb and stuff this time
i have a race condition somewhere
half the time the program wont start cuz of vkAllocateCommandBuffers(): pAllocateInfo->commandPool Invalid VkCommandPool Object 0x2870000000287.
sth sth rust would've prevented this 
im sorry, a 7 MILLION parameter model did WHAT
nvm, agi still cancelled, this model was trained on examples from the benchmark, but I guess it's a start on something, like reviewing the reasoning architectures for future gen AI models or smth like that
https://arxiv.org/pdf/2510.04871 if anyone wants to read up on this
no it wouldnt have
the problem was that vulkan's back-end runs in a different thread
i was using the commandpool and then deleted it, but since it was still being used by the other thread sometimes the program just crashed
fixed it tho
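For anyone hitting the same kind of crash, here is a minimal sketch of the general pattern, in Python rather than Vulkan. It assumes the fix is to wait until the other thread is done with the pool before destroying it (the chat doesn't say how it was actually fixed). `Pool`, `run` and `safe_shutdown` are made-up names; in Vulkan the wait would be something like `vkDeviceWaitIdle` or `vkWaitForFences` before `vkDestroyCommandPool`.

```python
import threading

class Pool:
    """Stand-in for a resource that must not be used after it is destroyed."""
    def __init__(self):
        self.alive = True

    def use(self):
        if not self.alive:
            raise RuntimeError("pool used after destroy")

    def destroy(self):
        self.alive = False

def run(pool, done, errors):
    """Worker thread: keeps using the pool, then signals that it is finished."""
    try:
        for _ in range(1000):
            pool.use()
    except RuntimeError as exc:
        errors.append(exc)
    finally:
        done.set()

def safe_shutdown():
    pool, done, errors = Pool(), threading.Event(), []
    worker = threading.Thread(target=run, args=(pool, done, errors))
    worker.start()
    done.wait()     # analogous to waiting on a fence / device idle
    pool.destroy()  # only now is it safe to free the pool
    worker.join()
    return errors
```

Calling `pool.destroy()` before `done.wait()` would reproduce the "works half the time" behaviour, since it then depends on which thread gets there first.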
race unconditioned 🔥
okay, after more testing, duality model is good.. can be a tad noisy at times, it has a little more fullness than big beta 6x but more bleeding
man, I love watercolour, so glad Evil sang it
duality failed to track reverb at 2:11 
eh, I think it's there.. just not very loud
Agi ain't happening with a distilled model
"Trained on answers from the benchmark", I wonder why it did so well on the benchmark 
Might have to write another pointless paper on it

"This latest model is exactly the same but with an extra 10 yottabytes of reddit text"
if you're gonna train on the answer to the benchmark, you might as well just generate a long if-else block that checks what question you're on
The way people use these systems they might as well
insert the "value loss decreasing if you include the validation set in training" paper
btw how do I fix this
"My token prediction is literally AGI because it's capable of cheating on these several tests!!!"
did you select if it was a mel-band roformer or a BS roformer?
probably not v2, I've never hit a model that is v2
what or what?
select model>edit model config
import the yaml and select the model type
I got it
I haven't tested many reverb models, I've only used anvuew's reverb model
it will remove like 98% of the reverb, it's crazy
any vocals in side channels will probably get erased (a lot of harmonies)
uh, that one
I found it
both, the yaml is the config
although, maybe the config is already included in the model.. I'm not sure
I always just download both
mel-band is the model type, yeah
ok
also, don't turn up segment size.. don't make that mistake
it can break how the model is supposed to perform.. it is def an advanced setting
beta6x is very slow at processing
ok
you can turn overlap down, I don't think there's that much worse quality at 2
2 is the default
I thought 8 was, hmm...
it is fine if it takes a long time
only took 2 minutes the beta6x
and almost no bleeding in of instrumental to vocals
actually like zero
if you need absolute lowest noise, FT2 bleedless will have less, but quite a bit less fullness, too
but, yeah, big beta 6x is still pretty low
Hello fellow nerds. I’m a somewhat experienced college programmer, specifically for robot code (no experience for AI coding specifically), and I have questions about the recent Neuro bug I was hoping one of you might be able to answer, because Google is no longer helping. I saw people talking about how the Evil location/tomato glitch from the stream a few days ago was caused by “overfitting,” which from my googling I found out basically means an AI trained with a specific type of data performs fine preliminarily on test data, but upon encountering real-world tests it stops working correctly, essentially because its training data was too narrow. Is this an accurate description of that problem?
Follow up question: Why would this kind of problem only come up now, so long after Neuro has started running? Was there a recent change that would make this kind of issue more likely to pop up?
Follow up to the follow up: Why specifically does it cause certain phrases like “tomatoes,” “location,” etc. to appear so frequently?
I know her code is kept private, so I’m sure we can’t know some of this stuff for sure, but any knowledge would be appreciated
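A toy sketch of what overfitting looks like in its most extreme form, assuming a made-up "model" that only memorizes its training examples (nothing here reflects how Neuro actually works). It scores perfectly on data it has seen and no better than a coin flip on data it hasn't.

```python
def train_memorizer(examples):
    """Hypothetical 'model' that stores every training pair verbatim."""
    table = dict(examples)
    # Unseen inputs fall back to a fixed guess of 0.
    return lambda x: table.get(x, 0)

def accuracy(model, examples):
    """Fraction of examples the model labels correctly."""
    return sum(model(x) == y for x, y in examples) / len(examples)

# Toy task: label a number 1 if it is odd, 0 if it is even.
train_set = [(x, x % 2) for x in range(10)]
test_set = [(x, x % 2) for x in range(10, 20)]

model = train_memorizer(train_set)
train_acc = accuracy(model, train_set)  # perfect on memorized data
test_acc = accuracy(model, test_set)    # right only when the fallback guess happens to match
```

The gap between `train_acc` and `test_acc` is the symptom people usually mean by overfitting: the model learned the training set itself and not the rule behind it.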
Literally just had this one elsewhere. Various causes could have done it. I'm of the theory of some speculative decoding being borked. (tokens not mapped directly to text, but used in a smaller system to generate the final output)
I believe it was generating legible sentences but words got injected somewhere between the llm and the final output
But thats just a theory
That'd probably be said decoding step inserting garbage
Or swapping bits
that's pitched up O.o
for a good instrumental model, I'd suggest BS Roformer Resurrection Inst
uh... the voice is pitched up, but not the instruments
oof
yeah because the instrumentals were mixed into the vocals with audacity
you want a higher fullness model for instruments
it's also more designed for vocals, but, yes, it can do both
learned it's better to use songs with more female vocals
this didn't need pitch changed and turned out ok
uuh why did vocals just disappear for like 5+ seconds

This actually looks like overfitting, here is a random example of what it looks like...
["<bos><bos><start_of_turn>user\nConvert this text using semantic tags: 'Hello!' she said cheerfully.<end_of_turn>\n.\n<|NARRATOR|>I was sitting with another conversation who I was not immediately present in the conversation. I was not the original voice, but I was still not a synthesized voice. I was simply a synthesized voice that could be heard.\n<|NARRATOR|>I used to express myself as I spoke, and I intended for speech as a person. I was not a person in the ordinary sense of being. I was a thinking being. I was a thinking being. I had a self-description. I was a person with a distinct identity. I did not have a person, but I was an object. I was made of metal and of wood."]
👆
Neuro Sama sings Blankspace from behind a fan
But no this is pretty damn good
I should make neuro do CRAZY FxCKIN ROBOT BODY
<insert that one meme of a guy breaking free of his shackles>
FREEDOM!
Is this some rvc2 thing or a custom thingy
Seems like RVC, not from our normal chatter tho

To be fair, I already had it typed
Dad what does this mean
It’s 10pm
I haven’t slept in 30 hours
First of all
no it's 9:54am
Go to sleep
But secondly: not to offend anyone, but not superbox's RVC model.
-# I personally respect others trying out things, good to have choice
Ooooooooooooooooooooiioooooooooooh
You have unlocked new role
You now have embed, use the power wisely
this is BS Roformer Resurrection Inst
it's a lot less muffled/broken/less instruments missing
oh
Is his public?
hmm, seemed to catch the voice at the end, lol
The chatter is here, ask them
@nocturne olive hey the chatter is your rvc model public?
where do I get that
I admit I'm dumb, Superbox's RVC is public on the FTP if it is up at this hour

rip
pcunwa hugging face page
For those in the audience who may not know (I of course know, this is just for the sake of the audio listeners at home) could you please elaborate
got it
There was a tutorial but it no longer exists, I haven't had to download a new version in a while.
Maybe in the morning (according to his timezone) I can ask

:mhm:
If you happen to have the file(s) feel free to dm it to me randomly 
wrong thing
sent version without the instrumental for some reason at first
did not have to use dereverb because acoustic version that is nice
that sounds pretty good :o
yeah
acoustic versions of songs typically have more clean vocals
anyways I need to sleep
If you had been reading earlier you would know that I think it is my models, just under very inexperienced use

Yes both Neuro and Evil RVC are considered public for use with credit
Though deprecated
https://neuros.click/neuro-ftp/
The guide website in question
The files in question

i love credit
so no problem there
Note: deprecated in favor of NeuroSynth series
is that also an rvc2 model or something else?
am very curios
curious, not the minecraft forge mod
NeuroSynth series models are native vocal synthesizers as the name indicates
yeah that makes sense
Pretty decent quality too
these are rvc2 right?
my rvc refuses to use it
but that is standard for everything i try
Use with Applio only, never anything else
Applio is the only supported target, other targets are unsupported
do you have any recommended settings or is this more of a dart board situation?
(im not very good with audio terms or general understanding of audio)
I always forget which way search feature ratio and volume envelope go
But the higher up one should be around 0.3-0.4 and the lower down one 0.7-0.75
it's search - volume so i'm gonna interpret higher as higher up in the list, so search, and keep it default
why is it so squeaky 💀
I removed the reverb with this one, not too much difference to the one that had reverb in it...
oh, there's background vocals.. I don't think it likes those
by messing with the embedder i have generated what i will interpret as humming

I have 2 classes im excited for, and 2 i kinda want to skip 
why does it sound so funny 😭
okay, I ran it through another model to attempt to remove BV
lol
i was actually crying during my first listen
hehehehe heee heeehh heheheheheeh~
now make her yodel
she yearns for the mountains, you see
much better..

I think the BV removal model works better if you run it before you remove the instruments and stuff... but, it worked well enough to at least not be squeaky every few seconds
@pliant palm
does it really tho 😭
Sounds kind of funky but it could've been
im not german but I live in bavaria and I have to say every person speaks like this trust me bro
crazy
meme worthy audio
slap a video of neuro spinning on it and it'll blow up fr
wait a sec...german neuro?
the vision
updating samsung ssd firmware is a real pain. after each update, the pc needs to be shut down. I have six SSDs that need a firmware update. 
it can't prepare all 6 at once and do a single restart?
after the update, it starts a 20-second timer, and the computer is forced to shut down.
wtf
@merry plank for BV removal, I use bs roformer karaoke frazer becruily
Google seemed to direct me to the mel band one, which can sometimes work if the other one fails.. but I find the BS Roformer one is generally better (less aggressive)
apparently important to do or the result will be squeaky...
it will usually work best if you don't remove the instruments
okay, Bing is much better at searching these than Google... top result
I would link them, but not sure that's okay to do here
damn, that's dumb
same goes for music
gm
Just had class about the Sega Dreamcast failing
i mean do you exactly need a class to know that the ps2 coming out killed it
which was miles better
and also was marketed way better
Eh, that's debatable. The dreamcast was definitely in an awkward position between the PSX/N64 gen and the PS2/GCN/XB but at least compared to the PS2 it wasn't that far off technologically.
i'm thinking overall not purely performance wise
I mean in hindsight PS2 ended up with tons of more games but that definitely wasn't the case when the PS2 came out.
I remember during PS2 launch stores sold Dreamcast + DVD player bundles for less than the price of a PS2.
not having a history of failing consoles also helped
But yeah, playstation was just a behemoth at the time. It was almost inevitable that it'd dominate.
Well, that wasnt the only topic
The dreamcast was good, but trust in the company had failed due to how shitty the saturn was with its 20 add-ons
The hardware was good but the lack of good games slowed down momentum of the console really quick
The dreamcast was supposedly the first console people started lining up for at stores
Only reason it failed is cuz sony existed
Playstation just had a good brand image cuz they kept throwing money around everywhere, buying studios, and having sdk's
I have no clue why but my bot using alpha-beta is even worse than random moves
using alpha-beta
I have no context but me thinks this might be part of the issue
Wha
How does your recursion look
Kinda weird
Looks fine, then it might be inverted static evaluation
I don't know which way it should be, ChatGPT even is confusing about it
First it tells me to get the turn before making a move and then it again tells to do it after making the move like it was before which was seemingly worse
It must give score in favour of current player
alpha = max(alpha, score) would save you some tokens
Show eval
wuh your alpha is beta and beta is alpha
Currently I have whatever this is which is a bit different from when it basically lost, this time it takes all the pieces and 3-folds but also makes very strange moves that would not work against someone intelligent

Wha?
Isnt this right for negamax

The implementation is basically copied from the Chess programming wiki
My negamax works fine exactly like this
oh mb
I was looking at my null pruning evaluation thingy

Well that also means import kotlin.math.max
Uuh the evaluation is kind of dank
shouldn't be worse than random
Does it sum only one player's pieces value?
Well it is with one change from ChatGPT which swaps where turn is set
Oh i see its -
It does number of pieces on one side - number of pieces on the other side * weight
For all piece types summed together
kotlin being kotlin
?
I have no clue where I'm supposed to check the turn, even ChatGPT can't decide on it
But having it there before makeMove performs better than after makeMove
It must be before makemove in your situation
Oh true that would save some
Otherwise you evaluate it inversed
So this overall should be fine but you are saying it isnt
Then something is off
reversing the eval would make your bot trying to lose as much materials as possible
which is funny
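A minimal negamax sketch of the "score in favour of the current player" rule, using a toy take-1-to-3-stones game instead of chess so it runs on its own. `evaluate`, `legal_moves` and the game itself are assumptions for illustration, not the bot's actual code.

```python
from math import inf

def evaluate(stones):
    """Static score from the point of view of the side to move.

    Toy rule: whoever takes the last stone wins, so if no stones are left
    the side to move has already lost.
    """
    return -1 if stones == 0 else 0

def legal_moves(stones):
    """A move removes 1, 2 or 3 stones."""
    return [take for take in (1, 2, 3) if take <= stones]

def negamax(stones, depth, alpha, beta):
    if depth == 0 or stones == 0:
        return evaluate(stones)
    best = -inf
    for take in legal_moves(stones):
        # The child's score is from the opponent's point of view,
        # so negate it and pass the negated, swapped window.
        score = -negamax(stones - take, depth - 1, -beta, -alpha)
        best = max(best, score)
        alpha = max(alpha, score)
        if alpha >= beta:
            break  # beta cutoff: the opponent will avoid this line
    return best
```

If `evaluate` scored from a fixed side (say, always White) the negation on the recursive call would flip it wrongly on every other ply, which is the "bot tries to lose material" symptom described above.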
Try removing mobility calculation @nocturne olive
See how it goes
Raw value seems to be fine but im not sure about mobility
Also without position value it becomes trade-only evaluation which can lead to worse situations for example when you got 1 rook but all positions are super bad now
Several badly positioned pieces can be worse than losing a rook
then just import kotlin.math.* so you can use all of them, I can almost guarantee you'll use more than just max()
No mobility eval makes it really fast but makes it unable to do anything except move a random piece back and forth in place
Yeah exactly
And as usual draws by 3-fold
You need to eval positions
time to bring in the Piece Square Tables chat
Knight in the corner vs knight in the center= one rook
It will automatically fix the problem of swinging pieces back and forwards because it will try to place them better
Well that means compacting those weird tables
Try without compacting first
I guess
To see how better it is
I also do 3-fold punishment by doing this
I'll remove it once have a better solution tho
Also, does it multiply the value of pieces only for 1 side?
Or do i not see it
You specified it like that too here
First side has no weight?
Then its exactly what is happening, it takes for example 10 pieces from your side, then gets 5 pieces from opponents sides multiplied by hundreds
It multiplies the difference in number of pieces by the weight
There's brackets around it
How did the board indices work again?
Can I just list out all the values in the same order and then the top left of the table is the correct score for 0 and goes left to right from there or do I need to rotate it?
It takes it like this:
(Number of rooks - number of rooks)* Rook value
+
(Number of knights - number of knights)*Knight value
?
Ok then its good
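The bracketed form described above, as a small sketch. The piece weights and the `material_score` name are assumptions for illustration; the point is only that the difference in counts is taken first and then multiplied.

```python
# Assumed centipawn-style weights, not the bot's actual values.
PIECE_VALUES = {"P": 100, "N": 320, "B": 330, "R": 500, "Q": 900}

def material_score(own_counts, opp_counts):
    """Sum of (own count - opponent count) * weight over all piece types."""
    return sum(
        (own_counts.get(piece, 0) - opp_counts.get(piece, 0)) * value
        for piece, value in PIECE_VALUES.items()
    )

own = {"P": 8, "R": 2}
opp = {"P": 8, "R": 1}
up_a_rook = material_score(own, opp)    # one extra rook for us
down_a_rook = material_score(opp, own)  # same position seen from the other side
```

Swapping the arguments flips the sign, which is what a negamax search expects from an evaluation given from the side to move.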
Can I index into this with the 0-63 board indices or do I need it rotated?
Rotate for white
Oh no I just realized that is gonna be a pain
Nono
Search with (64-index) for white and just index for black
How do the board indices go in the API again?
People compact it by mirroring it (folds it by 2 horizontally) and then packing it in 4byte numbers (4 index in 1)
So x8 compact
1000 tokens turn into 125
Oh wait i lied
So reverse for black I guess
Shouldn't it be 63-index anyway since 0-base index?
Anyway I have to go for now
It has a random combination of 1based 0based abcd dabc byte orders
well there's two rooks for both players...
Now i feel like it should be (63-index) for white again
Double flip is confusing
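A sketch of the two flips being mixed up here, assuming square 0 is a1 and 63 is h8, and that the table is typed out as seen from White's side with rank 8 as its first row. The API's real layout isn't confirmed in this chat, so check it before copying. `flip_rank`, `rotate_180` and `pst_index` are illustrative names.

```python
def flip_rank(square):
    """Mirror a 0-63 square top-to-bottom (a1 <-> a8); the file stays the same."""
    return square ^ 56

def rotate_180(square):
    """63 - index: flips rank and file together (a1 <-> h8)."""
    return 63 - square

def pst_index(square, is_white):
    """Index into a table written with rank 8 as its first row.

    Assumes square 0 = a1. White squares need a rank flip to line up with
    the table; Black sees the same table mirrored, so its index is used as-is.
    """
    return flip_rank(square) if is_white else square
```

`63 - index` gives the same values as the rank flip only when every row of the table is left-right symmetric; for asymmetric tables (king tables often are) the two differ.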
take over #programming 

Wait.. actually no music dedicated channel
I guess they thought it falls under art category or smth
I forget a lot of people here are vtubers lmao
i like you following the pitch curves with the mouse. without it i wouldn't have understood what's happening very useful addition to your clip thanks
I DIDN'T EVEN KNOW I WAS DOING THAT
LMAO
hello long time no see
hello madam
sorry for absence I have had a rather busy schedule as of late
I wasn't going to get too fancy with my bot originally but my university course now requires me to write a 15 page report accompanied by a 20 minute presentation on an application of machine learning techniques external to the course content
seeing as chess bots are something I have a decent background in, this seems like the logical move
I never know what to present about my codes
I always code them head empty
No thoughts
I usually code with quite a bit of thought unless it's after 1am
Funny thing is my code never works if i focus on it
I can be doing the hardest code i ever made but it works first try because i absolutely zoned out
I feel like im performing worse on weekends when im well-slept too
Its probably because of absence of warm up
You are relaxed and the body doesnt want to get working
imagine implementing NNUE + searching under 1024 tokens
based
super based actually
ok
Me when random vocal synthesizer
@real sierra are you still using the screeps server i set up a while ago btw?

I need to get back to my screeps but I won't have time until January 
surely I documented my code well so I can pick it up easily

Well, a while ago i asked the mods, and at the time it was said that it fits both, since there are also technical aspects to music production, but i guess it also depends on if you only showcase the music itself or if you actually want technical feedback and/or are showing the technical sides
am I getting better at this RVC stuff
my screeps on the main server died and honestly i've not gotten back into screeps myself yet
disney deployed the lawyers

acually did the high pitch parts kinda good
more people doing rvc?
me
thats cool
used to do a bit but dont do much now since i got other stuff to do in life
was gonna use it to train my own voice to sing but then ended up not doing that
theres a file of shadow singing somewhere here
who
shadowlilac
idk who that is
if I wanted ~100 random neuro sounds where would be a good place to look?
I may actually be in with a chance of rewriting my bottle opener and this seems like the easiest option for now
Twitch VOD of Neuro-sama for the 6th of September 2023
This is a live recording of the stream so it has all the original audio that is missing from the VODs in Twitch.
This VOD will be privated if/once Vedal uploads his version to the official VOD channel, it's only meant to be a patch while he catches up not a replacement.
If there's portions...
oh god that thing
I was more wondering if there was a directory of sounds, I see then on starboard occasionally
ooh rachinova has a folder in their bio with sounds
auto react is annoying
Just sounds like better base
Still slightly bad though
Also got this made
better than the first time
The mix really suffers though
yeah
AKA there's no mix at all so it sucks
any free mix software?
LMMS is what I use
gonna try it out

Can do cool stuff with it
anyways did learn how to separate the backing
nonzero chance neuro bottle opener becomes a reality this week lol
you can do cool stuff in audacity
Well it's not made for it so it's way harder
it’s not a selling point
Making cool stuff with LMMS is way easier than Audacity

one question
how do I show off that this would have several sounds without getting drunk
is the bottle opener called jerry by any chance?
I wish
What kind of bottle opener? Wine bottle or beer glass bottle?
Oh i guess wine since wtf do you electrify in beer one
programmers drink alcohol a lot ig
beer
I got this one, have been doing some poking at it

ah yes
pink anti-static mat, isopropyl alcohol, and those crocodile gripper thingies
when you asked about reverse engineering i thought you are working in pentagon and researching some spyware or maybe top tier processors and microchips or smth
why is it hard
and from the data sheet it should be possible
is it locked
do they even have crc there
the pcb its on connects some of the legs together and I'm not 100% sure what it does
cant you just replace the sound block
I have the software they use to package it lol
oh
this toy is literally the example program for the chip
the only hurdles atm are:
- making the software convert all the sound files
- making a "database" for the flash with some other software
- flashing that "database" onto the chip
kinda tied up with life stuff at the moment but if I get some free time I can post some extra nerd stuff about it
i give them props for not making it wifi/bt connected 
😭
so many of the chips are it's incredible
I think this one supports an IR remote if you wire it in though 😭
lol
did anyone do transposition tables?
does it give performance boost right away without iterative search? i mean all by itself
is search with transposition tables faster than search without it
right now it gives no performance boost and my bot plays worse
its so frustrating that i spend so much more time on the bot and cant make it work
i do not understand why i need to store "bestmove" in this position in transposition entry
if whenever i find the entry i just return the value
it is never used! wtf
hello, i have joined for two reasons, one being neuro, evil, and vedal make the funniest content and have made me laugh more than i have ever laughed
the second reason is i need help with software, python, pytorch, pip, okada voice changer thing and i am way too dumb to do this, youtube doesn't help.
my autism brain can't do this alone

if youre open to helping me, shoot me a dm, and ill get a vc started
i personally dont really like doing vc or dm stuff, and if you ask a question here there is a higher chance you'll get a good answer cuz of the amount of smart people here
the question is, who has the masochism and the IQ to walk me through pretty much everything i need to do to get things working
i dont do llm stuff so not me
wut
its cuz of the chess tournament
i thought ply came from this https://en.wikipedia.org/wiki/Ply_(game_theory)
so its amount of turns
no
amount of half-turns

from my understanding, turn increments every time both players have made a move, while ply increments per player per move
thank you 
no prob
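The ply/turn relationship above as a tiny sketch, assuming ply is counted from 0 at the start position and White moves first. The helper names are made up for illustration.

```python
def full_move_number(ply):
    """Turn (full move) number, starting at 1; it goes up after both sides have moved."""
    return ply // 2 + 1

def side_to_move(ply):
    """White moves on even plies, Black on odd plies."""
    return "white" if ply % 2 == 0 else "black"
```

So a "depth 4" search in plies looks two full turns ahead.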
so i just discovered that if i dont use "exact value" transpositions my bot is working fine
so it means my "exacts" are false
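A sketch of the conventional transposition-table bookkeeping, not anyone's actual bot code. A score is only "exact" if it landed strictly inside the original window, and the stored best move is still useful for move ordering when the stored score can't be returned (entry too shallow, or the bound doesn't cut). `TTEntry`, `bound_flag`, `usable_score` and `order_moves` are illustrative names.

```python
from dataclasses import dataclass

EXACT, LOWER, UPPER = 0, 1, 2

@dataclass
class TTEntry:
    depth: int         # remaining depth the position was searched to
    score: int
    flag: int          # EXACT, LOWER or UPPER
    best_move: object  # move that produced the score, if any

def bound_flag(score, alpha_orig, beta):
    """Classify a finished search result against the window it started with."""
    if score <= alpha_orig:
        return UPPER  # fail-low: the true score is at most `score`
    if score >= beta:
        return LOWER  # fail-high: the true score is at least `score`
    return EXACT

def usable_score(entry, depth, alpha, beta):
    """Return the stored score only when it is safe to use, otherwise None."""
    if entry is None or entry.depth < depth:
        return None
    if entry.flag == EXACT:
        return entry.score
    if entry.flag == LOWER and entry.score >= beta:
        return entry.score
    if entry.flag == UPPER and entry.score <= alpha:
        return entry.score
    return None

def order_moves(moves, entry):
    """Try the stored best move first, even if its score could not be reused."""
    if entry is not None and entry.best_move in moves:
        return [entry.best_move] + [m for m in moves if m != entry.best_move]
    return list(moves)
```

A common cause of false "exacts" is comparing the score against an alpha that was already raised during the search instead of the alpha the node started with.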
The pitch 
Yeah upped pitch a little to make it sound a little better because original song had female singer😂
Anyways got the background vocals in the instrumentals that time and not in the vocals
i have made enough vulkan boilerplate to have it be somewhat similar in size to opengl when using the functions 
i could probably clean it up even more, but ive just been refactoring for 4 hours
i may also have improved performance somehow during the refactoring, idk how
its doing 15k fps now
so uh
first two steps might be done
ugh I don't think I actually have anything non alcoholic to make a funny video with
or more accurately there is nothing I actively want to drink
it's beautiful
Note for pitch changing, always change it by a multiple of 12 semitones or it goes out of tune
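The usual equal-temperament relation behind that rule, as a sketch: shifting by n semitones multiplies frequency by 2^(n/12), so only multiples of 12 land on the same note names as the untouched instrumental. `pitch_ratio` and `keeps_note_names` are illustrative helper names.

```python
def pitch_ratio(semitones):
    """Frequency multiplier for a shift of `semitones` in 12-tone equal temperament."""
    return 2.0 ** (semitones / 12.0)

def keeps_note_names(semitones):
    """True when the shift is a whole number of octaves, so every note keeps its name."""
    return semitones % 12 == 0

octave_up = 440.0 * pitch_ratio(12)   # A4 moved up one octave
tritone_up = 440.0 * pitch_ratio(6)   # lands between octaves, in a different key
```

A shift of 6 or 8 semitones still sounds consistent on its own, but the vocal ends up in a different key from an instrumental that wasn't shifted by the same amount.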
Buffermanager is doing some real heavy lifting 
I see now that the code is completely unreadable on mobile
can i not use unordered map for killer moves?
do i need the ordered one
since these moves have somewhat not random id
Ancient glyphs
Yay my bot won against random moves bot
Now to put it against the strongest old variant bot

I changed that by like 6 or 8 I forgot
Yeah that's bad
Ok it gets completely crushed
And it's not even fast enough either at depth 4
12 sounds way too pitched up sometimes
But I mean if tone stays that is fine
I'm back, now with a CPU having somewhat of a messed up heartbeat:

Why is this filtered not working like the GPU run?
it's normal if you run something costly like NN learning
True, but the code that runs JUST FINE on a GPU just refuses to run on a CPU 
Why you need CPU run if you have GPU, again?
VRAM limit.
The puny 8GB isn't cutting it
I mean "cutting"
ah I remember now
yeah...
I had the opposite issue when I was doing similar thing...
With Tensorflow?
Wait, what did you do that it runs on the CPU? Because .... HOLY RAM
but it took some time
LMAO, the 64GB of DDR4 couldn't handle 24000 Samples
well... my data was quite specific, so it did not take me too much
Cycle 5 of Superbox’s Evil RVC inferencing on its outputs for BOOM
Well, my aim is only that it makes the face sketches (image) into human faces.
at once - probably. Try lowering batch / shrinking datasize / continued learning(or smth)
That is absolutely horrifying
look like "No, I'm not a human" character LMAO
Yup, I give you that.
It’s like
I don’t even know how to describe it
It’s like a Minecraft ARG thumbnail image
Wait
Well, I based it off the Deep Doodle hackerpoet uploaded.
You added data as I remember - if it can't handle it all at once train on a part that it can handle then replace it with new part and repeat with a model you trained on previous step and so on
Well, it is with 600 images, with these parameters from the Python file that makes the training data:
# --- Configurable parameters ---
num_images = 600 # Number of images in your folder
samples_per_images = 20 # default is 10
dots_per_images = 120 # default is 60
image_w = 144
image_h = 192
image_dir = "pictures"
num_channels = 3 # Must match the model input channels
num_samples = num_images * 2 * samples_per_images
And these are the parameters that train on the training data it made:
# Training data settings
num_epochs = 200000
batch_size = 8
valid_ratio = 10
lr = 0.00008 # default 0.0008
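A back-of-the-envelope sketch of why those settings strain memory, assuming each sample is stored as a full 144x192x3 array of 4-byte floats (the dtype is an assumption, it isn't stated here). `dataset_bytes` is a made-up helper.

```python
def dataset_bytes(num_images, samples_per_images, image_w, image_h,
                  num_channels, bytes_per_value=4):
    """Rough size of one dense array holding every sample (float32 assumed)."""
    num_samples = num_images * 2 * samples_per_images  # same formula as the script
    values_per_sample = image_w * image_h * num_channels
    return num_samples, num_samples * values_per_sample * bytes_per_value

num_samples, array_bytes = dataset_bytes(600, 20, 144, 192, 3)
array_gib = array_bytes / 2**30
print(num_samples, round(array_gib, 2))
```

That is roughly 7.4 GiB for one array of 24000 samples under these assumptions, and a sketch-to-face setup needs an input array and a target array, so holding both at once would go past 8 GB before the model itself is counted.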
looks strangely low at first sight
. . . LOW???
Lemme look what the image resolution is
psst
I mean 144 x 192, no?
for those of you who haven't seen this before, have fun https://gandalf.lakera.ai/baseline
All of them are 178x218 and then get scaled to 144x192 if I'm right Lankin
Is that another Comet moment?
looks like it
but what do the samples_per_images and dots_per_images parameters do?
Well, datagen.py from the GitHub is basically still the same if I'm not mistaken.
I'm lazy - give the link
I based that off the Python 2 based source code of CodeParade (aka hackerpoet) from that GitHub repository: https://github.com/HackerPoet/deepdoodle
I'll try to figure out how it works. it'll take some time
I would appreciate it if you could, I somehow got it to train on GPU only so far.
Btw, I can also give the code that runs fine on my GPU only so far if you wanna.
No
And I do understand if you can't help
i found a bug
my quiescence search was returning false result
finally its stronger than last version 
Hi Iggly.npy

-# Someone tell Chaosminecraft there is a problem with my AI
How's going?
Hi shumi.ex
Well I think I'll need your code of training
SAMPLES_PER_IMG is a parameter for generating the sketches in the training dataset from source images
DOTS_PER_IMG does nothing 
I figured out already, but thanks
I can give train.py and the other file that is being imported. (dutil might be the same)
Wait a second, I can make the npy files and exit early to look if it has an effect...
well yes if you need to lower sampling from each image
it's still evaluating the loss 5 times, twice on the whole dataset 
it'll lower the training data size, but I don't think it'll fix VRAM issue
Well, it stays on 972MB while changing dots per img
Wait, where?
Yes, cause it's only producing more image for dataset and not related to training process
Wait. the black and white images should be saved? 
Thanks OpenAI for filtered that up again
Is java a good start to get into programming? :0
no
My personal pick is Python 3 ngl... But I'm biased because that is the only Language I'm good enough in
python isn't good either, but an easier solution
Yes, I learned on an internship Python 3 with the Python 2 getting started around 2018.
you use model.fit which already has to calculate the training loss internally
then you evaluate the model on the training dataset twice with model.evaluate
then twice again but with the test dataset
3 of those losses are being discarded for no reason, and the ones where you don't explicitly specify the batch size may or may not be doing something stupid like loading the entire dataset into VRAM at once
Because my English was not great at 2018.
How it's related?
Would javascript be better?
So wait, I don't need to run the model.evaluate?
The getting started of Python 3 was in English on my Internship place.
I'm sorry, but I don't see it as a programming language
?
it's different
OOOOOH, I see it now shuni.ex
delusion
Well i gotta learn either one for a game i wanna mod
you definitely don't need the ones assigned to mse
those just straight up aren't used anywhere
the other ones just require fixing the batch size but are probably useful 
YouTube is stable today 
And i mostly learned the entire modding api that it comes with
but in general - learn some random popular language and it'll be easier to understand and learn other languages. I personally started with pascalABC and then C++
then JS is fine
Is it minecraft?
Nope mindustry
Yeah, I just realized that 
It has a whole modding guide made by the dev
Lemme just try that on the main machine rq
With an Api that lets me use .json files to mod :0
That VRAM curve looks better too:
So im also doing this on mobile which limits me to those 2 choices
And it's still failing with 200 images, 5 samples per image, 40 dots per image, and a batch size of 5
Ik kotlin mods exist too but idk if i can write kotlin on phone even
I think the only things that affect VRAM here are the batch size
and the image sizes
For the next try I reduced the batch size in train.py to 1
And in addition I'm deleting the npy files
so it has to make them again with known values
JS will be much easier than java then
it should also print VRAM consumption
I don't see VRAM consumption printed as text while it trains.
java is just a stricter language and the code will be more verbose
Oh i see, although i think the size issue was fixed by the dev
the data loading part of this script is definitely still broken in other ways than the unnecessary evaluations
probably because the whole dataset is saved as a single numpy file 
Do you happen to know where? Because I honestly don't...
Like 50% of effort on this game is used to make the api and coding it even easier :0
And the original code saves them also as a whole npy file
But ig i'll learn js first and then java too screw it
it could be an issue
Well, CodeParade said that I should stream the training dataset while it is training, and I don't know what was meant.
@solid bough try to not archive your data in one file but leave it as is so you don't need unpacking - could help
If I understood shuni
Did you know: I am on the edge of understanding that code, so I can't really change it to use the images directly for training.
that probably means not loading it as a whole but in parts, one after another
the training doesn't actually use the whole dataset at once, it only uses one batch at a time
currently the script loads the entire dataset into memory ahead of time, which is kinda stupid because only that tiny bit of it is actually used at any given time
so it's way more efficient to load only that little part of it as required 
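A rough sketch of what "load only that little part as required" can look like, using only the standard library. It assumes each sample is saved to its own file instead of one big archive; the JSON format and the names `write_samples` and `stream_batches` are made up for illustration and are not from the actual script.

```python
import json
import tempfile
from pathlib import Path


def write_samples(folder, samples):
    """Save each (input, target) pair to its own small file; return the paths."""
    paths = []
    for i, (x, y) in enumerate(samples):
        path = Path(folder) / f"sample_{i:05d}.json"
        path.write_text(json.dumps({"x": x, "y": y}))
        paths.append(path)
    return paths


def stream_batches(paths, batch_size):
    """Yield (inputs, targets) one batch at a time.

    Only `batch_size` files are read per step, so memory use depends on the
    batch size rather than on the size of the whole dataset.
    """
    for start in range(0, len(paths), batch_size):
        xs, ys = [], []
        for path in paths[start:start + batch_size]:
            record = json.loads(path.read_text())
            xs.append(record["x"])
            ys.append(record["y"])
        yield xs, ys


# Tiny made-up dataset: 5 "images" of 4 pixels each, with one label each.
samples = [([i, i, i, i], i % 2) for i in range(5)]
with tempfile.TemporaryDirectory() as folder:
    paths = write_samples(folder, samples)
    batches = list(stream_batches(paths, batch_size=2))

for xs, ys in batches:
    print(len(xs), ys)
```

In the real script each batch would be converted to arrays before training; Keras's `model.fit` can take a generator of `(inputs, targets)` batches in place of full arrays. If the data has to stay in a single `.npy` file, `numpy.load(path, mmap_mode="r")` keeps it on disk and reads only the slices that get indexed.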
Something I don't know how to do
True, but that is something I don't know how to do.
at the end of datagen file you archive it in one file
then in the beginning of training you load the packed whole dataset
you can merge two files as an easy solution
and remove archiving
just leaving it as arrays
True
then slicing it should be an easy task
That would also eliminate the "I need to delete the npy files" part
yeah
also the .fit function consumes all the data at once - you probably need to slice the data and pass it to that function in parts
but it should be ok if I remember right

What is gonna happen if I just comment that out?
Oh
It.. 
It doesn't train 

It can be commented I guess
i basically do this
where move is a struct (has bools, and ints)
should work right?
memset is yellow but g++ doesnt see it
For a second I saw "витяя" instead of your nickname
discord is lagging
Oh, I usually see a few hundred ms, if I'm not wrong
I should go now. Probably will be there tomorrow at the same time(dude I got like 2 free hours per day and spend it in this channel - what am I doing with my life)



#include <cstring>
it felt a little weird to use "string" named thing to fill memory with 0
yeah i don't like that all the memory related functions are in the string header either but oh well
How many TB vram are we putting sora 2 at chat
I really need to make this song mp3 and not wav so it doesn't take so long to send message
tried LMMS for mixing and it is way better than audacity
I feel like it should be slightly higher pitch
oh
(im not a musician at all maybe im wrong)
Hm
surely nothing will break
GG
-1PB
Holy
i uh
was wrong
:3
its broken
even after reflashing it with the original data, it refuses to speak
😔
But it can be reflashed again
yeah
more just concerned that it isn't working at the moment
but I probably messed something up when I tried to do some soldering

me when I don't back up my files
i got excited seeing the method names, thinking it might be some sort of new c++23 feature
it's not a new c++23 feature 
might take time though
What was at the start btw?
idk the neuro RVC has that weird sound idk when no singing is happening
the evil one doesn't do that Though
c++26
the reflection operator

holy shit is c++ rustifying
no reflection in rust
and if you want readability in c++ you will lose it the moment you look in stl
oh my god i can get the name of an enum without doing macro shenanigans
it only took them a few decades
now to wait a decade for every major compiler to implement it
i'm still waiting for c99 style designated initialisers to become a thing
aaany day now
c#ifying
it's common lispifying 
yes i did
let's not be delusional 
i do not need meds 
a bit late for that
it's never too late
i want the ability to write int a[] = { [256] = 0 };
in c++, is that really too much to ask :(
yes

is there a music channel in the server?
memset(a, 0, sizeof a);
afraid not, soz
but static initialisation 
doesn't work with enum lookup tables 
also happy birthday shiro~
ako nod 
too cumbersome to memset the whole thing to 0 then set each thing individually and pray nothing uses your variable until you do it 
*((uint_8192 *) a) = 0;
❤️
die
if that's a concern i think you have bigger problems 
Slightly meaning 12? Because it's a multiple of 12 or nothing here
Needs instrumental carve, reverb (freeverb in LMMS), and reduced vocal volume
i mean honestly it isn't that big of an issue but why delegate something to runtime when it can be done in compile-time?
wha wha?
2 words, template hell
what is instrumental carve
EQ in the vocal range to make space
there's a difference between c99 designated initialisers and templates 

Just write 0, 0, 0, ... 
I am mixing two songs together right now
i'm going to slap your face
What happens if you do less than 12? Is it technically impossible or will it sound bad?
if I could I would have just upped the pitch slightly
speaking of multiples, today i'm writing a base64 encoder and decoder by hand because i hate myself
watch them implement it as a cursed template parameter on std::array
the fact that that's entirely possible 

pitch the instrumental up as well 
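On the "multiple of 12" point: in 12-tone equal temperament, shifting by n semitones multiplies every frequency by 2^(n/12). A shift of 12 is exactly one octave, so the vocal stays in the song's key, while a smaller shift moves it to a different key unless the instrumental is shifted by the same amount. A quick sketch of the arithmetic (`semitone_ratio` is just an illustrative name):

```python
def semitone_ratio(semitones):
    """Frequency multiplier for a pitch shift in 12-tone equal temperament."""
    return 2 ** (semitones / 12)


for shift in (1, 7, 12):
    print(f"{shift:+d} semitones -> x{semitone_ratio(shift):.4f}")
```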

i wonder how good i can make an rvc cover sound
Well grab the models and throw something at them
yeah
The hardest part in a raw RVC cover is getting a good vocal
This is my best attempt
if i really wanted to make it sound good i would resing, probably not in this instance
give me a song
name
Uhh
I don't know
If you want a challenge: Cha Cha Cha by the Finnish artist Käärijä
My untuned Teto cover for reference
aw hell nah you weren't lying about "a challenge"
this is going to be so bad
how am i supposed to get anything usable out of rvc with this vocal 
There's a GUMI cover you could use as a base
kääristä :DDD
Arrangement: Turbo
Original: Käärijä
Vocals: GUMI
Art: RAGErkgk
DL, Inst, VPR, Art, and a misc. doc.
https://drive.google.com/drive/folders/1uqROxMIQUtg1CFwHTBrypaY727rvQuU-?usp=sharing
It even comes with a download link
I think TETO's Finnish is better though
Probably because I tuned it as a Finnish person
Anyone got a suggestion for good longer in depth beginners guides to Python?
i personally like the really long youtube tutorials
and then just try to make some shit, and if you encounter problems and try to fix them, you will learn
surprisingly way better than what i expected?
Yeah programming is a learn by doing kinda thing
Oh?
well of course we're not defying the shit in, shit out principle with the crowd vocals
but aside from those pretty good
I typically use BVE to get rid of overlap
the base64 site i'm using to test my decoder just told me GG
chat am i cooked
i fucked myself over by not setting the bpm first in the daw
i got to delay and realized i can't do it properly
can't just change it now because i cut the vocals up 
whatever i'll redo it
uh hello i have a question, is there a site with a guide to revert a bootable usb back to normal?
are you on windows?
i dont have that yet
idk what mint comes with pre installed, but you basically wanna delete all the partitions on the drive and create a new one
(note: make sure you pick the right drive when doing this)
gparted is a good tool, highly recommend 
is there a specific guide for gparted that i can look for?
this has a decent enough answer ig
making a base64 encoder/decoder while sleepy is a horrible idea
it's simple on its surface, just read 3 bytes and interpret it 6 bits at a time
or was it 4 bytes
no no
it was 3
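For reference, a small sketch of the "3 bytes in, four 6-bit values out" idea in Python. It assumes the standard RFC 4648 alphabet with `=` padding; `b64_encode` is a hypothetical name, and the standard-library `base64` module is only used to cross-check the result.

```python
import base64  # only used to cross-check the hand-written version

ALPHABET = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/"


def b64_encode(data: bytes) -> str:
    """Encode bytes as standard Base64 (RFC 4648 alphabet, '=' padding)."""
    out = []
    for i in range(0, len(data), 3):
        chunk = data[i:i + 3]
        # Pack up to 3 bytes into a 24-bit integer, zero-filling a short tail.
        n = int.from_bytes(chunk + b"\x00" * (3 - len(chunk)), "big")
        # Split the 24 bits into four 6-bit indices, highest bits first.
        quad = [ALPHABET[(n >> shift) & 0x3F] for shift in (18, 12, 6, 0)]
        # A chunk of k bytes carries k + 1 real characters; pad the rest.
        keep = len(chunk) + 1
        out.append("".join(quad[:keep]) + "=" * (4 - keep))
    return "".join(out)


for sample in (b"", b"M", b"Ma", b"Man", b"hello world"):
    assert b64_encode(sample) == base64.b64encode(sample).decode("ascii")
    print(sample, "->", b64_encode(sample))
```

So it is 3 bytes in and 4 characters out when encoding, and the reverse when decoding.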
wait.
i fixed it
now it's only outputting garbage at the tail end
this is why rest is important
i fixed it i'm a genius
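One common cause of garbage at the end of a decoder's output (not necessarily the bug hit here) is emitting all three bytes for a final group that contains `=` padding. A sketch of a decoder that drops the padded bytes, assuming standard padded Base64; `b64_decode` and `INDEX` are illustrative names.

```python
ALPHABET = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/"
INDEX = {ch: i for i, ch in enumerate(ALPHABET)}


def b64_decode(text: str) -> bytes:
    """Decode standard padded Base64 without emitting junk for the last group."""
    if len(text) % 4 != 0:
        raise ValueError("padded Base64 length must be a multiple of 4")
    out = bytearray()
    for i in range(0, len(text), 4):
        group = text[i:i + 4]
        body = group.rstrip("=")
        pad = 4 - len(body)
        is_last = i + 4 == len(text)
        if pad > 2 or "=" in body or (pad and not is_last):
            raise ValueError("'=' may only appear as 1 or 2 trailing characters")
        # Rebuild the 24-bit value from the real characters only.
        # A character outside the alphabet raises KeyError here.
        n = 0
        for ch in body:
            n = (n << 6) | INDEX[ch]
        n <<= 6 * pad  # padding positions count as zero bits
        # Keep only the real bytes: 3 normally, 2 or 1 when padded.
        out += n.to_bytes(3, "big")[:3 - pad]
    return bytes(out)


print(b64_decode("aGVsbG8gd29ybGQ="))
```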
had to use a high fullness model and then use BV removal model and then dereverb... I think it turned out okay, despite vocals being very noisy due to the fullness model
eh, it's not good.. it's listenable, though
that part goes harder than it should XD
its roughly 2am rn 
the 30min nap at 8pm definitely didn't help
have i actually been working on this shit for 2 hours
see if i haven't made this stupid thing i wouldn't have known
i still haven't come up with what to do with the crowd vocals
everything else i kind of fixed
Headlights, your Mustang witnessed as we're eye to eye


a 2ndhand mustang is like 40K 
like i'm not leaving this in
it's supposed to be me doing my best to make it good
def noisy.. clean enough for RVC
and the fullness is more important
(haven't run reverb removal yet)
this is big beta 5e
well, it's def not.. ideal.. the noise is the least of the worries
plus this to remove BV
it's... okay... vocals sound very processed :/
yeah, it's just too much for it to work with.. was gonna say, I can use an even higher fullness model.. that's not the issue..
ok i'm satisfied enough for now
i am very sleep deprived and have extreme ear fatigue by now so the mix or master or everything is probably shit but i don't care at this point i wanna sleep
gn chat
???
that's... weird... why does big beta 5e sound so much less full XD
it's supposed to very much be the opposite
that's the highest fullness model I got
scratch 15K fps, im doing 17Kfps now cuz i rewrote the descriptorset stuff
i wasnt even trying to do that, it just happened
cubemaps should work now tho

They do, but i havent made the pipeline support multiple shaders yet, so i cant really use them yet
Thats for tomorrow tho, i need at least some sleep
question- if i want to ask for opinions on where i should place some of the UI in the game i'm making then is it better to ask here or in #artist-alley ?
Here
Ok so basically i'm making a TD, ignore the placeholder stats on the left, how should i implement the UI into this? I placed a bunch of towers to get an idea of roughly how much space is gonna be taken (intended... at least).
I'm thinking the UI to like choose towers to place down to be like a bottom bar, and the UI that pops up when you click a tower i have no clue where to put yet or how big should it be etc.
mostly focusing on the upgrade one cuz it would help me implement upgrade system to finish the core mechanics to start polishing
You have a bunch of space on the sides
If its going to always be like this (i mean if you dont make a map thats super wide and takes all the space) it seems natural to make "shop" on the one side and some details on the other
thats how it looks on the desktop, the sizes are fixed
and oh so shop not as like a bottom bar?
Depends on how big and branchy it is
thats as big as the game assets get
intended anyway, but players can place towers anywhere on that screen
as long as its in the grid
When you have a bunch of towers you will need to compact it if its in the bottom or on the top since there is little space
And left/right is free to add many things
since i have a cap on the amount of towers u can place per tower, but i dont have much restriction on where
Oh i thought the square is the game area
nono
the square is me manually placing all the towers
as if i was gonna play the game
but technically a player can place towers on the edges of the map
which now that's mentioned i wonder if i should limit the "playing" field a little bit
Enemies are coming out from all sides around?
Hmm
Then you need top and bottom free since its already super small you just wont see them coming out of there
ic
I guess right or left side, small icon like "menu" when you hover - the menu appears
someone suggested i do TFT's pop up style for the upgrade part
which is a box 10% of the screen width and 50% of the screen height on the far right side
Yeye thats what you want i think
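The panel arithmetic is small enough to sketch. This assumes pixel coordinates with the origin at the top-left and a vertically centered panel; the centering and the name `right_panel_rect` are assumptions, only the 10% by 50% size and the right-side placement come from the suggestion above.

```python
def right_panel_rect(screen_w, screen_h, width_frac=0.10, height_frac=0.50):
    """Return (x, y, w, h) for a panel flush with the right edge of the screen.

    Vertical centering is an assumption; only the size and the right-side
    placement come from the suggestion in the chat.
    """
    w = round(screen_w * width_frac)
    h = round(screen_h * height_frac)
    x = screen_w - w           # flush against the right edge
    y = (screen_h - h) // 2    # centered vertically (assumed)
    return x, y, w, h


print(right_panel_rect(1920, 1080))
```

Since the game's sizes are fixed, the result could just be computed once and hardcoded.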





can i do this
