#programming
1 messages · Page 237 of 1
No that is also not the same thing
Self trained implies trained by something made by itself, differnt from self learning which is learning at inference time
Alright I'm sorry. But that's the reason I'm trying to learn
Whatever the case, the Neuro is trained on Twitch chat thing is false information spread by clueless idiots
And no current public architecture can do self-learning
At least for text models
Ok so then what resources do I need
Depends what you want to learn
Well I have Mistral and python
What kinds of models you want to work with, to what extent you want to train them, how from scratch do you want to do them
I work with vocal synthesizer models trained using open source tools
And the goal is to first create a bot that I can interact with locally to test and then put him in my own discord server
Stupid easy, simply grab llama.cpp and any random language and then link the two
As long as you have enough GPUs
Will there be limitations down the line if I wanted to give it a voice or use it on twitch.
pitch should be fine regardless of how shit the output is considering the instrumental does not bleed into the separated vocal. if you use a bleedless model that should be totally good, idk how reverb affects things though. i'd dereverb with a separate model
I've got a GeForce RTX 3060 and an i9 12900
12GB 3060?
Mainly hardware
Oh, I have something similar actually written in python and using gpt-3, except the llm itself isnt hosted on my machine
I hate cloud models
Not sure from what I remember Nvidia GeForce RTX 3060 but it seems to be great. Last time I checked 4090 was the best one
most of the models are way too big for quality of output. like the most performant way to do custom voice tts is still a small model + rvc after all these years
I will need to upgrade my ram. I've got 16 GB ram
The VRAM is what matters
It's not my first choice but for testing purposes it suits my needs; I figure once I have more tools and agents defined I can work on migrating to a local model
its very early stage rn
Money though
Ah yeah vram. That's what the 12 GB is. I've heard about 24+
Yeah do you have a 12GB 3060?
Good then you can run 8B models just fine
Let me double check
i paid $10 sometime in the spring and havent run out yet so I figure im fine for now LMAO
24GB is found on the 3090, which is what I recommend for inference
Why cant llm's train on the fly
i've given up on local llms it's just not worth it they're pretty bad
I'm apparently using Mistral 7B but I don't know what the 7B means
I will never pat for LLMs
¯_(ツ)_/¯
7b parameters, model size
7 billion
many 
as llms go that's not many
How could they? They're statistical models
oh btw what about the titans arch what happened with that
Isnt that "memory" vedal made for neuro - technically learning?
no
No
7 billion parameters, at 8 bits per parameter it requires 7GB of VRAM for the model alone
it's like if you had no memory whatsoever and relied on a journal to "remember" things
Hmm
llm memory is what it is, a journal
uhhhhh yeah? to understand language?
Is it about complexity of things they can discuss
Does the ti at the end tell you anything
at the very least
I'm trying to figure out how to know what GB it is
the bottleneck with ai inference is usually the memory bandwidth
and i don't think the ti has any improvements on that?
usually LLMs do pre-training and then fine-tuning, training too much further can lead to overtraining, and extra pre-training on a fine-tuned model is less stable for that than extra fine-tuning (pre-training is more ideal for on-the-fly, since you get the loss function by comparing output to input, instead of relying on a human or better LLM to produce an example response to compare to for fine-tuning)
also training is much more intensive than inference, so you'd notice slowdowns or simply be unable to run both at the same time unless you outsource the training to another computer and have it run in parallel with the inference model
So you need certain volume to just run it, and memory speed to train/run efficiently?
and any solution for that would be pretty hacked-together i feel
you could do "sleep cycle training" or something where you have it experience things as it runs and then it stores that data to train on later during off-hours
idk what it is for training, but pretty sure the actual performance is what you need, but for running (inference) it's memory speed and amount is what you're after
it's a lot easier to not update the parameters much and to add a memory system or something though
Inference= running?
they crippled it with the shitty ass vram lol
16gb is alright
but i wouldn't want to spend that much on 16gb vram
"buy datacenter gpu pls we want $2000 margin"
So I'm definitely going to need a better graphics card if I ever want to run something like this is what I'm gathering at least to the extent of streaming
12gb is enough to start off
get more as you need more
Yeah but I got 8GB
ah
I was wrong
can work with smaller models i suppose
You suppose. Oh god I feel so inferior lol
7B is a common standard but there's smaller models out there that are still pretty good
buying big ass gpu is expensive, don't do it unless you're sure you need it
there are some cases where some continuous learning like this is done now
the big use case I'm aware of is basically continuous RL since you usually want your data for that to be on-policy (from the newest version of the model)
usually that happens during training before a model is released but there was recently an article by Cursor that they're applying it to their tab completion model too
(but AFAIK they all kinda "cheat" by using checkpoints)
ooh it's finally catching on
well you see i think that they're all shit below like 30b fuck local llms they're insanely stupid. and you can't exactly run anything larger on one consumer grade gpu at a sane quant
I use my computer for gaming and music production and I plan to start streaming. Was gonna go the v tuber route
and you want to make a local LLM based discord bot with a vtuber model as well? damn that's busy
good luck
I got some other ideas as well. But with this recent one. I want to get as far as I can with it
The discord one was just to test it and see how it would work
I figured testing something like this on discord would be beneficial to learning and seeing if I could do it
maybe gaming doesn't count but meh "i want to make local llm discord bot that is also a vtuber" is a pretty big commitment on its own i feel
yeah but music is a thing you can assume a person doesn't do professionally
I don't want to do exactly what vedal did
ah
i have zero experience with it
didn't know
More so a fun AI thing I could bring out from time to time
The majority would be my stuff
so one can just allocate less time to that
But it seemed like a really cool thing to have
Oh you're on a 3060Ti? That means 8GB VRAM guaranteed and you're screwed
it's like all art, most people who do art don't have a career around art
yeah it sounds a lot less busy now lol
I'd say music is just as hard. I just know more about music but even still I wish I was better
i'm not saying it isn't hard
Yeah lol
thanks nshittia!
are there any docs for the 7tv api? cant seem to find anything useful on the website or the repo
the backend code is the docs. they don't have docs
I don't have a career around it but I'm still hoping. I just need to better prioritize my time with actively investing in things that will help me in the future like learning skills
I do audio ML and vocal stuff, I make cool stuff
Hell yeah
The cool stuff in question
I finished a song and tested out Hatsune Miku for the lyrics
It's ok but not the best
This is a real Neuro vocal synthesizer by the way
btw if you want to list channels and emotes and that kind of thing your best bet is the graphql api
the rest api they have is pretty bad
If you're committed to running inference stuff, I suggest a 3090 as a good value inference GPU
I can show you some of my stuff I don't think it's as good honestly
I personally just silly around, throw some stuff at NeuroSynth, throw that at a DAW and throw a mix on top, and cool stuff just magically comes out
thx. Btw would be thankful if there's an example somewhere of fetching emotes(metadata only) from an emote set fetching https://7tv.io/v3/emote-sets/Id works for me
You have Vocaloid? Interesting
I only use free vocal synthesis tools
Oh I can't post audio in here
Lack embed perms
It was a thought to use it but I really want to work on my own voice
Just keep chatting to unlock
Your own synthesizer?
No like me the human hi
Oh
Lol
I think there fun. But I feel like I have to. Real talk. I feel like I have to be the best and if I ended up not trying to achieve being a great vocalist it would not satisfy me
But yeah I have two projects I can share with you
When it comes to vocal synth usage, it's not that hard once you get it on machine learning banks, but way harder on sampled banks like Teto or Miku
One was a thing I trashed that I still really like and then the vocaloid song I made that was a lot of firsts for me
Also because I am the only one with NeuroSynth access that can do good stuff with it I am automatically the best NeuroSynth artist
Yeah getting Miku English was a process
if you still wanna do gql here's what that would kind of look like. it's just that you have more control with gql over what you're requesting
JP banks to EN is kinda hard
JP to Finnish is easier
I'll just dm them
But yeah that's basically my portfolio.
At least what I'm comfortable sharing. And isn't like 20 second musical idea lol
They seem so simple. But it's weeks of grinding. And hours in between moments of inspiration
I don't know what the average is on song writing. But it can take like hours to create 20 seconds of music
what
exactly
allowing me to read pdfs on an ereader 
I've started multiple projects and I never finish a single one of them

real and true
and factual
Progaming
Progaming real
a truly good project is never finished
it's constantly evolving 

that sticker is great
Hi
A wise man once said
hi
Happy birthday cheese
I was wondering how the twins, Evil in this case, is able to recognize itself. I understand she uses a Clip-style model or a seq2seq multimodal like Florence2 behind the scenes, but I guess is like fine-tuning the model with images of them?
he said the coffee was eliv's face
contextual clues

you think so? A lot of times in art review they seem to know which on is which (sometimes)
they do have a vision module
not sure about the implementation, but according to the old geoguessr videos I think it might be just some vision embeddings with attention
I would guess there's some fine tuning to recognize themselves. They need it for art review.
but in this case, evil doesn't really need a vision module to answer "I'm so cute"
'cause she have the context of that coffee has her face
I don't think he told her before showing her. Let me recheck the clip.
yeah, I think it was something along the lines of: "look who it is"
and evil responding 'its me!"
While traveling in Japan with Robcdee, he decided to buy Evil a very special coffee with her face on it.
Vedal Official Links: https://vedal.ai/
Links to streamers featured in the video:
https://www.twitch.tv/robcdee
https://www.twitch.tv/vedal987
Music used in the video:
Kirby and the Forgotten Land - A Trip to Alivel Mall
#vtuber #neurosam...
I check the VOD and I guess she didn't know
my bad

how do I have bad memory
I'm not even 20
I tried np.float16 there:py x_train = np.load('x_data.npy').astype(np.float32) / 255.0 y_train = np.load('y_data.npy').astype(np.float32) / 255.0
Yes
It's alive
Ahah... Uh wait, am I supposed to be worried? And that looks like a cursed heartbeat
Btw
HIII caibi 

just funny
Okay 
And I'm training the worst model with my GPU rn:
Dw, There is gonna be a down movement again.
Now I can experience the pain of trying to get that exact code that runs right now to run on CPU alone 💀
Told ya:
so to even change the built in dpi for my mouse i need to install a full app??
I think it depends, But I have a LG mouse and last time I checked, positive.
I don't understand my errors
I just edited a little and it somehow spawned 30+ errors
That's coding for ya, Should I just trial and error the code that trains the model on my GPU to run on a Intel® Xeon™ E5-1650 V3?
oh god
You can do it, I believe in you!
I have backups
bestValue := currentEval 
chat this is why you use a version control system

when did my python turn into a makefile
always has been
why can't they unite both two
weirdly smooth loss curve
ah it's being fit to the data
i see
lr is 0.00008
the curve in picture 1 is different than in 2
Yeah, that first one was an earlier one, That is right now the state:
very different actually, must have been a big spike around epoch 600
the real loss graph belike
No, that is literally the raw loss values with no processing.
it can't be
probably more like
I'm being serious
it literally isn't, the values at different points change between pictures
Because that is while the trai... You mean that you sent?
wait i'm mistaken, the numbers in the graph don't change that much
There's not a chance that is the real loss unless this is an exceptionally simple model
Loss just is not that clean
still heavily doubt it's the raw loss values, yeah
that's gotta be smoothed
you do not see smooth loss like that anywhere
Well, learning rate is "0.00008" and that is pretty much it
Yeah not even close to that smooth
NeuroSynth at its most stable is nothing like that it still varies like crazy
That is where the data get's held:py #Training the model print("Training...") text="Done, Training the Model (initialisation)" log_info(text, logg) train_rmse = [] test_rmse = [] train_score=[] test_score=[]
And that is the plotting routine:py def plotscrores(scores, test_scores, fname, on_top=True): log_info("Plotting scores...", logg) plt.clf() ax = plt.gca() ax.yaxis.tick_right() ax.yaxis.set_ticks_position('both') plt.plot(scores) plt.plot(test_scores) plt.xlabel('Epoch') loc = 'upper right' if on_top else 'lower right' plt.legend(['Train', 'Test'], loc=loc) plt.grid(True) plt.draw() plt.pause(0.01) # <-- updates live plt.savefig(fname)
There is no such smoothing being done.
Nya >:3
And that is the loop that writes to the variables:```py
for i in range(num_epochs):
print(f"Epoch {i} of {num_epochs}")
text=f"Epoch {i} of {num_epochs}"
log_info(text, logg)
print(x_train.shape, x_train.dtype)
print(y_train.shape, y_train.dtype)
model.fit(x_train, y_train, batch_size=batch_size, epochs=1)
#Training results
mse = model.evaluate(x_train_mini, y_train_mini, batch_size=8, verbose=0)
train_loss = model.evaluate(x_train, y_train, verbose=0)
train_rmse = np.sqrt(train_loss) # if your loss is MSE
print("Train RMSE:", train_rmse)
#Test results
mse=model.evaluate(x_test, y_test, batch_size=batch_size, verbose=0)
test_loss = model.evaluate(x_train, y_train, verbose=0)
test_rmse = np.sqrt(test_loss) # if your loss is MSE
train_score.append(train_rmse)
test_score.append(test_rmse)
print(f"Test RMSE:", test_rmse, "\n")
model.save('Model.h5')
print("The model is saved.")
plotscrores(train_score, test_score, 'Scores.png', True)```
So god for saken nowhere it is getting smoothed
"plotscrores" lol
your test loss is using the training dataset 
also you're evaluating it twice which is unnecessary, but that's just efficiency
test_loss = train_loss
Lmao, I wondered why the hell both are the same
Where was that, I literally can't find where that line is
It literally doesn't exist on my code:
I didn't name the function, ok?
Ah thanks for showing.
congrats wtf
and you started on white
My model recognizes itself in pictures based on the features present (even without separate fine tuning of the vision part of the model)
but it's something
let me try raising it up to 10,000,000 nodes and start on black
The curve starting to get movin:
*differentiate
It was smooth before, and now it is getting a bit squigly.
it's because you have two curves
Well, t was right that it was using the train variable instead of test.
i would've never spotted that error if shuni hadn't pointed it out
Yeah, but I appreciate the help to find the error, next run is gonna use the loop in that configuration:```py
for i in range(num_epochs):
print(f"Epoch {i} of {num_epochs}")
text=f"Epoch {i} of {num_epochs}"
log_info(text, logg)
print(x_train.shape, x_train.dtype)
print(y_train.shape, y_train.dtype)
model.fit(x_train, y_train, batch_size=batch_size, epochs=1)
#Training results
mse = model.evaluate(x_train_mini, y_train_mini, batch_size=8, verbose=0)
train_loss = model.evaluate(x_train, y_train, verbose=0)
train_rmse = np.sqrt(train_loss) # if your loss is MSE
print("Train RMSE:", train_rmse)
#Test results
mse=model.evaluate(x_test, y_test, batch_size=batch_size, verbose=0)
test_loss = model.evaluate(x_test, y_test, verbose=0)
test_rmse = np.sqrt(test_loss) # if your loss is MSE
train_score.append(train_rmse)
test_score.append(test_rmse)
print(f"Test RMSE:", test_rmse, "\n")
model.save('Model.h5')
print("The model is saved.")
plotscrores(train_score, test_score, 'Scores.png', True)```
So that should be the fixed code
Thanks for pointing that out shuni.ex
oh, I think I know why the loss curve is so smooth now
it's not the actual training loss 
awww man yea no
null pruning actually doing something at the end game tho
That are the Parameters btw:py #Training data settings num_epochs = 200000 batch_size = 10 valid_ratio = 25 lr = 0.00008 #default 0.0008
HOW IS THAT STILL GOING DOWN???
you used a lower learning rate than usual
oh. right...
the model will converge much slower
my bad
and will tend to reach a lower loss when it does converge
the hyperparameters aren't the issue
it's that your "training loss" is evaluated using the entire training dataset after each epoch
it's not the loss used for optimization 
it's not spiky because it's a huge average, not a sample like usual
Yeah... But in my defense that is what the original code did too
interesting to see the average go down so smoothly
because there's (probably) a lot of data
Well, let's say I use a learning rate of 2, that doesn't go well
10 images
that is the datagen.py parameters:```py
--- Configurable parameters ---
num_images = 10 # Number of images in your folder
samples_per_images = 4 #default is 10
dots_per_images = 100 #default is 60
image_w = 144
image_h = 192
image_dir = "pictures"
num_channels = 3 # Must match the model input channels
num_samples = num_images * 2 * samples_per_images```
Aka the code that makes the data the train.py uses
No wonder then 
i was thinking "epochs make me think of looping over an entire dataset 1 time per epoch, but surely that can't be happening here"
that doesn't make a difference for how you calculate the loss
if anything calculating the loss separately like this is limiting because it slows down training
yeah, that is exactly what's happening 
honestly, upping the learning rate is probably better in this case
Well, that is why I wanna get that code running on a CPU
if you do too many epochs over the same training data, it converges worse
Well, I just run it like right now for Kicks n giggles.
usually you'd never evaluate the entire training dataset like this, it's insane
only works with small datasets (or a really really fast model I guess)
What did I do now toast.? 😅
You could add some randomness to the images to make more input data and force it to better learn the actual features idk
Like idk, shift them or sth
yeah, transformations or masks or mirroring or rotations or blurs
I have no idea how I can make the Code actually use the 600 images that I can train it on GPU right now.
What do you have in your bot
if I go higher on image amount, I need to drastically drop the other stuff.
Except if I run on CPU
almost tempted to work on bot but i added a bunch of code yesterday and now it crashes without printing anything to console so uhhhh i think i shall procrastinate that a little bit longer
Because I repurposed temporarily one PC that has 64GB of total memory
but no GPU. (not even iGPU)
curious to see how the changes will do (i've added my first idea for a pruning system when i heard about this challenge) but not quite that curious just yet
If I said what epoch it is doing right now, it would be out of date by like a dozen already
your training code is inefficient
improve it 
meanwhile i struggle to copy lague's code into c++ and fail for 3 days straight 
Im not winning this challenge thats for sure
I... didn't... Ugh. I have to come clean I guess.
Chat was doing the conversion from Python 2 code to Python 3 code
Yes, boo me, but it runs.
Neural network to convert a sketch into a face. Contribute to HackerPoet/DeepDoodle development by creating an account on GitHub.
That is straight the original
All I was able to do was the print changing and xrange, but the keras n stuff was something I was just not good on
so the original code was already a bit weird but it's much worse after the modifications 
Yeah, What do you expect of Chat?
it's probably not even that hard to fix
chatgpt is decent enough at bugfixing, but its bad at generating complex code imo
I had to go to tensorflow, and I myself didn't know that before Chat did that.
In the past where I tried it, it was not going great.
Is it now overfitting?
Right i found what's causing my errors
if you give it the problematic code and the error, 9/10 times it will find the issue in my experience
Accidentally deleted some letters
Yeah, but to run the code on CPU with the model staying the same, no.
just don't evaluate the losses twice, only do it once like in the original code
also make sure you're always specifying the batch size, it's probably crashing with more images because it tries to process the entire dataset as a single batch or something
I tried lately to train on CPU only that I have the upside of 64GB total Memory as my advantage
And the PC is then dedicated only for training the model
OOO, that looks not too bad, it's still mids Training though:
"mids draining" 🤣
can't really tell because there is no test loss 
but since you only have 10 images you aren't going to fit the model to the distribution you want anyway 
Right... Imma just wait until it goes straight to the right, if that ever happens...
You don't know if it's overfitting without eval loss
Yeah
but that isn't too bad for my standards
it will happen eventually, but by that time your model is already cooked 
What kind of model is that anyway?
Deep Doodle: https://youtu.be/eEFlk6sSv88
#DeepDoodle
Demonstration of a side project of mine.
Download: https://github.com/HackerPoet/DeepDoodle/raw/master/DeepDoodle.zip
GitHub: https://github.com/HackerPoet/DeepDoodle
Music By incompetech
That is straight of the maker
Oh so it's just an image to lines model
Lines to images, Yes.
And I really REALLY wanna beat that model
And I do have data
but it's grooling to sift through the THOUSANDS of images for QC
Idk the right word, Corrections are appreciated
trying to make cubemaps work in vulkan 
Here is a hug.
i have to make stuff that werent settigns before parameters and put them in a diffrent file
i'll be so down if i dont make anything good today 
Yikes 😬
its not too bad, just copy pasting a lot of stuff
I end that now:
And yeah, the model was still mids training when I started doodler.py
That can't go wrong
Well, it is that kind of model
Ooo, Second line 
Yay
How do I run that on CPU only?
Oh
You're on Phone
Sorry shuni.ex
But that is straight the code with the changes where you saw the error
why do you want to train a CNN on CPU 
screams in Model.h5ValueError: Dimensions must be equal, but are 8 and 3 for '{{node mean_squared_error/SquaredDifference}} = SquaredDifference[T=DT_FLOAT](sequential/activation_7/Sigmoid, IteratorGetNext:1)' with input shapes: [20,8,192,144], [20,3,192,144].
I'm going to go straight to the point, The PC has 64GB total of RAM:
So that PC has more than enough RAM to train the model
I don't even see a GPU
Yeah, because the PC has none
If there is no GPU it should automatically run on CPU
Nothing
you do not need 64 GB of RAM to train this model, it's tiny
Tru
I was planning on doing 600 images as a test and then sifting through more images to expand
Tensorflow doesn't even have GPU acceleration on Windows
I have no idea, lemme look in the logs
That is all I see on the logs...
07/10/2025 16:04:46 [INFO] Checking if the data is set up
07/10/2025 16:04:46 [INFO] Data is there.
07/10/2025 16:04:46 [INFO] Loading Data...
07/10/2025 16:04:52 [INFO] ('Loaded', '8000', ' Samples.')
07/10/2025 16:04:52 [INFO] Attaching more channels and splitting the data.
07/10/2025 16:04:54 [INFO] Channels attached and split.
07/10/2025 16:04:54 [INFO] Shuffling the data...
07/10/2025 16:04:55 [INFO] Data got Shuffled successfully.
07/10/2025 16:05:03 [INFO] Setting the image format.
07/10/2025 16:05:03 [INFO] Building the model from Cache or Scratch.
07/10/2025 16:05:03 [INFO] ILoaded pre_model.h5
07/10/2025 16:05:03 [INFO] set up the optimizer successfully.
07/10/2025 16:05:03 [INFO] compiled the optimizer.
07/10/2025 16:05:03 [INFO] made a Model image.
07/10/2025 16:05:03 [INFO] Done, Training the Model (initialisation)
07/10/2025 16:05:03 [INFO] Epoch 0 of 200000
At that line:py model.fit(x_train, y_train, batch_size=batch_size, epochs=1)
So pretty much on the bottom of the file
Well, it also throws:2025-10-07 16:12:52.287128: W tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'cudart64_110.dll'; dlerror: cudart64_110.dll not found
Also:2025-10-07 16:12:57.806102: W tensorflow/stream_executor/cuda/cuda_driver.cc:269] failed call to cuInit: UNKNOWN ERROR (303)
But I think I can ignore that
is that on the GPU or the CPU machine?
CPU
If there was enough VRAM 💀
how much do you have
How do you not have enough VRAM for such a tiny model?
Only 8GB
that's enough
a model this tiny will need practically nothing
more VRAM just means higher batch size but practically anything will work
I wanted to train for a test 600 images, and then I wanted to sift through more images for qc to have like 1200
Sounds tiny
The vast, Vast majority of the images don't look where the camera was
But what trips me up is still:ValueError: Dimensions must be equal, but are 8 and 3 for '{{node mean_squared_error/SquaredDifference}} = SquaredDifference[T=DT_FLOAT](sequential/activation_7/Sigmoid, IteratorGetNext:1)' with input shapes: [20,8,192,144], [20,3,192,144].
But I can't do 600 images because apparently I go OOM when doing that
Wha
Lemme just
You're definitely doing something wrong then
Are you trying to load every single image into memory all at once?
yeah, very likely attempting a batch size of "the entire dataset" 
do the default settings
Also how are you using the GPU in the first place? Tensorflow has no GPU accel on Windows
. . . 600!?
what
don't make the batch size 600
Yeah, I just set default stuff now
Except image amount, That is the total images number I have now
That... What the
huh
WHAT IS HAPPENING!?
It friggin runs now?

Okay, What the filtered is batch size supposed to mean?
still trying to allocate way too much 
at least it runs now
YEA, BUT HOW???
Last time I tried it ALWAYS failed to run

basically it tells the training code how many samples from the dataset to load at the same time
So a lower batch size would lower the memory needed, but something is... The file is there in that chat 🤦♂️
So a lower batch size would lower the memory needed
correct
That is the file
All I can say now is: WTF
What the, I can't
That is not train.py
that is of doodler.py
Wait, it's of doodler.py
It was unrelated to the training
How to submit a Library of Ladev issue? It doesn't handle dashes, like bun-believable. https://libraryofladev.com/search?text=bun believable
I know why it was working
I think the search function of my chess bot is good enough, for now. I have some issue on evaluation now
It was using the old x_data.npy and y_data.npy
PeSTO's PVTs doesn't really perform well on my model

surely shiro won't notice if I just copy the piece value tables from Boychesser 
Well, with the x_data.npy and y_data.npy corrected, I get:
I had to delete the files that it is what I set.
ok i think i made negamax with alpha beta with nothing else and it works
when i remove beta cutoff it slows down so i guess its all right
with quiescence it became super slow and super bad
it is literally the same as normal search but shorter wtf
is this correct
i dont understand where i put - and where i dont put -
im so dumb
should have become a janitor
i think i got it

incorrect
i didnt make move here so i pass alpha and beta and everything unchanged
i'll do it really explicit this time and only then shrink
because it all gets messy
What now? Is it correct or incorrect?
no its incorrect
we go deeper with a minus if we made a move
here i just pass it to different search mechanism with same alpha, beta, and sign
Oh, well, after I’m done making my food, imma loook what I need to do that I can run my code on the CPU
this is correct (i think)
see how alpha and beta have no minus and passed on their places
Hi
Bye chat
Why???
No ass
ooh i forgot about sorting.. thats why it looks slower than when i walked this path last time
mlntcandy
chaosminecraft
That is what happens if I do 600 images with default settings of the GitHub
It just doesn't have the needed memory
i don't know what you got going on but what should fit into the memory during training is the model and the batch
The training data
if it doesn't have enough memory but still runs at some point that must mean the batch size is low enough to allow that to happen. are the samples even in size? if oom doesn't happen consistently then samples not being the same size would be an explanation
The time where I said "How?" it was with the data having only 10 images
Something like that: https://youtu.be/eEFlk6sSv88
#DeepDoodle
Demonstration of a side project of mine.
Download: https://github.com/HackerPoet/DeepDoodle/raw/master/DeepDoodle.zip
GitHub: https://github.com/HackerPoet/DeepDoodle
Music By incompetech
i see
I wanted to make a better model
That is actually too low compared to what the creator used, apparently the creator used "It was 1200 plus mirrors"
weird idk then
That is straight the message inside the "" part
Yes, I indeed had the balls to contact the maker
Assuming the GPU of the maker having 8GB when it was trained, I wonder how it was done
And that is the training data size when 600 images:
does anyone have any idea what "ply" could mean
in total 1,85GB
its lague's code
Wait, What are you trying to do?
Oh, I thought you were doing 3D stuff
players?
My bad vituha
ply moves ply tree idk
Sorry
plays? 
is it some slang
oh its chess
you said league so i thought league of legends
league 
oh
i meant Lague its the name of some guy
Lague != league 
you didnt say league, i just cant read
lague the guy who made the chess ai videos
come on he makes fucking gamedev videos you should know him by name
i barely know my own name
based
you're setting the bar too high
soemtimes i hear a ping and then i look to discord and there's nothing there
discord has given me undiagnosed schizophrenia
Then it is a reaction, I know that feeling.
just turn off pings ezpz
Now I need to start thinking which is: sdlkhjgklsdjhfghjklwsehjgrkewhl
i only have them on to mentions
i accidentally turned off direct mention notifcations outside of dms
so i miss a lot
😭
im 61st in the server ranking, its too late for me to never get any pings
i do enjoy reading them tho
unless its a ping about bugs
Did someone say buks?
@olive sable 🐛

You know what else is massive?
why can't this reverse engineering project be super easy so I can finish it and focus on real work like uni
Nah, You. 🤣
You were not thinking of...
Uh
I can't say, not PG
i have 2 options, either i make a new commandpool in the framemanager
or i have to pipe an existing one from main.cpp into the subfile, then into a function, and form there into a function called by that one

Low taper fade
-# totally wasn't thinking something else...
Oh, I am not familiar with that stuff, My bad...
What do you mean spelling? I wasn't spelling.
Spelling is when someone turns the word "Picture" to "P i c t u r e"
aka each letter on a word

Anyways, tensorflow really doesn't like running on CPU
What line of that message?
damn he just had it spelled out to him
ericdoa my goat
With the context I have I still don't understand
I'm just gonna work that the code can friggin work on CPU now 🤦♂️
there are 2 lines there. you really cant figure out i meant the first one?
No, I go in multiple paths while thinking what was meant, and some times I just happen to go the wrong one
Oh great, lemme pull up my phone rq
What emoji is that even?
All I see:
Oh, on my iPhone it is just blank
object replacement character (U+FFFC)
why do you even bother to send the message if you type nevermind in the same line? 
erasing more effort probably idk
Because of the "WHAT THE"
and that
But mainly for context that I was wondering how the code now works on the CPU only machine but that it then still didn't work out.
Sreams in binary streamTraceback (most recent call last): File "c:/Users/Sheep/Documents/temp/train.py", line 327, in <module> plotscrores(train_score, test_score, 'Scores.png', True) File "c:/Users/Sheep/Documents/temp/train.py", line 52, in plotscrores plt.plot(test_scores) File "C:\Users\Sheep\Documents\temp\.venv\lib\site-packages\matplotlib\pyplot.py", line 2842, in plot **({"data": data} if data is not None else {}), **kwargs) File "C:\Users\Sheep\Documents\temp\.venv\lib\site-packages\matplotlib\axes\_axes.py", line 1743, in plot lines = [*self._get_lines(*args, data=data, **kwargs)] File "C:\Users\Sheep\Documents\temp\.venv\lib\site-packages\matplotlib\axes\_base.py", line 273, in __call__ yield from self._plot_args(this, kwargs) File "C:\Users\Sheep\Documents\temp\.venv\lib\site-packages\matplotlib\axes\_base.py", line 419, in _plot_args for j in range(max(ncx, ncy))] File "C:\Users\Sheep\Documents\temp\.venv\lib\site-packages\matplotlib\axes\_base.py", line 419, in <listcomp> for j in range(max(ncx, ncy))] ZeroDivisionError: integer division or modulo by zero
Oh

Train RMSE: 0.28846472713957755
Test RMSE: []```
this does not alter "depth" variable does it?
no
instead of mating it repeats last turn move which is illegal
here it doesnt pass depth==max_depth for some reason
even though its supposed to be highest level turn
are you even incrementing the depth anywhere
im setting it to 4 before calling starting search and decrement it when going down
can i get fen from cute chess
hmm
iirc there was a way to have it log to a file and dump all states
idk specifics though
function looks like this, it detects mate but doesnt pass depth==maxdepth check and doesnt record the move
it is called with depth=4 (depth=maxdepth)
alpha is -9999999
i dont get it
log prints board before move
wtf is this
in pictured board queen makes e7d7 (last move)
oh it bugged
nvm
so it detects bestmove e7d7, does it
enemy king moves
it detects mate but doesnt record the move so it makes last turn move
it correctly sees the board before move
it probably detected this mate 5-6 turns ago
wtffff
just do it man 
mate him 
?!
i know how to do it.
i'll print more info when turn count is close
to this mate
oh i think
i got mixed up with bot names and running wrong bot
@solid bough
Colab notebook for your doodle thingy
don't even attempt to train on CPU, even inference sucks ass
I really don't want Google thinking I'm doing anything but Training
hey, running it on the cpu can be fast
all it takes is a smol useless model 
and lots of pain manually vectorising
I have a history where I used to run a render client on collab...
If it even makes one 
Welp, small image amount it is 💀
that's the thing with CNNs, they don't have many parameters so they're small but they still need a ton of compute 
i forgot how to implement CNNs ngl, wasn't that a 5 level deep loop or sth
well, for the shitty cpu version ig
ain't nobody got time to use mpi 
And more like a pain in my poyo to run code where it runs on GPU btw
Btw, how am I supposed to open that file?
Starting with TensorFlow 2.10, Windows CPU-builds for x86/x64 processors are built, maintained, tested and released by a third party: Intel
interesting
convolutions
one small kernel applied to every position of the input 
(idk I never really dealt with CNNs except now)
upload to Colab
or another Jupyter instance but you very likely do not have one, so just use Colab
i guess you go over all rows, then all columns, then the same again and over all layers
so 5 level 
nice
though
Huh:
sounds about right
not n^5 at least, more like zn^2k^2
imma be quiet for idk how long
there's a funny thing you can do wiht fourier transforms and convolutions iirc
its pretty neat imo
though I don't 100% remember what it is
iirc its basically "instead of doing convolutions normally take the fft of the kernel and the fft of the image, multiply, reverse fft, poof convolution applied"
that checks out i guess
since you're basically multiplying 2 polynomials
and you can use fft for that
i cant see what is wrong
is it because my last turn gets beta-cutoff
but why does it affect the winning bot and not the losing one
no, there are 35 moves and 10 cutoffs

it only checks one move and it gets beta cut-off
wtf though
so beta cut-off kills final search
but i dont use this high number anywhere than in calling initial search.. how can "evaluation" get this value
because opponent's moves are so bad he returns -999999999 which turns into 999999999
i just ban beta cutoff on max layer then
it works
ok tomorrow i add transposition + iterative search
and it beats stockfish ez (
)
9999 ELO tomorrow surely
one of the joys of being an older brother is looking at your siblings math homework and thinking "which dumb fuck wrote these questions?"
Lol
they had a x and y or z, but didnt specify if "and" or "or" took priority.
so i jsut assumed programmer style "and" took priority, and it was wrong
i need to smack that teacher
broken on mobile, so no
my predicament:
false advertising 
name: "Don't touch my links"
i guess i wont then
it is indeed broken on mobile
mobile is inferior anyway 
Sam, I got a raspberry pi 16gb andddd radio equipment so... robot arm with AI vision remotely controlled will be peakkk
if i use my other hand to hold the mosue i can do 9cps
huh
holy shit i can play with them
i dont like the website title
Wtf I am not clicking that
I know nothing of coding aside from a little knowledge from watching Doug Doug code for a few years, mostly how to get AIs to cooperate. I have this idea for a game that I have been wanting to make. What is an beginner friendly coding language for game design. It needs to be simple enough that I can teach myself. I have chat gpt to help but I mainly want to use it as an assistant to learning like generating questions and problems to fix.
unity + they have their own ai helper
embedded into ide
they use c# for programming and there is a ton of guides
interesting, so full utf8 support
“gayballs” is crazy work
although some characters are a bit cropped
mf WHAT
i aint putting each and every dot for every letter in every language
i automated it
gamedesign gamedev is mostly unity ye, godot if you feel contrarian or unreal if you hate both yourself and your gpu works aswell, altho its harder
unity is the most widely used so a lot of resources on it
of course. and it's not like I complain - just an observation
why I can't attach pics btw?
idk
i've seen people use php for scripting but why laravel 
this has 4k stars btw 
php can be faster than python for some use cases (not a huge bar to clear i know)
I once used pure bash for a game pathfinding backend, so, why not?
it's probably a replacement for ink but with laravel instead of react because ew react?? i am guessing here
ink being the thing that things like claude code and codex and gemini cli use
am I missing something or is the whole repo combined just some 200 lines of code?
check what it says in the readme
it's a micro-framework - it's just an addition to Laravel
these are pretty cool
whaaa
ooo damn that's cool
Sure...
how much do 1t tokens cost
tree fiddy
varies based on the model, also if that's input or output tokens
for GPT5-Mini, 1T output tokens would be 2 million dollars + however much the input tokens cost
Open Router essentially resells it though, it is a connector to multiple API providers with a single unified API
the cheapest one would be gpt-5 nano on which approximately $5k assuming all tokens are input and cached
but probably like $40k if actual usage, but keep in mind 5 nano is crazy cheap
$2m is more realistic
all companies in the 1t club resell it (or you'll go bankrupt lol), other 1t ones i saw are from notion and cognition
(notion ai and devin)
(nano is also crazy bad though, it only works for very easy tasks like summarizing text in the context or bad direct translations)
yeah

explain why this doesn't say bajillion dollars then
chat more 
SpAmMiNg??
on cooldown? Like for a fixed time or fixed messages?
a message awards you 10-20 points but then a cooldown begins during which messages won't award points and you have to reach 1k i think?
Understood, thanks
Well... I just started chatting more actively a few days ago since finally I have some spare time after work and university
It'll take some time
oh shut up gayballs is better

Applio?
😕
Other RVC GUIs suck
wha
oh
Drop whatever you're doing and get this
Yeah Applio is real easy, on Windows anyway
thanks
What are you gonna use it for by the way?
If you're planning to use it for Neuro and Evil I have good voice models for you
That works, Applio has an easy training mode
also maybe neuro and evil too
Then you want these
They're deprecated in favor of NeuroSynth and EvilSynth, but they're the best of the best when it comes to Neuro and Evil RVC
cool
Remember to credit if you post anything using them
where do I put the voice models
idk what folder
You use the Download tab and drop the files

You extract the model files, then go to the download tab and individually drop each PTH and INDEX file
oh ok
That will let Applio put the files in the correct locations
I do need to remove background from a song before putting it in right...

what ever I will find out after it generates
yeah

Try UVR
yeah
I like the acapella of evil singing life
but yeah better if I seperated background music

It's recommended to use vocal synthesizer output as a base
😕
See
the what
Something like SynthV Solaria
oh
It has way better quality
NeuroSynth and EvilSynth are a thing because SynthV is expensive so we want a free solution
And RVC is just limiting
oh ok gotcha
or sing it yourself 
isn't syth more robotic though
True
Who made that?
lol
Technically
Nah Machine learning synthesizers are really good these days
See
Me and Wispers, though I did the work on the mix
CHXI's instrumental was also used
Yeah,old vocoder is……
oh and also you get backing vocals in the vocal track as well as the lead vocal mixed together when stem splitting and rvc won't handle those well
isn't neuro and evil karaoke RVC?
v1 is
beyond that voice synth
But V3 no chance
i don't think v2 is rvc
oh ok
I think V2 is RVC or RVC on top of a basic synthesizer
maybe the latter? or maybe i'm just wrong i'm not confident enough
But either way V3 is 100% not RVC
V3 is certainly a native synthesizer
Just like NeuroSynth
so neuro and evil singing is pretty much vocaloid
NeuroSynth is cool
Eh not really, closer to SynthV AI
But it's impossible to train a model by self
Clueless
It must cost much
Literally what NeuroSynth is
NeuroSynth is done on a single 3090
Which I have at home
Very cheap compared to an LLM
You should not try to assume things about vocal synthesizers if you don't know what you're talking about
it wasn’t even a 3090 at first
Yeah at first a 4070Ti which has half the VRAM
I have 4060 I think
Should be fine enough for inference
The 8GB of VRAM will limit you in training though
Does that really matter when the end result is cool?
You don't do audio ML so don't go assuming things someone like me that does do audio ML can correct
superbox gatekeeping voice synth training pipeline 
Yes I have been doing so for a while now
genuine question why
I still think RVC can be more realistic sounding
Something like ENUNU?
Not default models
Like FT2 bleedless or big beta 6x or something not old
RVC is way more limiting
NeuroSynth can already do stuff RVC can't really do
yeah like the original songs?
with good enough input rvc will still sound better than neurosynth 90% of the time though
attempted to add transpositions and iterative search, now it misses easy mates, and is super slow
uhh at least nothing seems to be broken
i don’t have access to neurosynth so i’m not sure if it’s the model or the tuning that isn’t good enough
With good enough tuning NeuroSynth will always sound better than RVC once trained on real data
maybe it already is better but no one can actually use it that’s able to make it do better
we’ll never know
Well we're cooking up the first organic data model, NeuroSynth-BETA-JP
Data is at 2/46
Oh amazing!
skipping rvc pretraining or no?
Yeah no more RVC

All directly from Neuro data
what does "organic" mean here 
Not synthetic as in not RVC
data
like train data?
oh
NeuroSynth will be available to the public once we have English organic
i thought its
organic datamodel
and not
organicdata model
base neurosynth is trained on vocals made to sound like neuro with rvc (synthetic data), so not that
We're just abandoning RVC for good and moving on to better things
Yeah, and some of them are not that good for Neuro

RVC is limited by the quality of the base model.
and it's uncontrollable.
Is this the reason?
RVC causes generational loss so it's better to not have RVC at all for NeuroSynth training
The generational loss is significant enough that NeuroSynth-BETA models up to 3.2 always miss any note higher than G5
tbh mostly just sounds like sloppy tuning 
True I would tune that completely differently now that I know what NeuroSynth actually wants
But the model itself is also very bad
I forgot I did this
I think this was for Arisu
I forget which tuning type this has
My guess is old tuning as new tuning should begin at Raise up your bat
I think this predates new tuning by a week







