UE Threat Inputs for AB | Stockfish | Page 12

rocky vigil Nov 11, 2025, 10:45 AM

#

yeah i see, if you or someone else solves it just make a pr

prime mica Nov 11, 2025, 10:45 AM

#

yep doing that rn!

#

lol I can't test 32-bit builds locally

#

is there a way for me to run CI from my branch

rocky vigil Nov 11, 2025, 10:49 AM

#

💀

#

idk

prime mica Nov 11, 2025, 10:50 AM

#

ok lemme make sure it works on godbolt 32-bit compilers and then if it does we can just try it?

#

worst case we revert

#

except godbolt doesn't have them either blob_facepalm

foggy wind Nov 11, 2025, 10:54 AM

#

prime mica is there a way for me to run CI from my branch

Yes. When you name your branch github_ci the ci should run on your fork too, I think.

prime mica Nov 11, 2025, 10:54 AM

#

oh what

#

https://github.com/anematode/Stockfish/commit/2122681b0c881e03342efd16868ac8a2add8c083

GitHub

add x86-32 fix · anematode/Stockfish@2122681

foggy wind Nov 11, 2025, 10:56 AM

#

Hmm, at least that's how I understood it. Maybe it also needs to be activated in your repository settings?

#

Maybe I'm completely wrong, but something like this existed 😄

prime mica Nov 11, 2025, 10:58 AM

#

lol

#

powerful

#

I guess I'll just create a dummy PR to stockfish

violet badger Nov 11, 2025, 11:04 AM

#

prime mica lol I can't test 32-bit builds locally

I think you need to install the right 32 bit runtime, should be possible on the side.

#

sudo apt-get install gcc-multilib

#

or similar

prime mica Nov 11, 2025, 11:06 AM

#

gotcha

#

ugh why is vmovl_high_s8 not defined on armv7-neon

#

oh it's 64-bit only?? why
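
On the `vmovl_high_s8` question: the `_high` widening intrinsics are declared only for AArch64, so a 32-bit NEON build needs the two-step form. A minimal sketch, assuming the goal is just to sign-extend the upper eight lanes; `widen_high_s8` is a hypothetical helper name, and this is not necessarily how the actual patch handles it.

```cpp
#include <cassert>
#include <cstdint>
#if defined(__ARM_NEON)
    #include <arm_neon.h>
#endif

// Hypothetical helper: sign-extend the upper 8 lanes of a 16-lane int8
// vector to int16. Input and output go through memory to keep one signature
// on every target.
inline void widen_high_s8(const std::int8_t in[16], std::int16_t out[8]) {
#if defined(__aarch64__) && defined(__ARM_NEON)
    // AArch64: dedicated intrinsic for the high half.
    vst1q_s16(out, vmovl_high_s8(vld1q_s8(in)));
#elif defined(__ARM_NEON)
    // 32-bit ARM (e.g. armv7-neon): take the high half, then widen it.
    vst1q_s16(out, vmovl_s8(vget_high_s8(vld1q_s8(in))));
#else
    // Portable fallback so the sketch also builds on non-ARM hosts.
    for (int i = 0; i < 8; ++i)
        out[i] = in[8 + i];
#endif
}
```

All three paths should produce the same eight values, so the fallback doubles as a reference when testing on a machine without NEON.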

#

good news tho, seems like my patch fixes x86-32...

#

will now try to fix armv7-neon

#

also matetrack... @violet badger any ideas what might be happening there?

#

what is matetrack anyway 😩

violet badger Nov 11, 2025, 11:14 AM

#

matetrack is a script that tracks matefinding ability https://github.com/vondele/matetrack, but it is just the testcase above

prime mica Nov 11, 2025, 11:14 AM

#

ack

#

you mean the tablebase crash?

violet badger Nov 11, 2025, 11:14 AM

#

#1336647760388034610 message

#

yes

#

same problem

prime mica Nov 11, 2025, 11:14 AM

#

gotcha

violet badger Nov 11, 2025, 11:15 AM

#

gotta run.

prime mica Nov 11, 2025, 11:15 AM

#

🏃‍♂️

amber fern Nov 11, 2025, 11:18 AM

#

oh yeah! Does anyone know how threat inputs performs on matetrack?? Im so curious!

prime mica Nov 11, 2025, 11:19 AM

#

it crashes :P

#

ok time to download TBs I guess

#

I was gonna do that anyway at some point

lofty cedar Nov 11, 2025, 11:28 AM

#

Whoa! After all the patches and 20+ people working for like half a year or longer, the final code is only about 1000 lines?

#

Stockfish code is so compact.

stray reef Nov 11, 2025, 11:29 AM

#

quite a few months of the time "spent" on this project was me thinking that increasing L1 would not work

#

it's not that complex once everything works. though there's ofc some trainer changes as well

amber fern Nov 11, 2025, 11:33 AM

#

lofty cedar Whoa! After all the patches and 20+ people working for like half a year or longe...

of nnue code? or other files?

lofty cedar Nov 11, 2025, 11:33 AM

#

But why is it only like 6 elo in Stockfish even at VVLTC?

lofty cedar Nov 11, 2025, 11:34 AM

#

amber fern of nnue code? or other files?

All files combined.

lofty cedar Nov 11, 2025, 11:34 AM

#

lofty cedar But why is it only like 6 elo in Stockfish even at VVLTC?

In Yukari, it's like 25 elo.

stray reef Nov 11, 2025, 11:34 AM

#

lofty cedar But why is it only like 6 elo in Stockfish even at VVLTC?

what else should it be? :P

amber fern Nov 11, 2025, 11:34 AM

#

lofty cedar All files combined.

all files in stockfish total? Really?

lofty cedar Nov 11, 2025, 11:35 AM

#

amber fern all files in stockfish total? Really?

Just the change.

lofty cedar Nov 11, 2025, 11:35 AM

#

stray reef what else should it be? :P

20+?

prime mica Nov 11, 2025, 11:35 AM

#

I mean it's more like 15 elo when you take into account that one NN is SPSAed

#

also you'd expect gains to be smaller when modifying stronger engines right

amber fern Nov 11, 2025, 11:36 AM

#

lofty cedar Just the change.

ohhhh! lol

stray reef Nov 11, 2025, 11:36 AM

#

lofty cedar In Yukari, it's like 25 elo.

the engines are so different and so far apart, it's absolutely not comparable. maybe threat inputs work quite well with little data, weak data, or suboptimal training procedures. too many factors to say

#

also we don't know the speed diff (TI vs. pre-TI) in yukari

#

comparing a same-sized net in plenty vs. yukari, i found it's a rather slow engine, that might explain some of it

lofty cedar Nov 11, 2025, 11:38 AM

#

In Monty it's like 40.

Well, in Plentychess... it's less gain, but still.

#

But wow, Stockfish was fast.

stray reef Nov 11, 2025, 11:38 AM

#

monty is MCTS

prime mica Nov 11, 2025, 11:38 AM

#

info depth 7 seldepth 9 multipv 1 score cp -96 nodes 1268 nps 181142 hashfull 0 tbhits 0 time 7 pv g4e2 d3d4 e2f3 b3d2 f3g2 d2c4 g2f3
syzygy/tbprobe.cpp:1081:22: runtime error: shift exponent 64 is too large for 64-bit type 'value_type' (aka 'unsigned long long')
SUMMARY: UndefinedBehaviorSanitizer: undefined-behavior syzygy/tbprobe.cpp:1081:22 
(base) cowpox@Cowpox src %

ok great, can reproduce locally

#

this is on ARM, so I doubt it's any of the vector code
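
For context on the UBSan line above: in C++, shifting a 64-bit value by 64 or more is undefined behaviour, which is why the sanitizer flags it regardless of architecture. A minimal sketch of a shift that is defined for every exponent; `safe_shl` and `low_bits` are hypothetical helpers, not code from `tbprobe.cpp`, and a guard like this would only hide the symptom rather than explain why the exponent reached 64.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: a left shift that is defined for every exponent.
// Plain `x << n` is undefined when n >= 64 for a 64-bit operand.
constexpr std::uint64_t safe_shl(std::uint64_t x, unsigned n) {
    return n >= 64 ? 0 : x << n;
}

// Hypothetical helper: mask with the low `n` bits set, valid for n in [0, 64].
// Written with a right shift so that n == 64 never shifts by 64.
inline std::uint64_t low_bits(unsigned n) {
    assert(n <= 64);
    return n == 0 ? 0 : ~std::uint64_t{0} >> (64 - n);
}
```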

#

https://github.com/official-stockfish/Stockfish/actions/runs/19263787581/job/55074630761 also fixed armv7-neon!

#

so only CI failure now is matetrack (and include what you need which should be trivial to fix)

lofty cedar Nov 11, 2025, 11:41 AM

#

It looks like you're on a zero to one of the valuable contributors speedrun.

prime mica Nov 11, 2025, 11:41 AM

#

meh

#

need to beat shash

rocky vigil Nov 11, 2025, 11:45 AM

#

cool cool

#

exciting

lofty cedar Nov 11, 2025, 11:47 AM

#

Now, do we wait a bit longer for things to settle? When do we launch a search tune?

NNUE weight tune wouldn't be done until we've exhausted the training options, but search params are not trainable anyway, so they can be tuned.

rocky vigil Nov 11, 2025, 11:47 AM

#

prime mica https://github.com/official-stockfish/Stockfish/actions/runs/19263787581/job/550...

ah can you update your PR with this

prime mica Nov 11, 2025, 11:48 AM

#

which PR

#

oh to your thing? yeah sure

#

am just debugging matetrack rn

rocky vigil Nov 11, 2025, 11:48 AM

#

yeah

#

matetrack will take longer

prime mica Nov 11, 2025, 11:48 AM

#

true

#

ok one sec

rocky vigil Nov 11, 2025, 11:48 AM

#

i have no idea how we have issue, we didn't touch tb code...

prime mica Nov 11, 2025, 11:49 AM

#

probably memory corruption

#

https://github.com/sscg13/Stockfish/pull/8

GitHub

GitHub ci by anematode · Pull Request #8 · sscg13/Stockfish

plz

rocky vigil Nov 11, 2025, 11:50 AM

#

ok

#

merged

#

now CI should officially work

#

except for IWYU / matetrack

prime mica Nov 11, 2025, 11:53 AM

#

huzzah

#

ok lldb time

#

ah, e.pieceCount is 5 for some reason

#

1
1
0
Piece count:5

rocky vigil Nov 11, 2025, 11:58 AM

#

????

#

oh

#

bruh

#

yeah i know the crash occurred

#

after two captures on h1

prime mica Nov 11, 2025, 12:00 PM

#

so I guess we're not decrementing pieceCount properly

#

maybe swap_piece?

#

but what's weird is the pos_is_ok check is passing...

#

lemme verify by calling pos_is_ok in tbprobe

rocky vigil Nov 11, 2025, 12:02 PM

#

e is passed in this code

#

*entry = e

#

I think

#

so maybe the material key is off?

prime mica Nov 11, 2025, 12:03 PM

#

oh bc fricking Fast = true

#

    constexpr bool Fast = true;  // Quick (default) or full check?

ok this is aura

prime mica Nov 11, 2025, 12:04 PM

#

rocky vigil so maybe the material key is off?

maybe

#

that would make more sense actually

#

also explains why even with Fast = false there's no pos_is_ok fails

#

I don't rly understand how materialKey works tho ngl

#

maybe something here?

rocky vigil Nov 11, 2025, 12:06 PM

#

oh

#

promotion

#

that might be a culprit

#

bc of h1=bishop underpromo

prime mica Nov 11, 2025, 12:06 PM

#

hm

#

swap_piece doesn't modify its arguments though

rocky vigil Nov 11, 2025, 12:07 PM

#

actually idk

#

materialKey is not changed in the code

#

hmm

prime mica Nov 11, 2025, 12:08 PM

#

fk

#

https://github.com/rn5f107s2/Stockfish/compare/293d3a673f8a7cc0983d48feb9b202f4286e9985...6a047d92f5ae491c285772569ad3b1af6bd60d87


rocky vigil Nov 11, 2025, 12:08 PM

#

oh did you use git bisect

prime mica Nov 11, 2025, 12:08 PM

#

no but lemme try reverting this

#

ok that fixes the TB crash

#

so there's some subtle issue here

#

help me find it?

#

evidently it's somewhere which changes how materialKey is computed...

rocky vigil Nov 11, 2025, 12:14 PM

#

huh

#

must be indirectly

#

bc everywhere materialKey is directly updated is the same

prime mica Nov 11, 2025, 12:15 PM

#

right

#

how is en passant handled here

rocky vigil Nov 11, 2025, 12:16 PM

#

aha

#

this line

#

swap doesn't update the material key

#

when it should

prime mica Nov 11, 2025, 12:16 PM

#

ohhhh

rocky vigil Nov 11, 2025, 12:16 PM

#

wait

#

maybe

#

not

#

idk

prime mica Nov 11, 2025, 12:17 PM

#

lol

rocky vigil Nov 11, 2025, 12:17 PM

#

i think there is something sus

#

about calling swap though

#

without the material key updates

prime mica Nov 11, 2025, 12:17 PM

#

sure

rocky vigil Nov 11, 2025, 12:17 PM

#

actually I might be trolling

#

nvm

#

this is really strange

stray reef Nov 11, 2025, 12:18 PM

#

put_piece and remove_piece don't update the material key either tho

rocky vigil Nov 11, 2025, 12:18 PM

#

maybe try printing material keys before and after revert

#

that might help us understand where they diverge

prime mica Nov 11, 2025, 12:19 PM

#

agree

#

brb

#

https://github.com/official-stockfish/Stockfish/pull/6408 you can also try this to make sure I'm not trolling

GitHub

Does this fix matetrack by anematode · Pull Request #6408 · offic...

#

might have made a mistake

rocky vigil Nov 11, 2025, 12:20 PM

#

that has same bench right

#

just wait for CI to run ig

rocky vigil Nov 11, 2025, 12:24 PM

#

prime mica https://github.com/official-stockfish/Stockfish/pull/6408 you can also try this ...

this indeed fixes matetrack

#

like ??

#

move_blunder is going on here

prime mica Nov 11, 2025, 12:24 PM

#

we're so close bro

#

I'm trying to step through the code in my head for every class of move lol

#

I think ur right that it might have to do with (under)promotion that's also a capture...

rocky vigil Nov 11, 2025, 12:25 PM

#

it's just underpromo

#

i think

#

no underpromo capture

#

though that would probably be even worse

prime mica Nov 11, 2025, 12:25 PM

#

oh ok

rocky vigil Nov 11, 2025, 12:26 PM

#

but it is an underpromo followed by 2 captures

prime mica Nov 11, 2025, 12:27 PM

#

wait

#

#

I think pieceCount[captured] is not updated yet at line 785

#

if it's not an ep

#

(This is the fix diff)

#

it has to be that right

rocky vigil Nov 11, 2025, 12:29 PM

#

ohhh

prime mica Nov 11, 2025, 12:29 PM

#

testing

#

YES

#

it works
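
The ordering issue described above can be reproduced in a toy model. This is a sketch under assumptions, not Stockfish's code: it assumes the material key XORs one key per (piece, count slot), as the `pieceCount[captured]` remark suggests, and every name here (`Material`, `zobrist`, `capture_stale`) is hypothetical.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

constexpr int PIECES = 4;   // toy number of piece kinds
constexpr int SLOTS  = 16;  // toy maximum count per kind

// Stand-in for a random key table: a splitmix64-style mixer, which maps
// distinct (piece, slot) pairs to distinct keys.
inline std::uint64_t zobrist(int piece, int slot) {
    std::uint64_t z = (std::uint64_t(piece) * SLOTS + slot + 1) * 0x9E3779B97F4A7C15ULL;
    z = (z ^ (z >> 30)) * 0xBF58476D1CE4E5B9ULL;
    z = (z ^ (z >> 27)) * 0x94D049BB133111EBULL;
    return z ^ (z >> 31);
}

struct Material {
    std::array<int, PIECES> pieceCount{};
    std::uint64_t key = 0;

    // Reference definition: one key per (piece, slot) for slots 0..count-1.
    std::uint64_t from_scratch() const {
        std::uint64_t k = 0;
        for (int p = 0; p < PIECES; ++p)
            for (int c = 0; c < pieceCount[p]; ++c)
                k ^= zobrist(p, c);
        return k;
    }

    void add(int p) { key ^= zobrist(p, pieceCount[p]++); }

    // Correct order: decrement first, then XOR out the slot that vanished.
    void capture_ok(int p) { key ^= zobrist(p, --pieceCount[p]); }

    // Stale order: the count is read before it is decremented, so the wrong
    // slot is XORed and the key drifts away from from_scratch().
    void capture_stale(int p) {
        key ^= zobrist(p, pieceCount[p]);
        --pieceCount[p];
    }
};
```

Both capture variants leave the same piece counts, so any check that only looks at the counts passes; only a comparison against the recomputed key exposes the stale version.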

rocky vigil Nov 11, 2025, 12:32 PM

#

ah

#

ok

prime mica Nov 11, 2025, 12:32 PM

#

https://github.com/official-stockfish/Stockfish/pull/6408/commits/cdd887bbb9716c5e81c6b2cdc23364119730e454

GitHub

Does this fix matetrack by anematode · Pull Request #6408 · offic...

#

this is inelegant tho

rocky vigil Nov 11, 2025, 12:32 PM

#

so it was a capture

#

and not underpromo

#

that was the issue

prime mica Nov 11, 2025, 12:33 PM

#

fun times

#

ok now let's fix IWYU

#

maybe in a later PR I'll add a check that materialKey is correct to pos_is_ok...

#

didn't realize TB depended on its accuracy

rocky vigil Nov 11, 2025, 12:38 PM

#

yeah compute it from scratch ig

#

and verify

prime mica Nov 11, 2025, 12:39 PM

#

yessir

rocky vigil Nov 11, 2025, 12:39 PM

#

anyways for now if this works I'll just accept it

#

and hopefully IWYU won't take so long

prime mica Nov 11, 2025, 12:40 PM

#

@regal steeple tribunal for u

#

jk much love

twilit oriole Nov 11, 2025, 12:46 PM

#

@rocky vigil u can do this also now the pr is opened. download the file and then edit PR description and there is attach file button there

prime mica Nov 11, 2025, 12:48 PM

#

how close are we on the cleanup/code quality side?

rocky vigil Nov 11, 2025, 12:48 PM

#

twilit oriole @rocky vigil u can do this also now the pr is opened. download the file...

edited

prime mica Nov 11, 2025, 12:48 PM

#

https://github.com/sscg13/Stockfish/pull/9

GitHub

fix CI by anematode · Pull Request #9 · sscg13/Stockfish

rocky vigil Nov 11, 2025, 12:49 PM

#

btw I'm just gonna download the pr doc

#

the "everything" section

#

unless ppl still have stuff to add to it

prime mica Nov 11, 2025, 12:54 PM

#

sounds good to me

#

can edit in the future if necessary...

rocky vigil Nov 11, 2025, 12:54 PM

#

yeah ok

#

so now the PR message looks half decent

#

we can complete it once everything else is done

prime mica Nov 11, 2025, 12:54 PM

#

great

prime mica Nov 11, 2025, 12:55 PM

#

prime mica how close are we on the cleanup/code quality side?

?

lucid grove Nov 11, 2025, 12:55 PM

#

Given that we have been extremely lucky that matetrack caught the bug, is there a case for including a specific test to the ci that would catch the original bug?

rocky vigil Nov 11, 2025, 12:55 PM

#

prime mica maybe in a later PR I'll add a check that `materialKey` is correct to `pos_is_ok...

.

prime mica Nov 11, 2025, 12:55 PM

#

materialKey is only used for tablebases fwiw

#

so it makes sense that we never detected this until now...

rocky vigil Nov 11, 2025, 12:56 PM

#

yeah but it would've been really sad if matetrack didn't catch it

prime mica Nov 11, 2025, 12:56 PM

#

also additional debug checks in tbprobe.cpp make sense to me

rocky vigil Nov 11, 2025, 12:56 PM

#

and then we got had in a tcec game...

prime mica Nov 11, 2025, 12:56 PM

#

haha yeah

#

cucked by materialKey

#

we can do more extensive testing with TBs maybe

twilit oriole Nov 11, 2025, 12:57 PM

#

use pdf for the attachment

prime mica Nov 11, 2025, 12:57 PM

#

bruh is my clang-format outdated

rocky vigil Nov 11, 2025, 12:58 PM

#

bruh

rocky vigil Nov 11, 2025, 12:58 PM

#

twilit oriole use pdf for the attachment

ok

prime mica Nov 11, 2025, 12:58 PM

#

sorry lol

twilit oriole Nov 11, 2025, 12:58 PM

#

lol

prime mica Nov 11, 2025, 1:07 PM

#

hm the other problem is that even if we add the materialKey check to pos_is_ok, the checks in that function are hidden by default because of their enormous runtime cost

#

so likely no one would have found it during testing...
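
The trade-off here can be sketched as follows: a key check that lives only in the full (non-`Fast`) path catches a mismatch, but stays silent in the default configuration. This is an illustrative toy, not the real `pos_is_ok`; `ToyPos`, `recompute_key` and `toy_pos_is_ok` are hypothetical names and the hash is a simple stand-in, not a Zobrist key.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a position: piece counts plus a cached key.
struct ToyPos {
    std::array<int, 4> pieceCount{};
    std::uint64_t materialKey = 0;  // incrementally maintained in real code
};

// Toy from-scratch key: a polynomial hash of the counts.
inline std::uint64_t recompute_key(const ToyPos& p) {
    std::uint64_t k = 0;
    for (int c : p.pieceCount)
        k = k * 31 + std::uint64_t(c);
    return k;
}

// With Fast = true only the cheap test runs; the key comparison is skipped.
template<bool Fast>
bool toy_pos_is_ok(const ToyPos& p) {
    for (int c : p.pieceCount)
        if (c < 0)
            return false;  // cheap sanity check, always on
    if constexpr (Fast)
        return true;
    else
        return p.materialKey == recompute_key(p);  // expensive check
}
```

With a corrupted key, `toy_pos_is_ok<true>` still returns `true`, which matches the concern that a check hidden behind the slow path would rarely run during normal testing.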

#

@lucid grove any ideas?

#

I guess asserts in tbprobe are enough given that tablebases are used in CI...

#

it would be nice to have 6-men checked in CI but eh

twilit oriole Nov 11, 2025, 1:12 PM

#

👀

lucid grove Nov 11, 2025, 1:17 PM

#

Would need to take a closer look. Naively any test that would discover the mismatch is fine. Doesn't necessarily need tb for it iiuc

#

I think assert in tb probe is a possibility. But apart from matetrack no other ci uses TB atm iirc

prime mica Nov 11, 2025, 1:18 PM

#

matetrack runs on every SF PR

lucid grove Nov 11, 2025, 1:19 PM

#

But we were lucky it triggered, right? It's not a given that it would trigger for any mismatch.

prime mica Nov 11, 2025, 1:20 PM

#

If we add an assert in probe it should catch any mismatch

lucid grove Nov 11, 2025, 1:20 PM

#

Agreed

prime mica Nov 11, 2025, 1:20 PM

#

Honestly I’m not sure why such an assert didn’t previously exist

#

Passing the wrong position to a TB seems easy to do lol

#

Skill issue ig

lucid grove Nov 11, 2025, 1:21 PM

#

But even with assert, it would only trigger if search comes up with a position that has capture promotion or whatever triggered this bug

prime mica Nov 11, 2025, 1:22 PM

#

The bug was actually just a capture

#

any non en passant capture was buggy

#

but yeah an extremely subtle bug could escape detection

#

fuzzing with TBs could be nice

#

but I think only when updating do_move and such functions

#

yay CI passed

#

@rocky vigil I think you can hold off on merging the speedup(s)

#

That way the upcoming non regression test is more accurate…

rocky vigil Nov 11, 2025, 1:25 PM

#

prime mica yay CI passed

gg

violet badger Nov 11, 2025, 1:48 PM

#

lucid grove I think assert in tb probe is a possibility. But apart from matetrack no other c...

I think there is another one, but it is fairly rare bug to trigger.

lucid grove Nov 11, 2025, 1:48 PM

#

rare ones are the hardest to debug 😉

violet badger Nov 11, 2025, 1:48 PM

#

cool, great fixes, btw!

#

sure, but also hard to exhaustively cover.

#

But I'm all for adding some more tests.

#

Now, for completeness, our tests did catch it, which isn't bad at all.

rocky vigil Nov 11, 2025, 2:00 PM

#

prime mica yay CI passed

yeah so now wait for shawn to get back to us on nnue-pytorch

#

do maintainers (vondele, disservin) have any comments on PR

green moat Nov 11, 2025, 2:06 PM

#

I think Yoshie2000 said he would have read the whole diff.....
pauseChamp

stray reef Nov 11, 2025, 2:06 PM

#

had to actually listen to the lecture unfortunately

prime mica Nov 11, 2025, 2:06 PM

#

mfw you're actually learning in school

rocky vigil Nov 11, 2025, 2:07 PM

#

stray reef had to actually listen to the lecture unfortunately

yeah if I don't plan on paying attention to lecture I typically just don't go

prime mica Nov 11, 2025, 2:08 PM

#

stray reef had to actually listen to the lecture unfortunately

what are you studying :o

stray reef Nov 11, 2025, 2:08 PM

#

compsci masters, the lecture was on statistical signal processing

prime mica Nov 11, 2025, 2:09 PM

#

super cool :)

#

https://github.com/official-stockfish/Stockfish/pull/6410

GitHub

check for material key validity in tbprobe by anematode · Pull Req...

During work on threat inputs, there was a recent hard-to-debug crash in tbprobe caused by incorrectly computing st->materialKey. This PR adds correctness checking to pos_is_ok and a faster c...

#

(not sure if this is ideal, but here's something)

violet badger Nov 11, 2025, 2:26 PM

#

@rocky vigil can you add to the commit message also that the net is the result of the following training recipe https://github.com/vondele/nettest/blob/7de71238e9b295e3f88ed7c9c5936af632c9b981/threats.yaml

rocky vigil Nov 11, 2025, 2:26 PM

#

ok

violet badger Nov 11, 2025, 2:26 PM

#

That documents exactly what needs to be done to 'reproduce'

prime mica Nov 11, 2025, 2:26 PM

#

when the net is reproducible 😍

violet badger Nov 11, 2025, 2:27 PM

#

unfortunately pytorch training is not bitwise-reproducible, but executing this recipe enough times should yield a net of similar strength. In practice the variation is actually fairly small.

rocky vigil Nov 11, 2025, 2:29 PM

#

comment modified

#

to include that information

lofty cedar Nov 11, 2025, 2:29 PM

#

Bitwise-reproducibility is probably not possible on a gpu with floating point unit.

#

Because GPUs might operate in different orders affecting the rounding.

prime mica Nov 11, 2025, 2:30 PM

#

demented

rocky vigil Nov 11, 2025, 2:30 PM

#

there is a simpler reason, and that would be the random skipping of positions

#

(and presumably, the shuffling)

violet badger Nov 11, 2025, 2:31 PM

#

which is of course pseudo-random..

formal smelt Nov 11, 2025, 2:31 PM

#

the core issue is use of atomics in FT backprop
i think pretty much everything else could be resolved (e.g. by fixing seeds)
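
The rounding-order point can be shown in isolation: floating-point addition is not associative, so a reduction whose order is decided at run time (for example by atomic adds) can give different bits for the same inputs. A minimal C++ sketch on the CPU, standing in for the GPU case; `sum_in_order` is a hypothetical helper.

```cpp
#include <cassert>
#include <vector>

// Sum in exactly the given order, in single precision.
inline float sum_in_order(const std::vector<float>& xs) {
    float acc = 0.0f;
    for (float x : xs)
        acc += x;
    return acc;
}
```

For instance, summing `{1e20f, -1e20f, 1.0f}` gives `1.0f`, while the reordered `{1e20f, 1.0f, -1e20f}` gives `0.0f`, because the `1.0f` is absorbed when added to `1e20f` first.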

lofty cedar Nov 11, 2025, 2:31 PM

#

Nah... randoms aren't truly random.

#

Seeds can be fixed.
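
As a small illustration of the fixed-seed idea: with the same seed, a pseudo-random shuffle yields the same order on every run. This is a C++ analogue only (the training code is PyTorch), and `shuffled_indices` is a hypothetical helper.

```cpp
#include <algorithm>
#include <cassert>
#include <numeric>
#include <random>
#include <vector>

// Shuffle the indices 0..n-1 with a generator seeded by `seed`.
inline std::vector<int> shuffled_indices(int n, unsigned seed) {
    std::vector<int> idx(n);
    std::iota(idx.begin(), idx.end(), 0);
    std::mt19937 rng(seed);
    std::shuffle(idx.begin(), idx.end(), rng);
    return idx;
}
```

Two calls with the same seed return identical orders within one toolchain. The exact permutation produced by `std::shuffle` may differ between standard library implementations, so this kind of reproducibility is per build environment.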

rocky vigil Nov 11, 2025, 2:32 PM

#

oh

lofty cedar Nov 11, 2025, 2:32 PM

#

Another thing moving forward is... do we let people submit net training runs?

rocky vigil Nov 11, 2025, 2:32 PM

#

making the position order deterministic is strange to me but in principle it could be done

rocky vigil Nov 11, 2025, 2:33 PM

#

lofty cedar Another thing moving forward is... do we let people submit net training runs?

i think this has already been open for a while?

lofty cedar Nov 11, 2025, 2:33 PM

#

rocky vigil making the position order deterministic is strange to me but in principle it cou...

It doesn't actually look deterministic. It just means that the rng seed is fixed.

formal smelt Nov 11, 2025, 2:33 PM

#

rocky vigil making the position order deterministic is strange to me but in principle it cou...

as stated it still doesn't matter :p

lofty cedar Nov 11, 2025, 2:33 PM

#

rocky vigil i think this has already been open for a while?

I mean that the training code is open, but the resource needed to train is not available for everyone.

#

Since training ideas only take like a few days or so, we should be able to test at least a hundred training ideas in a year.

rocky vigil Nov 11, 2025, 2:34 PM

#

the ability to PR to nettest has also been open for a while

lofty cedar Nov 11, 2025, 2:34 PM

#

Really?

#

What about making a run?

lofty cedar Nov 11, 2025, 2:37 PM

#

rocky vigil the ability to PR to nettest has also been open for a while

I know you can PR or run such a thing... but many don't have the resource to run.

prime mica Nov 11, 2025, 2:37 PM

#

seems a bit tricky w/o volunteer resources for GPUs

rocky vigil Nov 11, 2025, 2:37 PM

#

i thought if you made a PR to nettest then a run for it could have been set up on gitlab pipelines

#

there is at least support for 4 concurrent runs

violet badger Nov 11, 2025, 2:38 PM

#

so, ideally we get DDP to work, would reduce the cost ....

prime mica Nov 11, 2025, 2:39 PM

#

DDP?

violet badger Nov 11, 2025, 2:39 PM

#

distributed data parallelism in our nnue-pytorch.

#

right now doesn't seem to work.

prime mica Nov 11, 2025, 2:39 PM

#

oh I see

violet badger Nov 11, 2025, 2:39 PM

#

some bug somewhere.

lofty cedar Nov 11, 2025, 2:39 PM

#

prime mica seems a bit tricky w/o volunteer resources for GPUs

There could be. There are about 2000 CPUs on fishtest. There could certainly be a few GPUs.

prime mica Nov 11, 2025, 2:39 PM

#

lofty cedar There could be. There are about 2000 CPUs on fishtest. There could certainly be ...

maybe... but then you need people to download many gigabytes of data

violet badger Nov 11, 2025, 2:39 PM

#

https://github.com/official-stockfish/nnue-pytorch/pull/352

prime mica Nov 11, 2025, 2:40 PM

#

etc. etc., it's a lot more requirements than fishtest...

rocky vigil Nov 11, 2025, 2:40 PM

#

many gigabytes is an understatement

violet badger Nov 11, 2025, 2:40 PM

#

about 1TB

prime mica Nov 11, 2025, 2:40 PM

#

lololol

#

would just barely fit in RAM ;)

violet badger Nov 11, 2025, 2:40 PM

#

some people's RAM ...

prime mica Nov 11, 2025, 2:41 PM

#

prime mica https://github.com/official-stockfish/Stockfish/pull/6410

vondele any thoughts on this?

violet badger Nov 11, 2025, 2:42 PM

#

seems reasonable.

#

I've also asked co-pilot for a review, only consider what is reasonable...

#

Stockfish threat inputs PR summary.pdf I think some of anematode's speedups could be mentioned?

#

once the PR is in final stage, I'd like it to be squashed, with the commit message containing the appropriate co-author: ... designations and also the PR message as commit comment... please.

lofty cedar Nov 11, 2025, 2:54 PM

#

Ummn what do we do after the cleanup, after the merge? Do we launch a VVLTC search tune first?

violet badger Nov 11, 2025, 2:57 PM

#

have a medium-strength, cold beer, and enjoy the day?

green moat Nov 11, 2025, 2:57 PM

#

lofty cedar Ummn what do we do after the cleanup, after the merge? Do we launch a VVLTC sear...

Some speedups could already be applied, no? Then we have to test Stage 4-5 nets with L2=31, then quantize the smallnet.....

prime mica Nov 11, 2025, 2:58 PM

#

@long quest https://tests.stockfishchess.org/tests/live_elo/691343297ca8781852331452 lol this is genius, I think you can give this simplification bounds...

#

although hm let's just let it run as is

green moat Nov 11, 2025, 2:59 PM

#

still PLENTY (pun intended 😛 ) things to do....

rocky vigil Nov 11, 2025, 3:18 PM

#

violet badger once the PR is in final stage, I'd like it to be squashed, with the commit messa...

what exactly do you mean by this sorry?

violet badger Nov 11, 2025, 3:22 PM

#

so, all PRs are usually squashed into one commit, easiest for maintainers if this is done by the author. By adding to the commit message something like

    Co-authored-by: sscg13 <[email protected]>
    Co-authored-by: Timothy Herchen <[email protected]>

Also a single commit can be attributed to multiple authors.

rocky vigil Nov 11, 2025, 3:22 PM

#

ok cool

#

once everything is done I can squash it into a new PR

prime mica Nov 11, 2025, 3:29 PM

#

one kinda goofy idea I've been mulling over

#

to try to address the x86 pain

#

is to have an i16 expanded version of the threat weights as well... then use that for features which are being very frequently used

#

it would be quite complex to track tho

#

and the tracking would have to be extremely fast

#

but I'll probably try it at some point... if the distribution of threat feature incremental updates is good then it could be a significant speedup
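
A scalar sketch of the idea, to make the trade-off concrete. It assumes the threat weights are stored as i8 and the accumulator as i16, which the chat implies but does not state outright; `kL1`, `add_row_i8`, `expand` and `add_row_i16` are hypothetical names, and the tracking of which rows are "hot" is left out entirely.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

constexpr int kL1 = 1024;  // L1 size mentioned in the chat
using RowI8  = std::array<std::int8_t, kL1>;
using RowI16 = std::array<std::int16_t, kL1>;
using Acc    = std::array<std::int16_t, kL1>;

// Baseline: every update sign-extends each i8 weight before adding.
inline void add_row_i8(Acc& acc, const RowI8& w) {
    for (int i = 0; i < kL1; ++i)
        acc[i] = std::int16_t(acc[i] + w[i]);
}

// One-time expansion of a frequently used row to i16.
inline RowI16 expand(const RowI8& w) {
    RowI16 out{};
    for (int i = 0; i < kL1; ++i)
        out[i] = w[i];
    return out;
}

// Path for cached rows: plain i16 add, no widening step in the loop.
inline void add_row_i16(Acc& acc, const RowI16& w) {
    for (int i = 0; i < kL1; ++i)
        acc[i] = std::int16_t(acc[i] + w[i]);
}
```

Both paths must produce identical accumulators; the expanded copy costs twice the memory per cached row, which is why it would only pay off for rows that are updated very often.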

#

also somehow I didn't realize that the L1 size is now 1024...

#

that changes the calculus on add/sub honestly

#

loop overhead and index generation will matter more

rocky vigil Nov 11, 2025, 3:35 PM

#

oh yeah talk about claude comments

#

I do have a determination that 64 is an upper bound

#

and I know why 32 is insufficient

#

and I suspect 48 is sufficient

prime mica Nov 11, 2025, 3:36 PM

#

once snowy-egret is merged there will be no perf difference

#

but yeah maybe nice to have a justification about the size

rocky vigil Nov 11, 2025, 3:36 PM

#

here the move Qxe3 modifies 48 threats

prime mica Nov 11, 2025, 3:36 PM

#

rocky vigil here the move Qxe3 modifies 48 threats

hate when this happens

rocky vigil Nov 11, 2025, 3:37 PM

#

wait shoot

#

why doesn't 32 work

#

aren't added/removed separate

prime mica Nov 11, 2025, 3:37 PM

#

no

#

we have a boolean add

rocky vigil Nov 11, 2025, 3:37 PM

#

ohhhhh

#

ok

#

yeah then 48

#

16 * 3 pieces

prime mica Nov 11, 2025, 3:37 PM

#

that makes sense

rocky vigil Nov 11, 2025, 3:37 PM

#

the only way to involve 4 pieces in a move is castling

#

but I strongly suspect you cannot hit anywhere near 48 with castling

prime mica Nov 11, 2025, 3:38 PM

#

right

#

need a queen for maximal enjoyment

rocky vigil Nov 11, 2025, 3:38 PM

#

oh actually it's easy

#

9 * 4 is a trivial upper bound for castling
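
The arithmetic from this exchange, written down as compile-time checks. The per-piece figures (16 and 9) and the piece counts (3 and 4) are taken from the messages as-is; their derivation is not reproduced here, and the function names are hypothetical. Note that 48 as a sufficient capacity is stated in the chat as a suspicion, not a proof.

```cpp
#include <cassert>

// "16 * 3 pieces": bound quoted for a non-castling move.
constexpr int normal_move_bound() { return 16 * 3; }

// "9 * 4 is a trivial upper bound for castling".
constexpr int castling_bound() { return 9 * 4; }

// The list must hold the larger of the two cases.
constexpr int required_capacity() {
    return normal_move_bound() > castling_bound() ? normal_move_bound()
                                                  : castling_bound();
}

constexpr bool fits(int capacity) { return capacity >= required_capacity(); }

// 64 is a safe upper bound, 48 matches the quoted bound, 32 is too small.
static_assert(fits(64) && fits(48) && !fits(32), "capacity check");
```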

prime mica Nov 11, 2025, 3:39 PM

#

you'd make a good mathematician

rocky vigil Nov 11, 2025, 3:39 PM

#

~~did I mention I'm supposed to be a math major~~

prime mica Nov 11, 2025, 3:39 PM

#

lol

#

I have this one math friend who always calls things trivial

rocky vigil Nov 11, 2025, 3:39 PM

#

sounds about right

prime mica Nov 11, 2025, 3:39 PM

#

but then it requires 15 minutes of explanation covering two chalkboards to explain why it's trivial

rocky vigil Nov 11, 2025, 3:40 PM

#

"trivial" means "i could come up with the reason on the spot"

prime mica Nov 11, 2025, 3:40 PM

#

ok well we can try 48 then...

#

maybe in a future PR for safety? unless you're 100% confident

rocky vigil Nov 11, 2025, 3:40 PM

#

yeah

#

I'll put the argument down tho

prime mica Nov 11, 2025, 3:40 PM

#

great

rocky vigil Nov 11, 2025, 3:40 PM

#

to silence claude

prime mica Nov 11, 2025, 3:41 PM

#

🤐

#

also maybe you saw but I made a small PR to clean up some redundant stuff in full_threats.cpp

rocky vigil Nov 11, 2025, 3:44 PM

#

ah ok

#

lemme get to it

violet badger Nov 11, 2025, 3:47 PM

#

in this context https://lichess.org/@/Tobs40/blog/why-a-reachable-position-can-have-at-most-218-playable-moves/a5xdxeqs is a beautiful blog post..

prime mica Nov 11, 2025, 3:47 PM

#

yess

#

let's write a Coq proof that at most 48 threats are changed

violet badger Nov 11, 2025, 3:48 PM

#

https://github.com/Tobs40/chess218 as starting point

prime mica Nov 11, 2025, 3:48 PM

#

Gurobi

#

one thing I just realized is that https://tests.stockfishchess.org/tests/live_elo/690e99ecec1d00d2c195c391 probably only helps because we memcpy the whole thing...

#

so we can probably simp it out after snowy-egret

violet badger Nov 11, 2025, 3:49 PM

#

I actually think there was another blog post on that... maybe there are two.

prime mica Nov 11, 2025, 3:50 PM

#

O where

violet badger Nov 11, 2025, 3:54 PM

#

nah, probably that one. Even though I thought it appeared on lichess recently. Seemingly is already a bit older

dark stream Nov 11, 2025, 3:57 PM

#

So, basically everyone is waiting for shawn_xu to do his change to nnue-pytorch, right?

"Remove unused code (Legacy PSQ stuff in Full_Threats.h looking at you)" : this is one of the checklist items. Was this done?

rocky vigil Nov 11, 2025, 3:57 PM

#

this is considered done when everyone agrees it is

#

although the thing I specifically called out indeed has been removed

prime mica Nov 11, 2025, 3:58 PM

#

I didn’t see anything unused…

dark stream Nov 11, 2025, 3:59 PM

#

So, do you think there is any possible regression from this cleanup?

rocky vigil Nov 11, 2025, 4:00 PM

#

no

#

but just to be sure

#

(it's maintainer decision)

#

(as a stc nonreg will take a while)

violet badger Nov 11, 2025, 4:00 PM

#

LTC might be the shortcut 🙂

#

STC filtering has been done.

#

and yeah, I think we should do a final LTC run, maybe after the squash.

rocky vigil Nov 11, 2025, 4:01 PM

#

oh so like do a second LTC test after squashing?

violet badger Nov 11, 2025, 4:01 PM

#

yeah.

rocky vigil Nov 11, 2025, 4:02 PM

#

ok

#

in that case i guess we still welcome more speedups

#

as they pass fishtest

violet badger Nov 11, 2025, 4:02 PM

#

nah..

rocky vigil Nov 11, 2025, 4:02 PM

#

oh

violet badger Nov 11, 2025, 4:02 PM

#

let's move forward as is.

rocky vigil Nov 11, 2025, 4:02 PM

#

ok

prime mica Nov 11, 2025, 4:02 PM

#

Agree

rocky vigil Nov 11, 2025, 4:03 PM

#

do we continue waiting on shawn to do nnue-pytorch work or do the merges asynchronously

violet badger Nov 11, 2025, 4:04 PM

#

can be asynchronous.. in the end the net generation was with a specific sha which we know where to find. Obviously, I'd like to move forward with integrating that in the main branch, but I'm confident we'll do that.

rocky vigil Nov 11, 2025, 4:49 PM

#

prime mica I didn’t see anything unused…

Yeah if no one else has an opinion I’ll set up a new LTC soon

regal steeple Nov 11, 2025, 4:56 PM

#

I just noticed that I forgot to remove some things from the FusedUpdateData struct, dp2from and dp2fromBoard can be removed I think

prime mica Nov 11, 2025, 4:56 PM

#

@long quest

base (...stockfish.ti) =    1515728  +/- 1787
test (./stockfish    ) =    1542716  +/- 1503
diff                   =     +26988  +/- 2310

speedup        = +0.0178
P(speedup > 0) =  1.0000

#

your patch works extremely well locally

rocky vigil Nov 11, 2025, 5:12 PM

#

regal steeple I just noticed that I forgot to remove some things from the FusedUpdateData stru...

ah you can pr this (to my branch) if you want

#

or let me know what to remove

prime mica Nov 11, 2025, 5:15 PM

#

   2dd15:   41 31 d8                xor    r8d,ebx
   2dd18:   31 de                   xor    esi,ebx
   2dd1a:   41 c1 ec 1c             shr    r12d,0x1c
   2dd1e:   49 0f be d1             movsx  rdx,r9b
   2dd22:   49 39 f0                cmp    r8,rsi
   2dd25:   40 0f 92 c5             setb   bpl
   2dd29:   49 89 d1                mov    r9,rdx
   2dd2c:   c1 e8 10                shr    eax,0x10
   2dd2f:   49 c1 e1 04             shl    r9,0x4
   2dd33:   83 e0 0f                and    eax,0xf
   2dd36:   48 c1 e2 06             shl    rdx,0x6
   2dd3a:   4c 01 c8                add    rax,r9
   2dd3d:   48 8d 6c 45 00          lea    rbp,[rbp+rax*2+0x0]
   2dd42:   4a 8d 04 02             lea    rax,[rdx+r8*1]
   2dd46:   48 89 c2                mov    rdx,rax
   2dd49:   45 8b 0c af             mov    r9d,DWORD PTR [r15+rbp*4]
   2dd4d:   48 c1 e2 06             shl    rdx,0x6
   2dd51:   4c 01 ea                add    rdx,r13
   2dd54:   44 0f b6 04 32          movzx  r8d,BYTE PTR [rdx+rsi*1]
   2dd59:   45 03 04 86             add    r8d,DWORD PTR [r14+rax*4]
   2dd5d:   45 01 c1                add    r9d,r8d
   2dd60:   41 81 f9 ef 37 01 00    cmp    r9d,0x137ef
   2dd67:   77 22                   ja     2dd8b

assembly after also applying bald-eagle, they seem to synergize rly well

#

"small"-speedup my ass

#

Result of 100 runs
==================
base (...stockfish.ti) =    1515621  +/- 1655
test (./stockfish    ) =    1569287  +/- 2560
diff                   =     +53666  +/- 2930

speedup        = +0.0354
P(speedup > 0) =  1.0000

threat-inputs vs. small-speedup + bald-eagle
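For reference, the `speedup` line in both result blocks is consistent with the reported `diff` divided by the `base` figure. A minimal sketch of that arithmetic, assuming the base and test numbers are mean throughput figures (for example nodes per second); the helper name is hypothetical and not part of the benchmark script:

```cpp
#include <cassert>
#include <cmath>

// Relative speedup of a test build over a base build, given one mean
// throughput figure for each (assumed here to be nodes per second).
constexpr double relative_speedup(double base, double test) {
    return (test - base) / base;
}
```

With the numbers above, `relative_speedup(1515728, 1542716)` is about 0.0178 and `relative_speedup(1515621, 1569287)` is about 0.0354. The `+/-` columns are not reproduced here, since the log does not say how the script derives them.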

#

super exciting

regal steeple Nov 11, 2025, 5:20 PM

#

rocky vigil or let me know what to remove

I commented the lines on github

rocky vigil Nov 11, 2025, 5:38 PM

#

regal steeple I commented the lines on github

alright I took them out

rocky vigil Nov 11, 2025, 6:33 PM

#

if no new suggestions I'll make a new LTC soon

prime mica Nov 11, 2025, 6:36 PM

#

Go for it I think…

#

Maybe a maintainer should review first tho

#

idk

rocky vigil Nov 11, 2025, 6:40 PM

#

yeah I'll wait

#

for a bit

rocky vigil Nov 11, 2025, 7:02 PM

#

aight well I've made it

twilit oriole Nov 11, 2025, 7:06 PM

#

Is this including extra speedups or smth

#

Why not a non reg

#

@rocky vigil

rocky vigil Nov 11, 2025, 7:07 PM

#

vondele wanted a new ltc vs master

#

so a new ltc it shall be

#

it unironically might finish faster

twilit oriole Nov 11, 2025, 7:08 PM

#

Really? That seems not right, it's opening it up to the machine lottery again.

rocky vigil Nov 11, 2025, 7:08 PM

#

violet badger and yeah, I think we should do a final LTC run, maybe after the squash.

at least well

twilit oriole Nov 11, 2025, 7:08 PM

#

I'll approve but a non reg wouldn't have that issue

#

Yeah I'm not seeing the Vs master. Because I don't see the logic behind that

rocky vigil Nov 11, 2025, 7:09 PM

#

wait what do we want then

#

LTC against master again?

#

LTC nonreg against passed version?

twilit oriole Nov 11, 2025, 7:09 PM

#

A LTC non reg to make sure you didn't regress anything?

#

Vs what has already passed

rocky vigil Nov 11, 2025, 7:09 PM

#

ok

#

I can set that up

#

aight

#

yeah

#

let's do that

#

it might take like a day or so idk

prime mica Nov 11, 2025, 7:35 PM

#

am I crazy or is the github diff different than the diff on fishtest...

#

the one on github looks wrong

#

but the commit hashes look right

violet badger Nov 11, 2025, 8:33 PM

#

twilit oriole Really? That seems not right, it's opening it up to the machine lottery again.

no, it makes sense.. but well, I see something is already running.

#

Here's the result on the ARM nodes for an LTC test vs master, you might want to add that to the PR:

   # PLAYER    :  RATING  ERROR   POINTS  PLAYED   (%)
   1 patch     :    13.9    1.9  38296.5   73728    52
   2 master    :     0.0   ----  35431.5   73728    48
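As a rough cross-check of the table, a score fraction can be turned into an Elo difference with the plain logistic formula. This is only a sketch under that assumption: the rating tool that produced the table likely uses its own model, so the result is close to, but not the same as, the 13.9 shown. The function name is illustrative.

```cpp
#include <cassert>
#include <cmath>

// Logistic Elo difference implied by points scored over a number of games.
// Requires 0 < points < games.
double elo_from_score(double points, double games) {
    const double score = points / games;
    return -400.0 * std::log10(1.0 / score - 1.0);
}
```

For the patch row, `elo_from_score(38296.5, 73728)` comes out at roughly 13.5.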

twilit oriole Nov 11, 2025, 8:41 PM

#

violet badger no, it makes sense.. but well, I see something is already running.

How does it make sense to test for non reg by passing it yet again. There is a test made for that

violet badger Nov 11, 2025, 8:45 PM

#

it makes sense to test the final version of the patch by the usual standards...

#

it would also have saved resources..

twilit oriole Nov 11, 2025, 8:45 PM

#

But it does not tell you if it is actually non reg

violet badger Nov 11, 2025, 8:46 PM

#

no, it tells if it gains against master..

prime mica Nov 11, 2025, 8:46 PM

#

why does it matter if it regressed slightly

violet badger Nov 11, 2025, 8:46 PM

#

anyway,

prime mica Nov 11, 2025, 8:46 PM

#

will be fixed in the normal course of development

twilit oriole Nov 11, 2025, 8:46 PM

#

prime mica will be fixed in the normal course of development

Extend this argument to any patch

violet badger Nov 11, 2025, 8:47 PM

#

any patch that passes the normal fishtest development, is actually handled that way..

#

but it is not worth long discussion.

twilit oriole Nov 11, 2025, 8:47 PM

#

Anyways correct it is not a big deal. I just think a non reg is more appropriate here

violet badger Nov 11, 2025, 8:47 PM

#

let's instead bet on how long it takes till the first gainers pass on fishtest after merging..

#

My bet.. 1 day for code and .. 1 day for net?

twilit oriole Nov 11, 2025, 8:48 PM

#

I think you have some insider info Kappa

violet badger Nov 11, 2025, 8:48 PM

#

🙂

#

SEC going after me.

prime mica Nov 11, 2025, 8:49 PM

#

I thought Swiss banks were good for that

foggy wind Nov 11, 2025, 8:50 PM

#

There are already speedups ready that are not part of the PR, right?

violet badger Nov 11, 2025, 8:50 PM

#

I think so.

#

at least options and half passed tests.

foggy wind Nov 11, 2025, 8:51 PM

#

prime mica ``` Result of 100 runs ================== base (...stockfish.ti) = 1515621 +...

3.5% will pass STC quickly 😄

prisma hatchBOT Nov 11, 2025, 8:57 PM

#

k7/2n1n3/1nbNbN2/2NbRBN1/1nbRQR2/2NBRBN1/3N1N2/7K w - - 0 1

regal steeple Nov 11, 2025, 8:58 PM

#

This position with move e4d5 has 68 dirty threats if I didn't measure anything wrong

lapis parrot Nov 11, 2025, 8:58 PM

#

I think sf should just segfault there

regal steeple Nov 11, 2025, 8:58 PM

#

    assert(pos_is_ok());

    assert(dp.pc != NO_PIECE);
    assert(!(bool(captured) || m.type_of() == CASTLING) ^ (dp.remove_sq != SQ_NONE));
    assert(dp.from != SQ_NONE);
    assert(!(dp.add_sq != SQ_NONE) ^ (m.type_of() == PROMOTION || m.type_of() == CASTLING));

+   std::cout << dts.list.size() << std::endl;

    return {dp, dts};

Stockfish dev-20251109-dbc3dcc3 by the Stockfish developers (see AUTHORS file)
position fen k7/2n1n3/1nbNbN2/2NbRBN1/1nbRQR2/2NBRBN1/3N1N2/7K w - - 0 1 moves e4d5
68

lapis parrot Nov 11, 2025, 8:58 PM

#

so people wouldn't feed it some garbage

violet badger Nov 11, 2025, 8:59 PM

#

Is that relevant for this list?
using DirtyThreatList = ValueList<DirtyThreat, 64>;

regal steeple Nov 11, 2025, 8:59 PM

#

I think it should be

twilit oriole Nov 11, 2025, 8:59 PM

#

lapis parrot so people wouldn't feed it some garbage

It is a legal position I think

#

Oh actually it isn't

#

Because there aren't enough pawns to promote into all the extra pieces

violet badger Nov 11, 2025, 9:01 PM

#

Triggers an assert indeed:

stockfish: misc.h:138: void Stockfish::ValueList<T, MaxSize>::push_back(const T&) [with T = Stockfish::DirtyThreat; long unsigned int MaxSize = 64]: Assertion `size_ < MaxSize' failed.
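To illustrate the failure mode, here is a simplified stand-in for a fixed-capacity list with the same kind of check in `push_back`. It is a sketch only: `FixedList`, `ThreatStub` and `fill` are hypothetical names, and this is not the actual `ValueList` from `misc.h`.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Simplified fixed-capacity list; not the actual Stockfish class.
template<typename T, std::size_t MaxSize>
class FixedList {
   public:
    std::size_t size() const { return size_; }
    bool        full() const { return size_ == MaxSize; }

    void push_back(const T& value) {
        assert(size_ < MaxSize);  // same kind of check as the one that fired above
        values_[size_++] = value;
    }

   private:
    std::array<T, MaxSize> values_{};
    std::size_t            size_ = 0;
};

// Hypothetical placeholder element type, for illustration only.
struct ThreatStub {
    int from = 0;
    int to   = 0;
};

// Pushes up to `wanted` elements and returns how many actually fit.
template<std::size_t MaxSize>
std::size_t fill(FixedList<ThreatStub, MaxSize>& list, std::size_t wanted) {
    std::size_t pushed = 0;
    while (pushed < wanted && !list.full()) {
        list.push_back(ThreatStub{});
        ++pushed;
    }
    return pushed;
}
```

With a capacity of 64, only 64 of the 68 entries measured above fit. In a build where assertions are compiled out (`NDEBUG`), an unchecked 65th `push_back` would write past the end of the array instead of aborting.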

twilit oriole Nov 11, 2025, 9:02 PM

#

What we need to figure out is whether any legal position triggers it?

violet badger Nov 11, 2025, 9:02 PM

#

it is not too far from legal though..

prime mica Nov 11, 2025, 9:02 PM

#

https://tests.stockfishchess.org/tests/view/69108025ec1d00d2c195c5d6 this patch avoids copying it around, so we can make it as large as necessary w/o perf impact (within reason obviously)

prisma hatchBOT Nov 11, 2025, 9:03 PM

#

k7/2n1n3/1nbNbn2/2NbRBn1/1nbRQR2/2NBRBN1/3N1N2/7K w - - 0 1

prime mica Nov 11, 2025, 9:03 PM

#

but obviously it'd be nice to have a good idea of the bound

regal steeple Nov 11, 2025, 9:03 PM

#

this should be legal
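The pawn-count argument can be sketched as a piece count over the FEN board field. This checks only a necessary condition (promoted pieces plus remaining pawns cannot exceed the 8 pawns a side starts with), not full legality, and all function names are illustrative:

```cpp
#include <algorithm>
#include <cassert>
#include <cctype>
#include <string>

// Minimum promotions one side needs to own the pieces in a FEN board field.
// A side starts with 2 knights, 2 bishops, 2 rooks and 1 queen; anything
// beyond that must come from a promoted pawn.
int promotions_needed(const std::string& fen, bool white) {
    int knights = 0, bishops = 0, rooks = 0, queens = 0;
    for (char c : fen) {
        if (c == ' ')
            break;  // the board field ends at the first space
        const unsigned char uc = static_cast<unsigned char>(c);
        if (!std::isalpha(uc) || (std::isupper(uc) != 0) != white)
            continue;  // skip digits, slashes and the other side's pieces
        switch (std::tolower(uc)) {
        case 'n': ++knights; break;
        case 'b': ++bishops; break;
        case 'r': ++rooks; break;
        case 'q': ++queens; break;
        default: break;  // kings and pawns need no promotion
        }
    }
    return std::max(0, knights - 2) + std::max(0, bishops - 2)
         + std::max(0, rooks - 2) + std::max(0, queens - 1);
}

// Pawns still on the board for one side.
int pawns_on_board(const std::string& fen, bool white) {
    const std::string board = fen.substr(0, fen.find(' '));
    return static_cast<int>(std::count(board.begin(), board.end(), white ? 'P' : 'p'));
}

// Necessary (not sufficient) condition for a position to be reachable.
bool passes_promotion_count(const std::string& fen, bool white) {
    return promotions_needed(fen, white) + pawns_on_board(fen, white) <= 8;
}
```

For the first position, White has 8 knights, 3 bishops, 4 rooks and a queen, which needs 9 promotions and fails the check. In the second position two of those knights are black, so White needs 7 and Black needs 6, and both sides pass.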

prime mica Nov 11, 2025, 9:04 PM

#

prisma hatch k7/2n1n3/1nbNbn2/2NbRBn1/1nbRQR2/2NBRBN1/3N1N2/7K w - - 0 1

hate when this happens

violet badger Nov 11, 2025, 9:04 PM

#

but yeah, still triggering the issue.

#

stockfish: misc.h:138: void Stockfish::ValueList<T, MaxSize>::push_back(const T&) [with T = Stockfish::DirtyThreat; long unsigned int MaxSize = 64]: Assertion `size_ < MaxSize' failed.

prime mica Nov 11, 2025, 9:05 PM

#

😩

lapis parrot Nov 11, 2025, 9:06 PM

#

regal steeple this should be legal

for the chess clarity this should be illegal

violet badger Nov 11, 2025, 9:06 PM

#

so, we should probably fix the code... and probably include this position in CI somewhere.

lapis parrot Nov 11, 2025, 9:06 PM

#

sf should segfault and print "fuck off" message

#

Kappa

rocky vigil Nov 11, 2025, 9:06 PM

#

regal steeple This position with move e4d5 has 68 dirty threats if I didnt measure anything wr...

oh shoot I forgot that deduplication was only processed after the list was constructed

#

i also forgot about x rays

#

lemme get a new upper bound then

prisma hatchBOT Nov 11, 2025, 9:14 PM

#

K7/8/8/BNQNQNB1/N5N1/R1Q1q2r/n5n1/bnqnqnbk w - - 0 1

rocky vigil Nov 11, 2025, 9:14 PM

#

Qxe3 should have 71

rocky vigil Nov 11, 2025, 9:18 PM

#

regal steeple This position with move e4d5 has 68 dirty threats if I didnt measure anything wr...

replace black bishop on d5 with queen, will add a few more

rocky vigil Nov 11, 2025, 9:35 PM

#

If it’s just raw threats before deduplication we can use 80 as an upper bound

frosty imp Nov 11, 2025, 9:56 PM

#

rocky vigil do we continue waiting on shawn to do nnue-pytorch work or do the merges asynchr...

Oof I hurt my wrist tendons yesterday which is making typing difficult

prime mica Nov 11, 2025, 9:56 PM

#

O no

#

feel better

frosty imp Nov 11, 2025, 9:58 PM

#

Basically what I wanted to do is to move features to under model/modules, and for each feature set, define its own module by inheriting from DoubleTransformerSlice

#

Then each module specific to the input feature can define its own coalesce weights function

#

FeatureSet stuff can prolly just be deleted. Combined features can be reimplemented if needed anyway

amber fern Nov 11, 2025, 10:44 PM

#

rocky vigil Qxe3 should have 71

is this a competition xD

long quest Nov 11, 2025, 10:57 PM

#

prime mica your patch works extremely well locally

looks like the test’s gonna take a while since I didn’t set simplification bounds - what should I do?

prime mica Nov 11, 2025, 10:57 PM

#

if it's ok with you we can stack our two changes and it'll probably pass a bit faster...

#

but taking a while is also fine

long quest Nov 11, 2025, 10:59 PM

#

sure, can you just make a combined patch?

prime mica Nov 11, 2025, 10:59 PM

#

yes!

long quest Nov 11, 2025, 11:00 PM

#

cool, I'll stop my test

prime mica Nov 11, 2025, 11:05 PM

#

sounds good, I think I'll make the test targeting 0x539's test (https://tests.stockfishchess.org/tests/live_elo/69139f8e7ca87818523314dc) if it passes because they touch the same area of code...

dark stream Nov 12, 2025, 4:37 AM

#

https://tests.stockfishchess.org/tests/live_elo/69138a317ca87818523314bf
Passed

#

Time to merge?

amber fern Nov 12, 2025, 4:41 AM

#

Merging time??? 😁

amber fern Nov 12, 2025, 5:00 AM

#

Observation: The stockfish discord channel is either really busy, or completely quiet depending on the time of day it is xD

dark stream Nov 12, 2025, 5:02 AM

#

amber fern Observation: The stockfish discord channel is either really busy, or completely q...

Eh, it was still relatively busy this time of day for the last few days.

I just think people are chilling after putting in this much hard work. Which is fair.

amber fern Nov 12, 2025, 5:08 AM

#

dark stream Eh, it was still relatively busy this time of day for the last few days. I just...

I think it becomes more active around 2 hours from now?

#

If I remember correctly

dark stream Nov 12, 2025, 5:09 AM

#

amber fern I think it becomes more active around 2 hours from now?

We'll find out. Anyway, threat inputs probably gets merged today.

amber fern Nov 12, 2025, 5:29 AM

#

I'm most curious about the progression tests before and after search tuning with the best net, just how much elo will be gained since sf 17/17.1!?

dark stream Nov 12, 2025, 5:35 AM

#

amber fern I'm most curious about the progression tests before and after search tuning with the...

I'm even more eager for search modifications, tbh. For the last couple of months gains from that had stalled. The new net probably changes that for a while.

violet badger Nov 12, 2025, 6:17 AM

#

rocky vigil If it’s just raw threats before deduplication we can use 80 as an upper bound

I've ticked the box on the second LTC run, and added one for this upper bound change https://github.com/official-stockfish/Stockfish/pull/6406#issuecomment-3513836954

dark stream Nov 12, 2025, 6:24 AM

#

violet badger I've ticked the box on the second LTC run, and added one for this upper bound ch...

So any idea what the upper bound should be?

rocky vigil Nov 12, 2025, 6:36 AM

#

rocky vigil If it’s just raw threats before deduplication we can use 80 as an upper bound

.

#

(8 attacks + 16 attacked) * 3 pieces + 8 discovered attacks
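Spelled out as a compile-time check, the expression above does evaluate to 80, which covers both the 68 and the 71 measured earlier. The constant names are illustrative only and simply mirror the wording of the message:

```cpp
#include <cassert>

// Terms copied from the message above; the names are illustrative.
constexpr int kAttacks           = 8;
constexpr int kAttacked          = 16;
constexpr int kPieces            = 3;
constexpr int kDiscoveredAttacks = 8;

constexpr int kRawThreatBound = (kAttacks + kAttacked) * kPieces + kDiscoveredAttacks;

static_assert(kRawThreatBound == 80, "matches the bound quoted in the thread");
static_assert(68 > 64 && 68 <= kRawThreatBound, "e4d5 case overflows 64 but fits in 80");
static_assert(71 > 64 && 71 <= kRawThreatBound, "Qxe3 case overflows 64 but fits in 80");
```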

dark stream Nov 12, 2025, 6:37 AM

#

Oh, right. Btw, will this change playing strength or something like that?

rocky vigil Nov 12, 2025, 6:38 AM

#

No

#

I guess it's just updating this and the logic

#

And then it is in a mergeable state

violet badger Nov 12, 2025, 6:44 AM

#

sounds good...

frosty imp Nov 12, 2025, 6:46 AM

#

wait a second

#

is the small net totalinputs correct

#

ah nvm it's correct

amber fern Nov 12, 2025, 6:48 AM

#

can we get a countdown to merge? Like on new years? Very serious request.

frosty imp Nov 12, 2025, 6:48 AM

#

ig some of the random newlines in the diff can be removed

rocky vigil Nov 12, 2025, 6:49 AM

#

for the squashed commit, would people prefer I use their public emails or noreply emails?

#

btw with the change, it can now run both positions without triggering an assert

rocky vigil Nov 12, 2025, 6:54 AM

#

violet badger so, we should probably fix the code... and probably include this position in CI ...

i think we can try to find a similar position that is actually a mate, and then add that onto matetrack

violet badger Nov 12, 2025, 6:54 AM

#

I think we could just add that position to bench, it gets caught by the asserts.

rocky vigil Nov 12, 2025, 6:55 AM

#

ah sure

violet badger Nov 12, 2025, 6:55 AM

#

rocky vigil for the squashed commit, would people prefer I use their public emails or norepl...

just what is already in the git log?

#

(in that way github should properly associate it with the accounts I think)

rocky vigil Nov 12, 2025, 6:55 AM

#

ok

amber fern Nov 12, 2025, 6:56 AM

#

where are all you guys from? It's 2am in the US right now, so I'm guessing not there?

violet badger Nov 12, 2025, 6:56 AM

#

let's try to keep this thread on development please..

frosty imp Nov 12, 2025, 6:57 AM

#

@rocky vigil pr sent

amber fern Nov 12, 2025, 6:57 AM

#

violet badger let's try to keep this thread on development please..

threat*

#

kidding lol

frosty imp Nov 12, 2025, 6:57 AM

#

also I forgot to include disservin as a coauthor

rocky vigil Nov 12, 2025, 6:58 AM

#

frosty imp also I forgot to include disservin as a coauthor

yeah disservin/linrock/vondele should also be listed as coauthors

rocky vigil Nov 12, 2025, 7:00 AM

#

violet badger I think we could just add that position to bench, it gets caught by the asserts.

do this now, or after merge?

violet badger Nov 12, 2025, 7:00 AM

#

now, as part of the correction

rocky vigil Nov 12, 2025, 7:00 AM

#

ok

#

with this change, bench becomes 2626086, can someone else confirm

#

(it has been pushed to the PR)

violet badger Nov 12, 2025, 7:08 AM

#

2626086

prime mica Nov 12, 2025, 7:09 AM

#

woot

#

Starting to wonder the true max now haha

rocky vigil Nov 12, 2025, 7:11 AM

#

violet badger just what is already in the git log?

git log contains both noreply emails and public emails probably because of differences between local commits and those done from github

#

how should I do this

#

I have uh, never really done big PRs before

prime mica Nov 12, 2025, 7:12 AM

#

I think the public email makes most sense

rocky vigil Nov 12, 2025, 7:12 AM

#

ok

#

so I'll use public email if I can find it in git logs, and noreply otherwise

prime mica Nov 12, 2025, 7:12 AM

#

Huzzah

frosty imp Nov 12, 2025, 7:12 AM

#

[email protected] plz Kappa

prime mica Nov 12, 2025, 7:12 AM

#

Goated domain name

rocky vigil Nov 12, 2025, 7:12 AM

#

frosty imp [email protected] plz Kappa

🤔

frosty imp Nov 12, 2025, 7:13 AM

#

huh that seems to not be my github email actually

rocky vigil Nov 12, 2025, 7:13 AM

#

frosty imp huh that seems to not be my github email actually

Shawn Xu [email protected]

#

is what git logs have

violet badger Nov 12, 2025, 7:13 AM

#

he used his bench

frosty imp Nov 12, 2025, 7:13 AM

#

nvm it is on my gh profile. just anything is fine I guess

rocky vigil Nov 12, 2025, 7:14 AM

#

i didn't find a public email for cj in this log

#

yeah it seems that cj prefers noreply

frosty imp Nov 12, 2025, 7:31 AM

#

@rocky vigil PR sent

rocky vigil Nov 12, 2025, 7:32 AM

#

ok ok

#

cool

#

is this it fr

rocky vigil Nov 12, 2025, 7:34 AM

#

frosty imp @rocky vigil PR sent

(I actually had almost finished making the squash commit before I saw this)

stray reef Nov 12, 2025, 7:35 AM

#

are we about to merge 👀

rocky vigil Nov 12, 2025, 7:35 AM

#

if no further suggestions

#

but somehow I feel like there will still be one

violet badger Nov 12, 2025, 7:36 AM

#

From my side, I think we're basically good to go, but can only merge later today, so still a couple of hours to fix small things, if needed.

rocky vigil Nov 12, 2025, 7:36 AM

#

ok

#

i can wait until you are able to merge then

#

to do this

violet badger Nov 12, 2025, 7:37 AM

#

I think you can squash it essentially now. Even if tiny stuff comes after, this can still be integrated.

rocky vigil Nov 12, 2025, 7:37 AM

#

ok

dark stream Nov 12, 2025, 7:38 AM

#

Btw, there are still some speedups and stuff ready for after the merge, right?

violet badger Nov 12, 2025, 7:38 AM

#

probably. We'll see.

#

one thing at a time ..

rocky vigil Nov 12, 2025, 7:39 AM

#

ok it looks like contributors came out fine

frosty imp Nov 12, 2025, 7:41 AM

#

oops seems that the inline wasn't removed

#

I've made a pr but is the branch right

stray reef Nov 12, 2025, 7:42 AM

#

oh i still have a suggestion, gotta add yoshie2000 to AUTHORS Kappa

frosty imp Nov 12, 2025, 7:42 AM

#

true

violet badger Nov 12, 2025, 7:44 AM

#

The purple elimination plan is working well.

rocky vigil Nov 12, 2025, 7:45 AM

#

stray reef oh i still have a suggestion, gotta add yoshie2000 to AUTHORS Kappa

oh right

stray reef Nov 12, 2025, 7:46 AM

#

i can pr in 5mins

rocky vigil Nov 12, 2025, 7:47 AM

#

ok just do it to the original pr branch

#

I'll redo the squash in a moment

rocky vigil Nov 12, 2025, 7:47 AM

#

frosty imp I've made a pr but is the branch right

yeah I can redo the squash

#

I've already written up the PR / squash commit msgs so it's fast

stray reef Nov 12, 2025, 7:53 AM

#

https://github.com/sscg13/Stockfish/pull/15

rocky vigil Nov 12, 2025, 7:54 AM

#

merge

#

d

#

alright lemme actually

#

redo the squash

frosty imp Nov 12, 2025, 7:59 AM

#

cool

rocky vigil Nov 12, 2025, 7:59 AM

#

https://github.com/official-stockfish/Stockfish/pull/6411

GitHub

Update NNUE architecture to SFNNv10 with Threat Inputs and net nn-4...

This PR introduces Full Threat Input features, which are a subset of Piece(Square)-Piece(Square) pairs. In any given position, the active features consist of pairs where the second piece’s square l...

frosty imp Nov 12, 2025, 7:59 AM

#

mergers when

rocky vigil Nov 12, 2025, 7:59 AM

#

when vondele gets back

stray reef Nov 12, 2025, 8:03 AM

#

ci failing nohope

rocky vigil Nov 12, 2025, 8:03 AM

#

eh why's it complaining about bench mismatch

#

???

frosty imp Nov 12, 2025, 8:04 AM

#

you didn't include bench in the commit

rocky vigil Nov 12, 2025, 8:04 AM

#

shoot

#

ok

#

force-with-lease my beloved

stray reef Nov 12, 2025, 8:04 AM

#

ci says

signature mismatch: reference 2351426 obtained: 2626086 .
tho, where is it taking the reference from then? latest commit with bench?

frosty imp Nov 12, 2025, 8:05 AM

#

yeah

rocky vigil Nov 12, 2025, 8:05 AM

#

yeah that's master

frosty imp Nov 12, 2025, 8:05 AM

#

it searches backwards
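The lookup described here can be sketched as a scan over commit messages from newest to oldest, taking the first bench number found. This is an illustration only: the `Bench: ` marker and the function name are assumptions, not the actual CI script.

```cpp
#include <cassert>
#include <cctype>
#include <optional>
#include <string>
#include <vector>

// Returns the bench number from the newest commit message that carries one.
// Assumes messages are ordered newest first and use a "Bench: <digits>" line.
std::optional<long> reference_bench(const std::vector<std::string>& messages) {
    const std::string marker = "Bench: ";  // assumed marker format
    for (const std::string& msg : messages) {
        const std::size_t pos = msg.find(marker);
        if (pos == std::string::npos)
            continue;  // no bench line here, keep searching older commits
        long value = 0;
        bool any   = false;
        for (std::size_t i = pos + marker.size();
             i < msg.size() && std::isdigit(static_cast<unsigned char>(msg[i])); ++i) {
            value = value * 10 + (msg[i] - '0');
            any   = true;
        }
        if (any)
            return value;
    }
    return std::nullopt;
}
```

If the newest commit has no bench line, the scan falls through to an older commit, which would explain a reference of 2351426 from master while the binary reports 2626086.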

stray reef Nov 12, 2025, 8:05 AM

#

alright, then it's an easy fix

rocky vigil Nov 12, 2025, 8:05 AM

#

ye

#

i've done it

rocky vigil Nov 12, 2025, 8:05 AM

#

rocky vigil force-with-lease my beloved

lol

frosty imp Nov 12, 2025, 8:06 AM

#

finally this threat can go to rest

#

or maybe not until nnue-pytorch merge

#

nohope

rocky vigil Nov 12, 2025, 8:07 AM

#

yeah

#

also it's funny

#

how the random people cannot seem to like

#

comprehend

#

L1 is 3x smaller

#

but the net is larger

prime mica Nov 12, 2025, 8:08 AM

#

who is "random people"

prime mica Nov 12, 2025, 8:08 AM

#

frosty imp finally this threat can go to rest

thread

frosty imp Nov 12, 2025, 8:08 AM

#

surely threat smallnet could work now

prime mica Nov 12, 2025, 8:08 AM

#

;)

rocky vigil Nov 12, 2025, 8:08 AM

#

prime mica who is "random people"

the cloners and other ppl who are infamous

prime mica Nov 12, 2025, 8:08 AM

#

ah ok

rocky vigil Nov 12, 2025, 8:09 AM

#

how did we violate clang-format 💀

#

oh well

frosty imp Nov 12, 2025, 8:10 AM

#

here
https://github.com/official-stockfish/Stockfish/pull/6411/commits/f6a51ae827957f2e3c0669d567e61c4fde6a435f#diff-e428c7a719856e301ac3978f5bb612bbafba0494838d9ec4c7fc5fbda8451410R371

#

prolly because my clang-format is v19 ⚓

rocky vigil Nov 12, 2025, 8:10 AM

#

blegh

frosty imp Nov 12, 2025, 8:11 AM

#

vondele can do it during merge tho

rocky vigil Nov 12, 2025, 8:12 AM

#

yea

rocky vigil Nov 12, 2025, 8:34 AM

#

stray reef alright, then it's an easy fix

btw what was 0132r

#

it was going really good in stc

#

but then antiscaled

stray reef Nov 12, 2025, 8:35 AM

#

it's an attempt to migrate to viriformat. but it was using a different teacher net since i haven't gotten around to writing an updated relabeler in c++ yet, that's probably the reason it antiscaled

#

next one with correct teacher net ready in 5-ish hours, but also wrong channel probably

rocky vigil Nov 12, 2025, 8:36 AM

#

ah yeah this channel will probably just be used to finalize nnue-pytorch merge

#

at which point it becomes mainstream

#

and we move to the general channels

frosty imp Nov 12, 2025, 8:38 AM

#

frosty imp Basically what I wanted to do is to move features to under model/modules, and fo...

i can see if I'd be able to work on this in a few days

#

but in the meantime if someone wants to try

foggy wind Nov 12, 2025, 8:48 AM

#

violet badger once the PR is in final stage, I'd like it to be squashed, with the commit messa...

Vondele would also like the PR text as commit message.

rocky vigil Nov 12, 2025, 8:48 AM

#

ok ok

#

i misinterpreted that

violet badger Nov 12, 2025, 8:57 AM

#

makes it easier to avoid mistakes on my part; if I copy and paste from the PR message, formatting, links, etc. might be off.

#

we're super close though....

rocky vigil Nov 12, 2025, 9:35 AM

#

btw I edited

#

the PR text

#

into the commit message

#

i think the only remaining thing is clang-format

violet badger Nov 12, 2025, 9:38 AM

#

that's easy enough on my side.

rocky vigil Nov 12, 2025, 9:52 AM

#

other than that I think everything else is good now?

violet badger Nov 12, 2025, 9:52 AM

#

I think so.

#

and merged 🙂

rocky vigil Nov 12, 2025, 9:53 AM

#

nice

violet badger Nov 12, 2025, 9:53 AM

#

huge thanks to all contributors

#

this has been very nice

#

particular thanks to @rocky vigil and @frosty imp for their huge effort and endurance!

rocky vigil Nov 12, 2025, 9:54 AM

#

yep

#

still got more nnue-pytorch work

#

before it's truly done though

violet badger Nov 12, 2025, 9:54 AM

#

sure, beginning of a journey, hopefully!

dark stream Nov 12, 2025, 9:55 AM

#

So, where will the conversation surrounding this move? SF-general and dev?

violet badger Nov 12, 2025, 9:56 AM

#

I propose that on-topic conversations can move there indeed.

stray reef Nov 12, 2025, 9:56 AM

#

Huge congrats everyone!

formal smelt Nov 12, 2025, 9:57 AM

#

Bruh

#

No coauthor for doing most of the code for threat inputs until it got handed off?

rocky vigil Nov 12, 2025, 9:58 AM

#

did you want it

#

sorry

formal smelt Nov 12, 2025, 9:59 AM

#

You can literally see I wrote all the code in the Monty PRs

violet badger Nov 12, 2025, 10:01 AM

#

hm, oversight and unfortunately 1min late... we can't change the commit, we can add it to the PR.

formal smelt Nov 12, 2025, 10:02 AM

#

https://github.com/official-monty/Monty/pull/87/commits/77e74a2f95da9b3f3e34519422dfb026a05178e1
the first threat input commit btw…

violet badger Nov 12, 2025, 10:03 AM

#

at least that one is referenced

rocky vigil Nov 12, 2025, 10:04 AM

#

yeah I thought while we took inspiration from that code in early impl the current impl is quite far
if we widen the co-authoring range then lofty should also be credited for having the precursor UE impl that Yoshie took inspiration from

#

ok I'll add a comment to the PR

#

@formal smelt is this okay?

formal smelt Nov 12, 2025, 10:10 AM

#

Sure

formal smelt Nov 12, 2025, 10:13 AM

#

rocky vigil yeah I thought while we took inspiration from that code in early impl the curren...

I’d argue that facilitating the whole development of threat inputs in the first place (and note Viren original pdf implemented as-is actually sucks due to explicitly excluding PSQ features) had a lot more bearing on threat inputs passing than w/e 1 elo speedup…

rocky vigil Nov 12, 2025, 10:13 AM

#

rocky vigil yeah I thought while we took inspiration from that code in early impl the curren...

this was my thought, sorry for making you (+ lofty) feel uncredited

#

that was a misunderstanding on my part, I thought Viren did all of the architecture design work

violet badger Nov 12, 2025, 10:16 AM

#

meanwhile SF 17 sees this as an opportunity

stray reef Nov 12, 2025, 10:18 AM

#

SF 18.1 coming

dark stream Nov 12, 2025, 10:20 AM

#

Oh, wait. I just thought of one thing. There was a gap between th1 and th8 performance in the previous PT. That will probably be closed this time.

formal smelt Nov 12, 2025, 10:20 AM

#

rocky vigil that was a misunderstanding on my part, I thought Viren did all of the architect...

Yes unfortunate that you have to dig into every Monty PR to see that

lapis parrot Nov 12, 2025, 10:21 AM

#

dark stream Oh, wait. I just thought of one thing. There was a gap between th1 and th8 perfo...

in actual fact there was no gap

#

8 th PT has bigger gamepair ratio

#

just that it compresses because of bigger game pair draw %

rocky vigil Nov 12, 2025, 10:21 AM

#

formal smelt I’d argue that facilitating the whole development of threat inputs in the first ...

it was discussed at some point in this thread, and the takeaway I got from that was to list direct code+net contributors as co-contributors and credit indirect contributions in PR comments, but i don't remember if you had any part in that discussion

rocky vigil Nov 12, 2025, 10:21 AM

#

formal smelt Yes unfortunate that you have to dig into every Monty PR to see that

I really cannot tell if this is meant to be sarcastic

dark stream Nov 12, 2025, 10:22 AM

#

lapis parrot just that it compresses because of bigger game pair draw %

Oh

formal smelt Nov 12, 2025, 10:27 AM

#

rocky vigil I really cannot tell if this is meant to be sarcastic

It isn’t, you see Viren opened all of them even if (in the case of 2/3 of them) his role was cargo r -ring training and uploading the network
Fixed indexing: https://github.com/official-monty/montytrain/pull/16/commits/98ca72d6cb340767484f49820dc0adf648bd8cb8 (thanks to you also for part of this ofc)
I8 quantisation: https://github.com/official-monty/Monty/pull/116/commits/0f5a66438f77052b6de404ff5080ad35ba2c2943

rocky vigil Nov 12, 2025, 10:29 AM

#

yeah that was definitely part of it, i hope that last comment in the PR makes amends at least somewhat

formal smelt Nov 12, 2025, 10:30 AM

#

Yeah seems good to me

#

Malding over

rocky vigil Nov 12, 2025, 10:32 AM

#

in retrospect I should have taken some more care handling the crediting, I forgot that SF is a way larger project and that the vast majority of people don't know the "inside info" and just go by what is written

rocky vigil Nov 12, 2025, 10:44 AM

#

violet badger sure, beginning of a journey, hopefully!

to start this off, can we test the l2=31 net?

violet badger Nov 12, 2025, 10:46 AM

#

sure

lofty cedar Nov 12, 2025, 10:48 AM

#

Wait... you merged the threat input first without merging the other PRs?

#

Why?

candid ivy Nov 12, 2025, 10:51 AM

#

cause it’s an easy merge

#

and the biggest

#

with the highest possibility of conflicts

lofty cedar Nov 12, 2025, 10:52 AM

#

I see.

amber fern Nov 12, 2025, 11:13 AM

#

will yall be moving the neural net discussions here at some point? Since threat inputs is the real actual only net? https://discord.com/channels/435943710472011776/718853716266188890

violet badger Nov 12, 2025, 11:14 AM

#

no

naive comet Nov 12, 2025, 12:01 PM

#

@rocky vigil wasn't it me.....

#

.

rocky vigil Nov 12, 2025, 12:02 PM

#

💀

#

yeah I remembered this one sorry

#

this is what happens when too many messages get sent

twilit oriole Nov 12, 2025, 12:10 PM

#

@formal smelt I was under the impression you were getting credited second from the prior list posted. On the linked document the credit I claim is for creating the feature set. On the issue of who created the PRs, you raised no issues at the time with being coauthored on all of them. I had no problem if you wanted to open some of them yourself

formal smelt Nov 12, 2025, 12:12 PM

#

twilit oriole @formal smelt I was under the impression you were getting credited secon...

I'm not concerned about that, on the monty commits it's coauthored
it's only an "issue" now from SF perspective because there was effectively no credit to me unless a person happened to actually inspect the Monty PRs and commits within

lapis parrot Nov 12, 2025, 12:19 PM

#

ugh

#

now I seem to need to download networks by hand kek

rocky vigil Nov 12, 2025, 12:22 PM

#

huh

#

it should still autodownload

lapis parrot Nov 12, 2025, 12:22 PM

#

if you can autoopen fishtest which seem to not work I guess

violet badger Nov 12, 2025, 1:57 PM

#

nothing changed there, should still be working unless some IP block? (not by us).

lapis parrot Nov 12, 2025, 1:59 PM

#

yes, and this is what I'm referring to (IP block)

#

since our net hasn't changed in ages welp, it wasn't an issue

#

well it's not really an issue anyway

rocky vigil Nov 12, 2025, 2:54 PM

#

violet badger sure

should I just put stage 5 on fishtest, or wait for a local test on gitlab first

lapis parrot Nov 12, 2025, 2:56 PM

#

just put stuff on fishtest

rocky vigil Nov 12, 2025, 2:58 PM

#

ok

violet badger Nov 12, 2025, 3:07 PM

#

yes, probably best... in this case, I couldn't run a local test, since the inference code wasn't immediately available. For model arch changes, it is nice if that's available, in that case the local test runs directly.

lofty cedar Nov 13, 2025, 4:40 AM

#

After all the efforts... we got about 1 elo PT 😭

#

All of that... for a drop of elo?

#

Like... if we knew this from the start, would we have done this at all?

native lake Nov 13, 2025, 4:41 AM

#

lofty cedar Like... if we knew this from the start, would we have done this at all?

Viren has a 4 elo patch?

lofty cedar Nov 13, 2025, 4:42 AM

#

Yeah... it was 6 elo VVLTC SPRT before.

Then the PT came and only 1 was real progress.

dark stream Nov 13, 2025, 4:42 AM

#

lofty cedar Nov 13, 2025, 4:42 AM

#

Yes, I know that.

lofty cedar Nov 13, 2025, 4:43 AM

#

dark stream

The point is not that the 6 elo was accurate. The point was that...

We got 1 freaking elo from all these?

#

There were times where you got a few random patches and like 3 elo.

#

And this time, all these for a single elo drop?

dark stream Nov 13, 2025, 4:50 AM

#

lofty cedar And this time, all these for a single elo drop?

Are you being serious here?

We have no idea what effect the verbatim patch had, so we are not really getting good reference. Ideally, it should have improved elo in PT, but unless threat-inputs actually lost elo in PT, we don't really know.
There is still "a lot of low hanging fruit" left as per Viren. So some elo gains can be expected from that, purely from speedups and things.
There is much scope for things like search tunes, which should also gain a nice bit.
Most importantly, search gains have stagnated for months at this point. This might change that.

twilit oriole Nov 13, 2025, 4:51 AM

#

There is the 8 Elo of net spsa also for later

lofty cedar Nov 13, 2025, 4:52 AM

#

Oh... yeah... I guess maybe we can gain 10 elo after these. That would be nice.

And the fact that now we aren't stagnant on net anymore too.

dark stream Nov 13, 2025, 4:54 AM

#

And honestly, the PTs haven't exactly matched sources like NCM or SPCC a lot lately. I'm curious what SPCC will show.

#

SPCC even has the verbatim patch, so...

stray gyro Nov 13, 2025, 5:47 AM

#

I'd be fine even if this were neutral; what's important for me is that we now have simple training steps that don't require SPSA.

dark stream Nov 13, 2025, 5:49 AM

#

stray gyro I'm fine even if this was neutral at all, what important for me is that we now h...

Yes, thank you. The reproduction of the net training pipeline was also one of the bigger boons.

stray gyro Nov 13, 2025, 5:49 AM

#

Well eventually master net will have SPSA optimized on top of everything, but this is a good improvement.

lapis parrot Nov 13, 2025, 6:04 AM

#

also since threat inputs are really hardware dependent

#

you can get w/e the hell noise from PT

#

someone can actually run a script that separates results by archs on pt

#

would be interesting

dark stream Nov 13, 2025, 6:17 AM

#

SF18 already on ARM.

rocky vigil Nov 13, 2025, 6:52 AM

#

lapis parrot someone can actually run a script that separates results by archs on pt

Locally on vondele fleet ltc pt is +50 so

rocky vigil Nov 13, 2025, 10:12 AM

#

formal smelt I'm not concerned about that, on the monty commits its coauthored its only an "i...

you'll also get credited on the NNUE pytorch PR

#

eventually

amber fern Nov 13, 2025, 10:15 AM

#

The last progression tests are at about +2 elo at the moment, so that's not too bad 🙂

#

violet badger Nov 13, 2025, 12:17 PM

#

rocky vigil you'll also get credited on the NNUE pytorch PR

btw, I would also see the fact that we currently have clearly defined how to train the master net with nnue-pytorch as an opportunity to repeat this with bullet. I can only believe this is beneficial for both toolchains. I'm not religious about nnue-pytorch and wouldn't mind running some trainings with bullet, while showing equivalence (or reaching it) might be useful for all engines using bullet.

rocky vigil Nov 13, 2025, 12:40 PM

#

weren't there still some kinds of conversion issues to be resolved

violet badger Nov 13, 2025, 12:47 PM

#

so far it is not working, afaik.

#

certainly would take some effort, but could be worth it?

rocky vigil Nov 13, 2025, 12:47 PM

#

i strongly suspect this is because the default psq order in bullet is different, btw

#

in sf it goes like, white pawn, black pawn, white knight, black knight, ... , king

#

whereas in bullet it goes like white pawn, white knight, ... , black pawn, black knight, ...
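
A minimal sketch of the plane reordering described in the two messages above, ignoring king buckets and mirroring. The function names are hypothetical, and two things are assumed rather than taken from either codebase: each plane holds 64 squares, and the SF-style layout uses a single shared king plane (11 planes) while the bullet-style layout has 12 colour-major planes.

```python
PIECES = ["pawn", "knight", "bishop", "rook", "queen", "king"]

def sf_plane(color, piece):
    """Plane in the SF-style order: colours interleaved per piece type.
    color is 0 for the first colour (white), 1 for the second (black).
    Assumption: both kings share the final plane."""
    if piece == "king":
        return 10
    return 2 * PIECES.index(piece) + color

def bullet_plane(color, piece):
    """Plane in the colour-major order described for bullet (assumed layout)."""
    return 6 * color + PIECES.index(piece)

def remap_feature(bullet_index):
    """Map a colour-major feature index to the interleaved order."""
    plane, sq = divmod(bullet_index, 64)
    color, p = divmod(plane, 6)
    return 64 * sf_plane(color, PIECES[p]) + sq

# A black pawn on square 0 sits in plane 6 in the colour-major layout
# and in plane 1 in the interleaved layout.
print(remap_feature(6 * 64))
```

A permutation like this could be applied either when the features are generated or when the saved weights are converted; the sketch does not favour one over the other.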

violet badger Nov 13, 2025, 12:49 PM

#

idk, this was the branch no https://github.com/Disservin/sf-bullet-train

rocky vigil Nov 13, 2025, 12:49 PM

#

yep and this line https://github.com/Disservin/sf-bullet-train/blob/main/src/main.rs#L34

#

indicates it's using a bullet default

rocky vigil Nov 13, 2025, 12:49 PM

#

rocky vigil i strongly suspect this is because the default psq order in bullet is different,...

which runs into this issue

violet badger Nov 13, 2025, 12:50 PM

#

would be a good reason..

formal smelt Nov 13, 2025, 12:54 PM

#

there was also the 127/128 stuff

rocky vigil Nov 13, 2025, 12:54 PM

#

yep

#

now it's 255/256

rocky vigil Nov 13, 2025, 12:54 PM

#

formal smelt there was also the 127/128 stuff

but that wouldn't be cause for a major error

#

still good to fix

formal smelt Nov 13, 2025, 12:55 PM

#

i can't remember where it is off the top of my head but couldn't it cause overflow or something

rocky vigil Nov 13, 2025, 12:55 PM

#

oh huh

#

i thought that was a trainer-side thing

#

some form of QAT

formal smelt Nov 13, 2025, 12:56 PM

#

i thought it was so you could divide by 128 rather than 127 in inference

#

probably fine
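
To illustrate the "divide by 128 rather than 127" point: a small sketch of multiplying two activations quantised to the range 0..127. The helper names are made up, and this shows the general trick only, not Stockfish's or bullet's actual inference code.

```python
QMAX = 127  # largest quantised activation value

def mul_exact(a, b):
    """Rescale the product by the true scale factor (needs a division)."""
    return (a * b) // QMAX

def mul_shift(a, b):
    """Rescale with a cheap right shift, i.e. divide by 128 instead of 127."""
    return (a * b) >> 7

for a, b in [(127, 127), (64, 64), (10, 100)]:
    print(a, b, mul_exact(a, b), mul_shift(a, b))
```

Before rounding, the shifted result equals the exact one times 127/128, so a trainer that wants to match the shift would presumably scale by that factor. At full scale the shift version tops out at 126 instead of 127.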

rocky vigil Nov 13, 2025, 12:57 PM

#

yeah

#

using 1, anyways

#

is like -2 elo at most

formal smelt Nov 13, 2025, 1:00 PM

#

well it would be nice to try again
also bullet_lib api has been stable for many months now, unlike when Disservin's branch was written

rocky vigil Nov 13, 2025, 1:00 PM

#

yep

#

someone just has to write it

#

and that someone wouldn't be me :P

formal smelt Nov 13, 2025, 1:01 PM

#

probably not me either for some time

#

i am currently rewriting the compiler part

rocky vigil Nov 13, 2025, 1:01 PM

#

yoshie is the most likely person to ask to write

rocky vigil Nov 13, 2025, 2:23 PM

#

btw @stray reef would you want to look into attempting to write a bullet training config for sf net?

stray reef Nov 13, 2025, 2:34 PM

#

i believe there's already some (more or less working) training configs, but they just can't be loaded into SF?

rocky vigil Nov 13, 2025, 2:35 PM

#

rocky vigil i strongly suspect this is because the default psq order in bullet is different,...

see here

#

i think this is the main one to fix

stray reef Nov 13, 2025, 2:35 PM

#

that sounds very easy to fix then

rocky vigil Nov 13, 2025, 2:35 PM

#

yeah if the later layers quantization still works

#

then it's just a matter of modifying psq part of features to match sf

#

ofc the transposing still needs to be done

stray reef Nov 13, 2025, 2:37 PM

#

transposing seems to already be done via SavedFormat in disservins config

rocky vigil Nov 13, 2025, 2:37 PM

#

https://github.com/Disservin/sf-bullet-train/blob/main/convert_quantised_to_pytorch.py

#

there's a separate one

stray reef Nov 13, 2025, 2:37 PM

#

ah

#

is stockfish not using floats for the later layers anymore?

rocky vigil Nov 13, 2025, 2:38 PM

#

i thought it's always been ints

stray reef Nov 13, 2025, 2:38 PM

#

alright then

rocky vigil Nov 13, 2025, 2:39 PM

#

feature_set_hash needs to change, idk what else

#

i forgot what "version" was

#

but yeah it seems like it shouldn't be too much additional work

stray reef Nov 13, 2025, 2:41 PM

#

what's better, doing the pst shuffling in bullet or the python script?

rocky vigil Nov 13, 2025, 2:41 PM

#

i think it'll be easier for you if you do it in the python script

#

also beware of the input buckets and the mirroring as well

#

sf mirrors to efgh

#

and '

#

'e1' is bucket 31

stray reef Nov 13, 2025, 2:46 PM

#

h8 h7 ... g8 g7 ... e8 ... e1
or
h8 g8 ... h7 g7 ... h1 ... e1
?

rocky vigil Nov 13, 2025, 2:46 PM

#

h1 = 28 (so, the latter)

#

actually you might be better off just defining the psq part properly in bullet

#

so that you don't need extra shuffling
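
The bucket layout described above (mirror to files e-h, h8 first, e1 last) can be sketched as follows. Square numbering a1 = 0 ... h8 = 63 and the function names are assumptions for illustration; the bucket values for e1 and h1 are the ones quoted in the chat.

```python
def square(name):
    """Convert a name like 'e1' to an index with a1 = 0 and h8 = 63."""
    return (ord(name[0]) - ord("a")) + 8 * (int(name[1]) - 1)

def orient(sq):
    """Mirror horizontally so that a king on files a-d lands on files e-h."""
    return sq ^ 7 if sq % 8 < 4 else sq

def king_bucket(king_sq):
    """Bucket counted h8 -> 0, g8 -> 1, ..., h1 -> 28, ..., e1 -> 31."""
    rank, file = divmod(orient(king_sq), 8)
    return (7 - rank) * 4 + (7 - file)

for name in ["e1", "h1", "d1", "h8"]:
    print(name, king_bucket(square(name)))
```

The king square here is taken from the perspective's own point of view, so the other side's king would need a vertical flip first.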

formal smelt Nov 13, 2025, 2:48 PM

#

if you were implementing it properly you should probably write a custom dataloader and input type and then obviate the shuffling entirely

rocky vigil Nov 13, 2025, 2:48 PM

#

so many complications

formal smelt Nov 13, 2025, 2:48 PM

#

its not really that hard

stray reef Nov 13, 2025, 2:54 PM

#

custom input type sure, but why the dataloader?

#

at least, what does it have to do with the pst order and mirroring

stray reef Nov 13, 2025, 4:48 PM

#

https://github.com/Disservin/sf-bullet-train/compare/main...Yoshie2000:sf-bullet-train:fix-inputs
Basically did it (hopefully) exactly like https://github.com/official-stockfish/Stockfish/blob/master/src/nnue/features/half_ka_v2_hm.cpp so it should work, but I can't try it right now.
One thing that confused me a bit was the merged kings stuff vs. halfkav2, i thought the stm king has no feature in halfkav2 because it's implicitly encoded in the bucket, but it doesn't seem like that's the case now

rocky vigil Nov 13, 2025, 5:01 PM

#

no the king is merged

#

it's just like post processed

stray reef Nov 13, 2025, 5:30 PM

#

why are perspective kings not excluded in https://github.com/official-stockfish/Stockfish/blob/master/src/nnue/features/half_ka_v2_hm.cpp#L40-L48 then?

rocky vigil Nov 13, 2025, 5:32 PM

#

wdym

#

the king also has a feature

#

iirc there was an idea to remove the biases to merge them into the king feature

stray reef Nov 13, 2025, 5:33 PM

#

i thought the stm king has no feature in halfkav2 because it's implicitly encoded in the bucket
i guess i misinterpreted your response to this then

rocky vigil Nov 13, 2025, 5:34 PM

#

rocky vigil the king also has a feature

like for bucket 29, ([stm] king, g1) is also a feature

#

someone should test if 32 buckets vs 16 is really worth it

#

later

rocky vigil Nov 13, 2025, 7:15 PM

#

stray reef <https://github.com/Disservin/sf-bullet-train/compare/main...Yoshie2000:sf-bulle...

are we not adding threat inputs

#

they are the same as in bullet

stray reef Nov 13, 2025, 7:29 PM

#

i think it's much easier to first test this and then add threat inputs

rocky vigil Nov 13, 2025, 7:32 PM

#

ok

#

how about reduce the L1 size to 1024 anyways

#

first

#

to make the test run faster

stray reef Nov 13, 2025, 8:31 PM

#

only need to train 1 SB anyway to see if results are reasonable

#

Alright I updated bullet on my branch, and I am now able to train something that produces a reasonable startpos eval of 43 (internal units)

Gotta stop for today, and I have no idea if I broke something in the SavedFormat (i probably did), so here's the checkpoint in case someone with more knowledge about how to get things loaded into Stockfish wants to take a look

https://1drv.ms/u/c/74d39b59afff2586/IQAPLaqfHD1gQLry3utFOuxXAWs8OVfaZnq1X46PJacO_lw?e=egTwI3

frosty imp Nov 13, 2025, 10:37 PM

#

stray reef Alright I updated bullet on my branch, and I am now able to train something that...

https://github.com/official-stockfish/nnue-pytorch/blob/3e21b519598b456c1077d0fb85e47eff38356a0f/model/utils/serialize.py#L69

rare jacinth Nov 14, 2025, 4:23 AM

#

Maybe it's worth trying having additional features for the locations of pawns on the same or adjacent files to improve pawn structure understanding

rocky vigil Nov 14, 2025, 10:51 AM

#

stray reef Alright I updated bullet on my branch, and I am now able to train something that...

@stray reef can you run the disservin converter script and upload the .nnue file? I'll try to get it to work in sf by disabling threats on big net

stray reef Nov 14, 2025, 12:34 PM

#

Read checkpoints/test-1/quantised.bin successfully.
Organized data into 8 buckets.
Writing to checkpoints/test-1/pytorch.nnue...
Ending position for bucket 0: 69717832
Bucket 0 size: 1152 bytes
Ending position for bucket 1: 69768240
Bucket 1 size: 1152 bytes
Ending position for bucket 2: 69818648
Bucket 2 size: 1152 bytes
Ending position for bucket 3: 69869056
Bucket 3 size: 1152 bytes
Ending position for bucket 4: 69919464
Bucket 4 size: 1152 bytes
Ending position for bucket 5: 69969872
Bucket 5 size: 1152 bytes
Ending position for bucket 6: 70020280
Bucket 6 size: 1152 bytes
Ending position for bucket 7: 70070688
Bucket 7 size: 1152 bytes
Integer value at position 69389475: 2171895937
Conversion complete: checkpoints/test-1/quantised.bin -> checkpoints/test-1/pytorch.nnue

https://1drv.ms/u/c/74d39b59afff2586/IQCTyS0zPPq3Q7v_ywlBV3pKAbabde9n_4bMbBtd54dD5u0?e=8TTMZi

rocky vigil Nov 14, 2025, 12:36 PM

#

this is l1=3072 right?

#

what does "integer value at position 69389475" refer to

stray reef Nov 14, 2025, 12:37 PM

#

i have no idea

rocky vigil Nov 14, 2025, 12:37 PM

#

what's startpos eval supposed to be?

#

i can get it in soon

stray reef Nov 14, 2025, 12:37 PM

#

43 + quantisation error

rocky vigil Nov 14, 2025, 12:39 PM

#

b9535843d87e is hash?

stray reef Nov 14, 2025, 12:40 PM

#

hash of what?

rocky vigil Nov 14, 2025, 12:41 PM

#

the net

#

appears so

#

```
info string NNUE evaluation using nn-37f18f62d772.nnue (6MiB, (22528, 128, 15, 32, 1))
info string Network replica 1: Shared memory.

 NNUE network contributions (White to move)
+------------+------------+------------+------------+
|   Bucket   |  Material  | Positional |   Total    |
|            |   (PSQT)   |  (Layers)  |            |
+------------+------------+------------+------------+
|  0         |     0.00   |  +  1.98   |  +  1.98   |
|  1         |     0.00   |  + 34.08   |  + 34.08   |
|  2         |     0.00   |  + 21.72   |  + 21.72   |
|  3         |     0.00   |  + 17.79   |  + 17.79   |
|  4         |     0.00   |  + 17.84   |  + 17.84   |
|  5         |     0.00   |  + 32.42   |  + 32.42   |
|  6         |     0.00   |  - 11.03   |  - 11.03   |
|  7         |     0.00   |  + 32.33   |  + 32.33   | <-- this bucket is used
+------------+------------+------------+------------+

NNUE evaluation        +32.33 (white side)
Final evaluation       +14.02 (white side) [with scaled NNUE, ...]
```

#

appears wrong

#

(startpos)

stray reef Nov 14, 2025, 12:41 PM

#

rocky vigil b9535843d87e is hash?

ah, sha256sum of the entire file, yes
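
For reference, a short sketch of getting the same kind of 12-character identifier from a net file, assuming it is simply the leading hex digits of the file's SHA-256 as stated above. The function name and the example path are hypothetical.

```python
import hashlib

def net_hash_prefix(path, digits=12):
    """Return the first `digits` hex characters of the file's SHA-256."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so large nets are not loaded at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()[:digits]

# Example (hypothetical path):
# print(net_hash_prefix("checkpoints/test-1/pytorch.nnue"))
```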

rocky vigil Nov 14, 2025, 12:41 PM

#

sigh

stray reef Nov 14, 2025, 12:41 PM

#

well it would have been too good to be true if it just worked

rocky vigil Nov 14, 2025, 12:42 PM

#

can you load startpos into bullet

#

and get the active feature indices?

#

for white perspective

stray reef Nov 14, 2025, 12:45 PM

#

22272 22017 22146 22403 22468 22149 22022 22279 21896 21897 21898 21899 21900 21901 21902 21903 21872 21873 21874 21875 21876 21877 21878 21879 22264 22009 22138 22395 22524 22141 22014 22271

rocky vigil Nov 14, 2025, 12:46 PM

#

ok lemme see

rocky vigil Nov 14, 2025, 12:52 PM

#

stray reef `22272 22017 22146 22403 22468 22149 22022 22279 21896 21897 21898 21899 21900 2...

added: 22208 21953 22082 22339 22468 22085 21958 22215 21832 21833 21834 21835 21836 21837 21838 21839 21936 21937 21938 21939 21940 21941 21942 21943 22328 22073 22202 22459 22524 22205 22078 22335

#

huh

stray reef Nov 14, 2025, 12:53 PM

#

ah so mine are completely wrong then

rocky vigil Nov 14, 2025, 12:53 PM

#

it looks off

#

yeah

#

22208 is (stm rook, a1, bucket 31)

stray reef Nov 14, 2025, 12:55 PM

#

ah lol i just had all piece colors wrong

rocky vigil Nov 14, 2025, 12:55 PM

#

31 * 704 + 6 * 64

#

ok

#

yeah yours are all off by 64

#

lol
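
The index arithmetic above (`31 * 704 + 6 * 64`) can be written out as a small sketch that lists the active features for one perspective. It assumes the layout implied by the numbers in this thread: 704 features per king bucket, 64 squares per plane, planes interleaved own/opponent per piece type with a shared king plane, a vertical flip for black and a horizontal mirror when the king is on files a-d. All names are illustrative and this is not the Stockfish or bullet source.

```python
PIECE_ORDER = "pnbrqk"
START_FEN = "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR"

def parse_board(fen_board):
    """Return (square, is_white, piece_letter) tuples, a1 = 0 ... h8 = 63."""
    pieces = []
    for rank_from_top, row in enumerate(fen_board.split("/")):
        file = 0
        for ch in row:
            if ch.isdigit():
                file += int(ch)          # run of empty squares
            else:
                sq = 8 * (7 - rank_from_top) + file
                pieces.append((sq, ch.isupper(), ch.lower()))
                file += 1
    return pieces

def active_features(fen_board, white_perspective):
    pieces = parse_board(fen_board)
    flip = 0 if white_perspective else 56    # vertical flip for black
    king_sq = next(sq for sq, is_white, p in pieces
                   if p == "k" and is_white == white_perspective) ^ flip
    if king_sq % 8 < 4:                      # king on files a-d: mirror to e-h
        flip ^= 7
        king_sq ^= 7
    bucket = (7 - king_sq // 8) * 4 + (7 - king_sq % 8)
    feats = []
    for sq, is_white, p in pieces:
        if p == "k":
            plane = 10                       # shared king plane
        else:
            own = is_white == white_perspective
            plane = 2 * PIECE_ORDER.index(p) + (0 if own else 1)
        feats.append(bucket * 704 + plane * 64 + (sq ^ flip))
    return feats

white = active_features(START_FEN, True)
print(31 * 704 + 6 * 64 in white)   # own rook on a1
```

Swapping the own/opponent bit for every non-king piece shifts each of those indices by 64 in one direction or the other, which is the "off by 64" pattern noted above.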

stray reef Nov 14, 2025, 12:56 PM

#

lemme train another SB

#

hm loss is fucked. can you send the nstm features as well?

rocky vigil Nov 14, 2025, 1:10 PM

#

stray reef hm loss is fucked. can you send the nstm features as well?


```
BLACK added: 22328 22073 22202 22459 22524 22205 22078 22335 21936 21937 21938 21939 21940 21941 21942 21943 21832 21833 21834 21835 21836 21837 21838 21839 22208 21953 22082 22339 22468 22085 21958 22215
```

#

wait shoot why are these not the same

#

are they the same

stray reef Nov 14, 2025, 1:12 PM

#

yeah, just a different order

#

ok

rocky vigil Nov 14, 2025, 1:12 PM

#

oh yeah

#

right

#

pop order different

rocky vigil Nov 14, 2025, 1:13 PM

#

stray reef hm loss is fucked. can you send the nstm features as well?

do you want to try kiwipete

#


```
BLACK added: 22328 22524 22335 21936 21937 21938 22195 22196 21941 21942 21943 22058 22445 21871 21857 21924 21915 22044 22096 21969 21844 21973 21846 21832 21834 21835 22348 21837 22094 22208 22468 22215
```

stray reef Nov 14, 2025, 1:14 PM

#

that looks like startpos

rocky vigil Nov 14, 2025, 1:14 PM

#

oh

#

shoot

#

huh

#

no it's just bc the white rook on a1

#

is the same among both

stray reef Nov 14, 2025, 1:14 PM

#

bruh sorry

rocky vigil Nov 14, 2025, 1:15 PM

#

lot of features will look similar

#

but some are different

stray reef Nov 14, 2025, 1:15 PM

#

ok looks good now. let's try again

rocky vigil Nov 14, 2025, 1:16 PM

#

cool

#

well hopefully startpos eval shouldn't be cooked

stray reef Nov 14, 2025, 1:18 PM

#

loss is still fucked, let's try smth where stm and ntm have different buckets and mirroring, e.g. 8/2p5/8/2kPKp1p/2p4P/2P5/3P4/8 w - - 0 1

rocky vigil Nov 14, 2025, 1:21 PM

#

ok

#


```
BLACK added: 12788 12781 12709 12768 13341 12764 13339 12698 12696 12685
```

stray reef Nov 14, 2025, 1:22 PM

#

also correct, i'll try disabling the factoriser, maybe the issue is there

rocky vigil Nov 14, 2025, 1:22 PM

#

oh

#

ok

#

😨

#

yoshie is typing...

stray reef Nov 14, 2025, 1:32 PM

#

hm can't really find the reason now. fixing the feature index calculation increases loss by like 50%, and even before that, it was too high for 0 WDL in my opinion, and also barely drops in the next SBs, that doesn't seem right

#

we can still compare startpos eval tho

rocky vigil Nov 14, 2025, 1:33 PM

#

ok

#

sure

stray reef Nov 14, 2025, 1:35 PM

#

https://1drv.ms/u/c/74d39b59afff2586/IQCwwAZG8k1SR6K0NtiD7QjLAQpUJVtBlILxCfH0VUdXWNE?e=2KaVSe

#

hash starts with 54798df9

rocky vigil Nov 14, 2025, 1:37 PM

#

yep

#

+10.67 startpos eval (after normalization)

#

welp

rocky vigil Nov 14, 2025, 1:37 PM

#

stray reef we can still compare startpos eval tho

what was it supposed to be

stray reef Nov 14, 2025, 1:37 PM

#

41-ish

#

hm

rocky vigil Nov 14, 2025, 1:42 PM

#

can you get the psqt skip eval only?

#

for some asymmetric position

stray reef Nov 14, 2025, 1:43 PM

#

tested the pre-TI plenty arch instead of the SF arch with this script. loss is similar, but it works fine in plenty. so i'll stop worrying about loss now :P

rocky vigil Nov 14, 2025, 1:43 PM

#

ok

stray reef Nov 14, 2025, 1:44 PM

#

rocky vigil can you get the psqt skip eval only?

psqt or skip neuron or both?

rocky vigil Nov 14, 2025, 1:44 PM

#

also idk bullet syntax but is this supposed to be forward(stm).crelu().pairwise_mul()

stray reef Nov 14, 2025, 1:45 PM

#

that's being done in the lines above

rocky vigil Nov 14, 2025, 1:45 PM

#

huh

#

how does pairwise_mul work

#

is it supposed to be done before or after concat

naive comet Nov 14, 2025, 1:45 PM

#

well

rocky vigil Nov 14, 2025, 1:45 PM

#

(in bullet)

stray reef Nov 14, 2025, 1:45 PM

#

ah fuck you're right

#

i think pairwise mul worked differently back when disservin wrote this, at least i just renamed the function, and forgot that it's done differently now
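
A small sketch of why the placement matters, assuming `pairwise_mul` splits its input into two halves and multiplies them elementwise. These are plain Python stand-ins on floats, not bullet's API.

```python
def crelu(v):
    """Clipped ReLU on floats: clamp every value to [0, 1]."""
    return [min(max(x, 0.0), 1.0) for x in v]

def pairwise_mul(v):
    """Multiply the first half of v elementwise with the second half."""
    half = len(v) // 2
    return [a * b for a, b in zip(v[:half], v[half:])]

def per_perspective(stm, ntm):
    """Activate and pair up each accumulator on its own, then concatenate."""
    return pairwise_mul(crelu(stm)) + pairwise_mul(crelu(ntm))

def after_concat(stm, ntm):
    """Concatenate first: stm values end up multiplied with ntm values."""
    return pairwise_mul(crelu(stm + ntm))

stm = [0.5, 1.0, 0.5, 0.25]
ntm = [1.0, 0.0, 0.5, 0.5]
print(per_perspective(stm, ntm))
print(after_concat(stm, ntm))
```

The two orderings give different vectors for the same accumulators, so a net trained with one will not evaluate correctly under inference code that assumes the other.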

rocky vigil Nov 14, 2025, 1:47 PM

#

yeah the original one has like

#

pairwise_mul_with_affine

#

or smth

#

bruh why is L1 -> L2 also factorized

#

does sf really do this

stray reef Nov 14, 2025, 1:50 PM

#

https://1drv.ms/u/c/74d39b59afff2586/IQARcuLDsJ_bTYXLiHaawDARAUOL2nUnNvSK2_aMZ2tTWvQ?e=HR8W2e 63-ish for startpos

stray reef Nov 14, 2025, 1:50 PM

#

rocky vigil does sf really do this

apparently?

#

judging by what disservin wrote at least

rocky vigil Nov 14, 2025, 1:51 PM

#

+36(.00)

#

soo

#

i think let's just try to get the basic stuff working

#

so remove the l1 -> l2 factorization

#

this is not screlu

formal smelt Nov 14, 2025, 1:55 PM

#

stray reef i think pairwise mul worked differently back when disservin wrote this, at least...

well it was called pairwise_mul_post_concat lol

stray reef Nov 14, 2025, 1:56 PM

#

rocky vigil this is not screlu

ok i'll change

rocky vigil Nov 14, 2025, 1:56 PM

#

literally endless whack a mole tho

#

lemme check the concat order of dual activation

stray reef Nov 14, 2025, 1:59 PM

#

https://1drv.ms/u/c/74d39b59afff2586/IQDmpB0l9uUFRopEkoXs4Wh7AdIGX9C81iP9MdKqiDmxqrk?e=uHneav no l1 factoriser, crelu instead of screlu, startpos eval 80

rocky vigil Nov 14, 2025, 2:01 PM

#

rocky vigil lemme check the concat order of dual activation

this is the wrong concat order, it's sqr first and then standard

rocky vigil Nov 14, 2025, 2:01 PM

#

rocky vigil literally endless whack a mole tho

^^

stray reef Nov 14, 2025, 2:02 PM

#

bruh

stray reef Nov 14, 2025, 2:02 PM

#

rocky vigil this is the wrong concat order, it's sqr first and then standard

like this, right?

out = out.abs_pow(2.0).concat(out);
out = out.crelu();

rocky vigil Nov 14, 2025, 2:03 PM

#

yeah

#

if this does what I think it does

#

it's concatting ac_0_out to ac_sqr_0_out

#

which means the squared stuff should be first
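
The concat order under discussion, as a float-only sketch that mirrors the two-line snippet above (square, concat with the unsquared values, then clamp). The function names are hypothetical and quantisation is left out.

```python
def clamp01(x):
    """Clamp a float to the range [0, 1]."""
    return min(max(x, 0.0), 1.0)

def dual_activation(x):
    """Squared activations first, then the standard clipped ones."""
    squared = [clamp01(v * v) for v in x]
    standard = [clamp01(v) for v in x]
    return squared + standard

print(dual_activation([-0.5, 0.5, 2.0]))
```

With the halves swapped, the next layer's weights would be applied to the wrong inputs, which would be enough on its own to throw the eval off.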

stray reef Nov 14, 2025, 2:03 PM

#

looks like it

#

https://1drv.ms/u/c/74d39b59afff2586/IQAqHdtk-D6BQp0-2SGiQ1rRARoo1epiEQnMYaaFs1CQhDM?e=8bSi1N 61-ish

rocky vigil Nov 14, 2025, 2:07 PM

#

```
Final evaluation       -11.99 (white side) [with scaled NNUE, ...]
```

#

💢
