prime mica Nov 9, 2025, 6:40 PM

#

I don't get it lol

green moat Nov 9, 2025, 6:42 PM

#

prime mica I don't get it lol

Kieren is Australian.
Yeah, I still don't get it why he should be upset about a branch named emu
🤷‍♂️

foggy wind Nov 9, 2025, 6:43 PM

#

This I guess: https://en.wikipedia.org/wiki/Emu_War

green moat Nov 9, 2025, 6:44 PM

#

foggy wind This I guess: https://en.wikipedia.org/wiki/Emu_War

Oh!

prime mica Nov 9, 2025, 6:45 PM

#

lololol

#

fighting anti-emu bigotry one PR at a time

#

my friend has a pet emu

#

haven't yet met her tho

#

(the emu)

foggy wind Nov 9, 2025, 6:46 PM

#

GROUPED BY x86

x86 | Elo:     1.09 ±    1.46 | LOS:  92.8% | LLR:  1.07 | [269, 7246, 15053, 7370, 302]
ARM | Elo:    20.68 ±    6.52 | LOS: 100.0% | LLR:  1.89 | [7, 276, 744, 425, 20]

prime mica Nov 9, 2025, 6:46 PM

#

hm ok

rocky vigil Nov 9, 2025, 6:46 PM

#

lmao

lapis parrot Nov 9, 2025, 6:46 PM

#

foggy wind This I guess: https://en.wikipedia.org/wiki/Emu_War

zactly

rocky vigil Nov 9, 2025, 6:47 PM

#

well there you have the contributions

lapis parrot Nov 9, 2025, 6:47 PM

#

haven't you seen the memes?

prime mica Nov 9, 2025, 6:47 PM

#

ugh

violet badger Nov 9, 2025, 6:47 PM

#

without vondele fleet LLR printers. .... I object. I print Elo, not LLR

lapis parrot Nov 9, 2025, 6:47 PM

#

#

etc

prime mica Nov 9, 2025, 6:48 PM

#

#NotAllEmus

lapis parrot Nov 9, 2025, 6:48 PM

#

there are a lot of this stuff, you can google yourself

rocky vigil Nov 9, 2025, 6:48 PM

#

foggy wind ``` GROUPED BY x86 x86 | Elo: 1.09 ± 1.46 | LOS: 92.8% | LLR: 1.07 | [...

despite only being 4.6% of games, ARM is responsible for 64% of LLR

prime mica Nov 9, 2025, 6:48 PM

#

superior architecture

#

^_^

#

CISCcels seething

rocky vigil Nov 9, 2025, 6:49 PM

#

well everyone keep saying x86 is dead

#

so we gotta look towards the future

prime mica Nov 9, 2025, 6:49 PM

#

true

#

deprecate x86-64-*

violet badger Nov 9, 2025, 6:49 PM

#

we'll support RISC-V only for the future

lapis parrot Nov 9, 2025, 6:49 PM

#

even at my work we have a buld that supports arm

#

but there are severe downsides though

prime mica Nov 9, 2025, 6:49 PM

#

violet badger we'll support RISC-V only for the future

finally the academic in you comes out

violet badger Nov 9, 2025, 6:50 PM

#

EPI for the win

prime mica Nov 9, 2025, 6:50 PM

#

lapis parrot but there are severe downsides though

true

lapis parrot Nov 9, 2025, 6:50 PM

#

well, what is this function called

#

to calculate 1/x for x being a float

#

fast but not precise

prime mica Nov 9, 2025, 6:50 PM

#

vrcpss

lapis parrot Nov 9, 2025, 6:50 PM

#

nah

prime mica Nov 9, 2025, 6:50 PM

#

lol

lapis parrot Nov 9, 2025, 6:51 PM

#

well in general this function doesn't exist in library of arm cpus we use

#

but exists in dsp

prime mica Nov 9, 2025, 6:51 PM

#

https://developer.arm.com/documentation/ddi0406/cb/Application-Level-Architecture/Instruction-Details/Alphabetical-list-of-instructions/VRECPE

#

maybe recent extension tho

violet badger Nov 9, 2025, 6:51 PM

#

rsqrtss

lapis parrot Nov 9, 2025, 6:51 PM

#

well you should understand that we use controllers etc

prime mica Nov 9, 2025, 6:51 PM

#

what do you work on :o

#

that's very cool

lapis parrot Nov 9, 2025, 6:52 PM

#

relay protection

#

recipf

#

ofc

#

at least in what we use you can't really use this in arm because library doesn't exist, note that this is a big production cycles so you can't simply switch to newer stuff out of the blue

prime mica Nov 9, 2025, 6:53 PM

#

for sure

lapis parrot Nov 9, 2025, 6:54 PM

#

so in general I tend to exclude divisions unless absolutely necessary

prime mica Nov 9, 2025, 6:55 PM

#

ideal

foggy wind Nov 9, 2025, 7:00 PM

#

I would say non functional with avx512icl and gcc 15.2.1

Result of 200 runs
==================
base (...fish.ostrich) =    2055743  +/- 4626
test (...tockfish.emu) =    2053770  +/- 4626
diff                   =      -1973  +/- 2362

speedup        = -0.0010
P(speedup > 0) =  0.0510

prime mica Nov 9, 2025, 7:00 PM

#

yikes

#

we'll see fishtest then

#

might be arch dependent

#

could you also try out https://tests.stockfishchess.org/tests/live_elo/69108025ec1d00d2c195c5d6 when you have time

#

no bench change = non-functional
no bench change, slowdown = dysfunctional

foggy wind Nov 9, 2025, 7:05 PM

#

There is a new warning for snowy-egret-2

prime mica Nov 9, 2025, 7:05 PM

#

screenshot?

#

probably just some unused variable or smth

foggy wind Nov 9, 2025, 7:06 PM

#

position.cpp: In member function 'Stockfish::Position& Stockfish::Position::set(const std::string&, bool, Stockfish::StateInfo*)':
position.cpp:204:16: warning: 'void* memset(void*, int, size_t)' clearing an object of type 'class Stockfish::Position' with no trivial copy-assignment; use value-initialization instead [-Wclass-memaccess]
  204 |     std::memset(this, 0, sizeof(Position));
      |     ~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~
In file included from position.cpp:19:
position.h:80:7: note: 'class Stockfish::Position' declared here
   80 | class Position {
      |       ^~~~~~~~

prime mica Nov 9, 2025, 7:07 PM

#

eh that's fine

#

it's because I added a dummy DirtyThreats to Position

#

we can silence it by casting to char*

lapis parrot Nov 9, 2025, 7:08 PM

#

prime mica we can silence it by casting to char*

then it will be sunday silence

prime mica Nov 9, 2025, 7:08 PM

#

:)

#

technically not UB

#

because wherever I use DirtyThreats I use placement new before it

lapis parrot Nov 9, 2025, 7:08 PM

#

well google "sunday silence"

prime mica Nov 9, 2025, 7:08 PM

#

https://en.wikipedia.org/wiki/Sunday_Silence ??

Sunday Silence

Sunday Silence (March 25, 1986 – August 19, 2002) was an American-bred Thoroughbred racehorse and sire. In 1989, he won the Kentucky Derby and the Preakness Stakes but failed to complete the Triple Crown when he was defeated in the Belmont Stakes. Nevertheless, he won the Breeders' Cup Classic and was voted American Champion Three-Year-Old Col...

lapis parrot Nov 9, 2025, 7:08 PM

#

indeed

#

would be a goated reference to the goat

prime mica Nov 9, 2025, 7:09 PM

#

I still don't get it

#

does "char" or "cast" have a meaning in horse racing

lapis parrot Nov 9, 2025, 7:09 PM

#

nah, just that it will be silencing at the sunday

#

nothing more than this

foggy wind Nov 9, 2025, 7:14 PM

#

prime mica could you also try out https://tests.stockfishchess.org/tests/live_elo/69108025e...

Result of 200 runs
==================
base (...fish.ti_base) =    1997515  +/- 4078
test (...nowy-egret-2) =    2009546  +/- 4115
diff                   =     +12031  +/- 1905

speedup        = +0.0060
P(speedup > 0) =  1.0000

prime mica Nov 9, 2025, 7:14 PM

#

whew

#

hopefully it'll finally pass fishtest

#

ur the best

foggy wind Nov 9, 2025, 7:23 PM

#

prime mica might be arch dependent

sss But it matches my gcc 15 result.

GROUPED BY COMPILER VERSION

g++ 13     | Elo:     2.32 ±    3.55 | LOS:  90.0% | LLR:  0.55 | [23, 899, 2438, 949, 27]
g++ 15     | Elo:    -0.39 ±    5.10 | LOS:  44.0% | LLR: -0.13 | [16, 500, 1190, 509, 9]
g++ 14     | Elo:     4.04 ±    5.30 | LOS:  93.3% | LLR:  0.48 | [11, 442, 1116, 478, 17]
g++ 11     | Elo:     1.24 ±    7.10 | LOS:  63.4% | LLR:  0.06 | [5, 243, 622, 239, 11]
clang++ 20 | Elo:     1.49 ±    8.25 | LOS:  63.8% | LLR:  0.06 | [1, 185, 440, 186, 4]
g++ 12     | Elo:   -10.32 ±   13.22 | LOS:   6.3% | LLR: -0.23 | [3, 77, 177, 62, 1]
clang++ 22 | Elo:   -43.66 ±   43.48 | LOS:   2.3% | LLR: -0.08 | [0, 13, 14, 5, 0]

prime mica Nov 9, 2025, 7:24 PM

#

interesting what is this

violet badger Nov 9, 2025, 7:24 PM

#

blackmail material

prime mica Nov 9, 2025, 7:25 PM

#

I think this is too SSS

prime mica Nov 9, 2025, 7:25 PM

#

violet badger blackmail material

lololol

#

clang developers shitting their pants rn

#

GNU

foggy wind Nov 9, 2025, 7:25 PM

#

prime mica interesting what is this

https://tests.stockfishchess.org/tests/view/69107881ec1d00d2c195c5c2

prime mica Nov 9, 2025, 7:25 PM

#

gotcha

#

idk if it's only -0.1% on Zen 5 and decent on other architectures then I think it's an easy choice

#

but we'll see, might fail

#

I figured out a cool prefetch trick that seems to work ok...

foggy wind Nov 9, 2025, 7:26 PM

#

Even if it is neutral on gcc 15 and works well with older versions, everything is fine.

prime mica Nov 9, 2025, 7:26 PM

#

do the psqt accumulation first and in those loops, prefetch the first chunk of the weights accumulation

#

finnicky tho

#

when u have time if you could check out https://tests.stockfishchess.org/tests/live_elo/6910ec7cec1d00d2c195c6aa that'd be swell

#

I think because you have the X3D (?) it'll be neutral-to-negative

#

because so much cache

#

but maybe better on fishtest

violet badger Nov 9, 2025, 7:42 PM

#

meanwhile LTC works wonderfully?

#

this is a super bizarre patch..

lapis parrot Nov 9, 2025, 7:43 PM

#

Jinx

#

sss

prime mica Nov 9, 2025, 7:44 PM

#

if it goes well, thoughts on a VLTC test?

violet badger Nov 9, 2025, 7:44 PM

#

rather SMP

prime mica Nov 9, 2025, 7:44 PM

#

if this scales nicely then it should pass very quickly anyway

#

oh sure

violet badger Nov 9, 2025, 7:44 PM

#

which I happen to run locally right now 😉

prime mica Nov 9, 2025, 7:44 PM

#

lol

#

Grace Hopper or "fitbit"

violet badger Nov 9, 2025, 7:44 PM

#

x86

prime mica Nov 9, 2025, 7:44 PM

#

cool beans

#

if this scales indefinitely TC-wise that would be so legendary

violet badger Nov 9, 2025, 7:45 PM

#

well, that never happens, but looking good SMP at 10+0.1

prime mica Nov 9, 2025, 7:45 PM

#

Torch shaking in its boots 😩

prime mica Nov 9, 2025, 7:45 PM

#

violet badger well, that never happens, but looking good SMP at 10+0.1

oh interesting

lapis parrot Nov 9, 2025, 7:45 PM

#

well in general it "does"

#

somewhat

prime mica Nov 9, 2025, 7:46 PM

#

is it bc of elo compression

lapis parrot Nov 9, 2025, 7:46 PM

#

at least 120+1.2 SPSA did scale way past 120+1.2

#

elo compression on uho books exists

violet badger Nov 9, 2025, 7:46 PM

#

'indefinitely' is poorly defined 😉

lapis parrot Nov 9, 2025, 7:46 PM

#

but it's not big

prime mica Nov 9, 2025, 7:46 PM

#

violet badger 'indefinitely' is poorly defined 😉

fair enough haha

violet badger Nov 9, 2025, 7:46 PM

#

chess is O(1)

prime mica Nov 9, 2025, 7:46 PM

#

true

lapis parrot Nov 9, 2025, 7:46 PM

#

yeah at infinity it will play perfect chess anyway

prime mica Nov 9, 2025, 7:46 PM

#

rare professor who cares about the big-O constant

prime mica Nov 9, 2025, 7:47 PM

#

lapis parrot yeah at infinity it will play perfect chess anyway

cosmic bit flip tho :)

lapis parrot Nov 9, 2025, 7:47 PM

#

just disable TT

violet badger Nov 9, 2025, 7:47 PM

#

we already documented one during SF development

lapis parrot Nov 9, 2025, 7:47 PM

#

morelayers

prime mica Nov 9, 2025, 7:47 PM

#

violet badger we already documented one during SF development

really??

#

that's awesome

violet badger Nov 9, 2025, 7:47 PM

#

let me find this..

green moat Nov 9, 2025, 7:47 PM

#

violet badger let me find this..

https://github.com/official-stockfish/WDL_model/issues/88#issuecomment-1775008814

violet badger Nov 9, 2025, 7:48 PM

#

nice you had the tab still open 😉

green moat Nov 9, 2025, 7:48 PM

#

A thumbs up by vondele.
My life is now complete
😭

#

😉

foggy wind Nov 9, 2025, 7:52 PM

#

prime mica when u have time if you could check out https://tests.stockfishchess.org/tests/l...

Result of 200 runs
==================
base (...fish.ti_base) =    2000882  +/- 4020
test (...apped-nunlet) =    1988707  +/- 3553
diff                   =     -12175  +/- 1773

speedup        = -0.0061
P(speedup > 0) =  0.0000

CPU: 16 x AMD Ryzen 9 9950X3D 16-Core Processor
Hyperthreading: on

prime mica Nov 9, 2025, 7:52 PM

#

yuck

#

all that cache...

violet badger Nov 9, 2025, 7:53 PM

#

meanwhile:

   # PLAYER    :  RATING  ERROR   POINTS  PLAYED   (%)
   1 patch     :     8.2    3.2  11799.0   23138    51
   2 master    :     0.0   ----  11339.0   23138    49

prime mica Nov 9, 2025, 7:53 PM

#

yoooo

#

time controls?

violet badger Nov 9, 2025, 7:54 PM

#

10+0.1t16

#

on x86

prime mica Nov 9, 2025, 7:54 PM

#

very promising

violet badger Nov 9, 2025, 7:54 PM

#

let me see what I get singlethreaded on the same hardware.

#

same hardware, same TC, single threaded

   # PLAYER    :  RATING  ERROR   POINTS  PLAYED   (%)
   1 master    :     0.0   ----  16984.0   32768    52
   2 patch     :   -13.0    2.8  15784.0   32768    48

prime mica Nov 9, 2025, 7:59 PM

#

yikes lmao

#

idk if we're ever gonna get a 7% speedup

foggy wind Nov 9, 2025, 8:00 PM

#

Looks like excellent scaling 😄

violet badger Nov 9, 2025, 8:00 PM

#

well, with that kind of scaling this might not be needed.

prime mica Nov 9, 2025, 8:00 PM

#

do we just ignore STC

violet badger Nov 9, 2025, 8:00 PM

#

not 'just ignore'

prime mica Nov 9, 2025, 8:00 PM

#

and hope that search patches will bring it up

violet badger Nov 9, 2025, 8:00 PM

#

we focus on LTC and SMP LTC (i.e. PT).

#

but STC is a great tool to get there...

prime mica Nov 9, 2025, 8:01 PM

#

excellent

violet badger Nov 9, 2025, 8:01 PM

#

especially for speedups 😉

prime mica Nov 9, 2025, 8:01 PM

#

lol

#

one trick pony

violet badger Nov 9, 2025, 8:01 PM

#

but that kind of difference between single threaded and multithreaded is kind of insane.

prime mica Nov 9, 2025, 8:02 PM

#

indeed...

#

I'ma do a similar test locally to see how it looks on Zen 5

#

do you have a script you used?

violet badger Nov 9, 2025, 8:03 PM

#

not really, but can share the fastchess commandline.

prime mica Nov 9, 2025, 8:03 PM

#

yeah sure

#

that's what I meant

violet badger Nov 9, 2025, 8:04 PM

#

threads=1
taskset --cpu-list $tasksetlow-$tasksethigh \
./fastchess -tournament roundrobin -concurrency $(($size/$threads)) -rounds 16 -games 2 -repeat -srand $RANDOM \
            -openings file=./UHO_Lichess_4852_v1.epd format=epd order=random\
            -engine name=master cmd=./stockfish.master.x86 tc=10+0.1\
            -engine name=patch cmd=./stockfish.patch.x86 tc=10+0.1\
            -config outname=config-foo\
            -pgnout file=games-foo.pgn\
            -each proto=uci option.Threads=$threads option.Hash=$((16*threads)) >& out-foo

prime mica Nov 9, 2025, 8:04 PM

#

gotcha

#

did you use SMT or no

violet badger Nov 9, 2025, 8:04 PM

#

yes.

prime mica Nov 9, 2025, 8:04 PM

#

cool beans

#

why is it not printing anything

#

or is that expecetd

violet badger Nov 9, 2025, 8:10 PM

#

look for a file named out-foo 😉

prime mica Nov 9, 2025, 8:10 PM

#

I am blind thank you

#

huzzah it's wroking

prime mica Nov 9, 2025, 8:27 PM

#

if you think it's worth the data, I'd try running the STC tournament with no SMT...

#

I'm suspecting that the i8->i16 conversion spam doesn't play well with SMT

#

(not that that's a solvable problem)

prime mica Nov 9, 2025, 8:43 PM

#

Results of master vs patch (10+0.1, 1t, 16MB, UHO_Lichess_4852_v1.epd):
Elo: -3.04 +/- 4.16, nElo: -5.90 +/- 8.08
LOS: 7.63 %, DrawRatio: 51.06 %, PairsRatio: 0.95
Games: 7094, Wins: 1827, Losses: 1889, Draws: 3378, Points: 3516.0 (49.56 %)
Ptnml(0-2): [32, 859, 1811, 829, 16], WL/DD Ratio: 1.14

ST penalty not quite so bad over here so far

violet badger Nov 9, 2025, 8:56 PM

#

so that's quite good.

#

With more threads (10+0.1t256) still good..

   # PLAYER    :  RATING  ERROR  POINTS  PLAYED   (%)
   1 patch     :    13.8   11.0  1063.5    2048    52
   2 master    :     0.0   ----   984.5    2048    48

prime mica Nov 9, 2025, 8:58 PM

#

numerous

violet badger Nov 9, 2025, 9:02 PM

#

Have you seen this https://tests.stockfishchess.org/actions?max_actions=1&action=&user=&text=&before=1762721564.042212&run_id=

#

make -j ARCH=x86-64-sse41-popcnt profile-build errors out

prime mica Nov 9, 2025, 9:05 PM

#

ugh

#

we never ported i8 to sse

#

just avx2+ and neon

#

should be easy enough

violet badger Nov 9, 2025, 9:05 PM

#

ok, part of the cleanup effort..

prime mica Nov 9, 2025, 9:06 PM

#

I'll do it rn, why not

violet badger Nov 9, 2025, 9:06 PM

#

sure

prime mica Nov 9, 2025, 9:06 PM

#

so much threat inputs progress in the past few weeks msheart_eyes

#

There are decades where nothing happens; and there are weeks where decades happen.

violet badger Nov 9, 2025, 9:07 PM

#

So, the multithreaded cousin of this one looks like:

Results of master vs patch (10+0.1, 8t, 64MB, UHO_Lichess_4852_v1.epd):
Elo: -17.93 +/- 13.31, nElo: -36.33 +/- 26.92
LOS: 0.41 %, DrawRatio: 51.25 %, PairsRatio: 0.66
Games: 640, Wins: 140, Losses: 173, Draws: 327, Points: 303.5 (47.42 %)
Ptnml(0-2): [1, 93, 164, 62, 0], WL/DD Ratio: 0.91

prime mica Nov 9, 2025, 9:07 PM

#

yikes what

#

I thought 16t was really good?

violet badger Nov 9, 2025, 9:07 PM

#

mind the order (master vs patch)

#

so roughly 25Elo difference on the same machine between 1t and 8t at STC

prime mica Nov 9, 2025, 9:08 PM

#

I get my signs right 50% of the time

#

as in, threat inputs is good?

violet badger Nov 9, 2025, 9:08 PM

#

yeah.

prime mica Nov 9, 2025, 9:08 PM

#

powerful

violet badger Nov 9, 2025, 9:08 PM

#

this is super bizarre.

#

but well.

#

good.

rocky vigil Nov 9, 2025, 9:10 PM

#

prime mica > There are decades where nothing happens; and there are weeks where decades hap...

in this case, there are months where nothing happens, and there are weeks where months happen

prime mica Nov 9, 2025, 9:10 PM

#

lol

#

Lenin is displeased

#

hmph how to polyfill _mm_cvtepi8_epi16 for < SSE 4.1

#

on SSSE3 you could pshufb + srai

rocky vigil Nov 9, 2025, 9:21 PM

#

yoshie has stuff https://github.com/Yoshie2000/PlentyChess/commit/2cd1b78bab4b77ccbe0af6efd2bbaa136d0fb32e

prime mica Nov 9, 2025, 9:21 PM

#

advanced

#

@stray reef is it ok if I copy this and would you like credit

rocky vigil Nov 9, 2025, 9:22 PM

#

so there is some SSSE3 thing I think

#

idk what to do for generic tho

prime mica Nov 9, 2025, 9:23 PM

#

the fallback looks SSE2 compaible

rocky vigil Nov 9, 2025, 9:23 PM

#

oh interesting

prime mica Nov 9, 2025, 9:23 PM

#

I'll write three implementations, one for SSE4.1, one for SSSE3, and one for SSE2

#

then we should be good to go

torn lagoon Nov 9, 2025, 9:25 PM

#

Sf doesn't support non-sse2?

prime mica Nov 9, 2025, 9:26 PM

#

well we'll have a generic C fallback tha's slow as molasses

#

not sure whether that's done yet

rocky vigil Nov 9, 2025, 9:30 PM

#

prime mica not sure whether that's done yet

the generic fallback is literally for loops

#

doesn't it get implicitly casted

prime mica Nov 9, 2025, 9:31 PM

#

lol if so then that's great

#

ok! all three versions have the right bench

#

https://github.com/anematode/Stockfish/tree/threat-inputs-sse-port

stray reef Nov 9, 2025, 9:45 PM

#

prime mica <@415167192296849409> is it ok if I copy this and would you like credit

feel free to copy it, don't have to credit, it probably sucks anyway

prime mica Nov 9, 2025, 9:45 PM

#

nah it's beautifu

#

thank u

foggy wind Nov 9, 2025, 9:49 PM

#

Wrong bench for general-64: Nodes searched : 3117291

prime mica Nov 9, 2025, 9:50 PM

#

🤦

#

ok lemme fix

#

while ur around would you mind benching https://tests.stockfishchess.org/tests/live_elo/691101f3ec1d00d2c195c6fd vs threat-inputs-i8?

foggy wind Nov 9, 2025, 9:50 PM

#

Does ARM already work without NEON? And 32-bit ARM?

prime mica Nov 9, 2025, 9:50 PM

#

non-NEON ARM will probably use the fallback

#

ngl I don't see why it's wrong...

#

huh, it's correct locally...

#

make -j build ARCH=general-64 right?

foggy wind Nov 9, 2025, 9:52 PM

#

yea

prime mica Nov 9, 2025, 9:53 PM

#

maybe (after ur done benching) you can try my SSE port branch...

foggy wind Nov 9, 2025, 9:53 PM

#

I did a profile-build, but it shouldn't matter

prime mica Nov 9, 2025, 9:53 PM

#

but I don't see how that would change it tbh

#

ye

foggy wind Nov 9, 2025, 9:53 PM

#

prime mica maybe (after ur done benching) you can try my SSE port branch...

I tried the sse branch

prime mica Nov 9, 2025, 9:53 PM

#

ughh

#

oh ok I can reproduce it now

#

I think I just forgot to build lmao

stray reef Nov 9, 2025, 9:54 PM

#

prime mica <@415167192296849409> is it ok if I copy this and would you like credit

btw the shifts are unnecessary in this version i think, i previously used _mm_set1_epi64 instead of _mm_cvtsi64_si128 and forgot to remove them

prime mica Nov 9, 2025, 9:54 PM

#

oh lol I see

#

ok honestly I have no clue why general-64 is bugged

rocky vigil Nov 9, 2025, 9:59 PM

#

That still uses vector

prime mica Nov 9, 2025, 9:59 PM

#

what

rocky vigil Nov 9, 2025, 9:59 PM

#

Only 32 bit uses generic fallback

#

Or smth

#

Idk

prime mica Nov 9, 2025, 9:59 PM

#

idt so

#

when I make changes to the generic fallback it changes the bench

#

        for (const auto index : removed)
        {
            const IndexType offset = Dimensions * index;

            for (IndexType j = 0; j < Dimensions; ++j)
                toAcc[j] = fromAcc[j] - featureTransformer.threatWeights[offset + j];

            for (std::size_t k = 0; k < PSQTBuckets; ++k)
                toPsqtAcc[k] =
                  fromPsqtAcc[k] - featureTransformer.threatPsqtWeights[index * PSQTBuckets + k];
        }

        for (const auto index : added)
        {
            const IndexType offset = Dimensions * index;

            for (IndexType j = 0; j < Dimensions; ++j)
                toAcc[j] += featureTransformer.threatWeights[offset + j];

            for (std::size_t k = 0; k < PSQTBuckets; ++k)
                toPsqtAcc[k] += featureTransformer.threatPsqtWeights[index * PSQTBuckets + k];
        }

#

I have to be missing something really obvious

rocky vigil Nov 9, 2025, 10:01 PM

#

prime mica ```cpp for (const auto index : removed) { const Inde...

Removed loop looks off

prime mica Nov 9, 2025, 10:01 PM

#

OH

#

-=

#

I am a dumbas

#

thx

foggy wind Nov 9, 2025, 10:02 PM

#

prime mica while ur around would you mind benching https://tests.stockfishchess.org/tests/l...

Result of 200 runs
==================
base (...fish.ti_base) =    1990667  +/- 3984
test (....emu-inlined) =    2048729  +/- 3966
diff                   =     +58062  +/- 2201

speedup        = +0.0292
P(speedup > 0) =  1.0000

prime mica Nov 9, 2025, 10:02 PM

#

ok not bad

#

better than ostrich which is what matters

#

rare force_inline W

#

thx as always <3

#

OK

#

sse2 inefficiency fixed, general-64 works again

#

so we should be good to go

prime mica Nov 9, 2025, 10:10 PM

#

prime mica while ur around would you mind benching https://tests.stockfishchess.org/tests/l...

@warm thistle if ur around I'd appreciate a bench on ur computer too

foggy wind Nov 9, 2025, 10:13 PM

#

prime mica better than ostrich which is what matters

hm same for me

Result of 200 runs
==================
base (...fish.ti_base) =    1996106  +/- 4152
test (...fish.ostrich) =    2056736  +/- 4537
diff                   =     +60631  +/- 2106

speedup        = +0.0304
P(speedup > 0) =  1.0000

prime mica Nov 9, 2025, 10:13 PM

#

O nvm huh

#

it rly rips through the indexing on your computer lol

#

ok well we'll wait for fishtest then

warm thistle Nov 9, 2025, 10:17 PM

#

prime mica <@458684594703695872> if ur around I'd appreciate a bench on ur computer too

on it

#

Result of  20 runs
==================
base (./sf-old       ) =    1385104  +/- 7425
test (./stockfish    ) =    1420016  +/- 9039
diff                   =     +34912  +/- 3671

speedup        = +0.0252
P(speedup > 0) =  1.0000

CPU: 8 x AMD Ryzen 7 7700X 8-Core Processor
Hyperthreading: on

prime mica Nov 9, 2025, 10:20 PM

#

hmph ok

#

so similar to ostrich as well

violet badger Nov 9, 2025, 10:52 PM

#

no surprise, but good:

Verify node counts: 
               g++-9 :    2324801
              g++-10 :    2324801
              g++-11 :    2324801
              g++-12 :    2324801
              g++-13 :    2324801
          clang++-11 :    2324801
          clang++-12 :    2324801
          clang++-13 :    2324801
          clang++-14 :    2324801
          clang++-15 :    2324801
          clang++-16 :    2324801
          clang++-17 :    2324801
          clang++-18 :    2324801
          clang++-19 :    2324801
          clang++-20 :    2324801

#

I should probably add a loop over our architectures..

#

time to call it a day. I suggest to start both SMP runs on fishtest once the LTC passes.

prime mica Nov 9, 2025, 10:58 PM

#

gn!

frosty imp Nov 10, 2025, 12:11 AM

#

are we preparing for the PR now?

lapis parrot Nov 10, 2025, 1:44 AM

#

it's only at 2.6 LLR though

twilit oriole Nov 10, 2025, 1:56 AM

#

It passed

warm thistle Nov 10, 2025, 1:56 AM

#

🎉

shell breach Nov 10, 2025, 2:43 AM

#

🎉🥳🎉🥳🎉🥳🥳🎊🎊🎊🍻🍻🍻

frosty imp Nov 10, 2025, 2:47 AM

#

I'm assuming the branch is threat-i8-QA-255?

#

shouldn't the smallnet also be updated with the QA=255 quantization

#

@violet badger would it be possible to look into merging nnue-pytorch#370? I have some refactors planned that should make the feature system easier to work with

rocky vigil Nov 10, 2025, 3:14 AM

#

twilit oriole It passed

Alright lemme send the ltc smp in

rocky vigil Nov 10, 2025, 3:22 AM

#

frosty imp I'm assuming the branch is threat-i8-QA-255?

yep

rocky vigil Nov 10, 2025, 3:22 AM

#

frosty imp shouldn't the smallnet also be updated with the QA=255 quantization

this is a separate patch that we can test later

frosty imp Nov 10, 2025, 3:24 AM

#

same can be said for all of those though

rocky vigil Nov 10, 2025, 3:26 AM

#

sure, how long will smallnet training take?

#

i think the threat-i8-QA-255 branch can also be used to train a smallnet

frosty imp Nov 10, 2025, 3:26 AM

#

we can just requantize?

rocky vigil Nov 10, 2025, 3:26 AM

#

just use HalfKAv2_hm^ feature set

frosty imp Nov 10, 2025, 3:27 AM

#

rocky vigil i think the threat-i8-QA-255 branch can also be used to train a smallnet

isn't the threat weight clipping hard coded

rocky vigil Nov 10, 2025, 3:27 AM

#

oh right

#

nvm

rocky vigil Nov 10, 2025, 3:27 AM

#

frosty imp we can just requantize?

does anyone have the original checkpoint though

frosty imp Nov 10, 2025, 3:27 AM

#

ig just requantizing from nnue

rocky vigil Nov 10, 2025, 3:30 AM

#

frosty imp ig just requantizing from nnue

ok try to do this as a simpl i guess

#

replace x with x * 255 / 127

#

actually no

#

just replace x with x * 2

prime mica Nov 10, 2025, 3:45 AM

#

let's gooooo

#

so proud of everyone

dark stream Nov 10, 2025, 3:53 AM

#

So, if this passes, then will it be merged? Or will you all try for more first?

prime mica Nov 10, 2025, 3:53 AM

#

a lot of cleanup work to do first...

#

@frosty imp you've already done a lot of cleaning up right

frosty imp Nov 10, 2025, 3:54 AM

#

somewhat

#

the later additions were not cleaned up at all

prime mica Nov 10, 2025, 3:54 AM

#

gotcha

frosty imp Nov 10, 2025, 3:55 AM

#

I think some of the nicer inference cleanups need trainer side coordination

prime mica Nov 10, 2025, 3:56 AM

#

ah

frosty imp Nov 10, 2025, 3:56 AM

#

but threat index calculation & co should be fine

prime mica Nov 10, 2025, 3:57 AM

#

anything I can help with?

frosty imp Nov 10, 2025, 3:57 AM

#

ofc

#

just clean up anything you see

prime mica Nov 10, 2025, 3:58 AM

#

lol ok

#

and then PR to your fork?

frosty imp Nov 10, 2025, 3:58 AM

#

I have i8 merged but not your speedup

prime mica Nov 10, 2025, 3:59 AM

#

kk

#

https://github.com/xu-shawn/Stockfish/tree/threat_inputs_refactor this one?

GitHub

GitHub - xu-shawn/Stockfish at threat_inputs_refactor

A free and strong UCI chess engine. Contribute to xu-shawn/Stockfish development by creating an account on GitHub.

frosty imp Nov 10, 2025, 4:03 AM

#

https://github.com/xu-shawn/Stockfish/tree/threat_inputs

#

the refactor branch is wip. need trainer side changes

prime mica Nov 10, 2025, 4:03 AM

#

kk

frosty imp Nov 10, 2025, 4:07 AM

#

frosty imp <https://github.com/xu-shawn/Stockfish/tree/threat_inputs>

this branch should be updated now

#

with clang-format

prime mica Nov 10, 2025, 4:07 AM

#

huzzah

#

will you apply the diff to the most recent SPRTs yourself or should I do that and PR it

frosty imp Nov 10, 2025, 4:25 AM

#

PR plz

#

oh cool I see the PR

prime mica Nov 10, 2025, 4:31 AM

#

prime mica will you apply the diff to the most recent SPRTs yourself or should I do that an...

oh these are already included

#

I'll clean them up a bit though

frosty imp Nov 10, 2025, 4:42 AM

#

oops I broke the compile by removing the friend struct Position thing

prime mica Nov 10, 2025, 4:42 AM

#

friendship breakups suck

#

lmk when you've fixed

#

oh u did ok

frosty imp Nov 10, 2025, 4:47 AM

#

fix already pushed

#

ye

prime mica Nov 10, 2025, 4:57 AM

#

huh I get a segfault with sanitize=undefined,address

#

oh well we'll figure it out later

#

seems to be a misaligned struct

#

it's segfaulting on a memcpy that expanded to vmovdqa instructions

frosty imp Nov 10, 2025, 5:03 AM

#

hmm

prime mica Nov 10, 2025, 5:05 AM

#

https://github.com/xu-shawn/Stockfish/pull/29 anyway this PR fixes an OOB read in my LUTslop

frosty imp Nov 10, 2025, 5:06 AM

#

merged

prime mica Nov 10, 2025, 5:06 AM

#

danke

naive comet Nov 10, 2025, 5:07 AM

#

Shawn have you clang formatted

prime mica Nov 10, 2025, 5:07 AM

#

yes

frosty imp Nov 10, 2025, 5:07 AM

#

yeah

prime mica Nov 10, 2025, 5:30 AM

#

honestly the code isn't that bad

#

the only serious pain point imo is nnue_accumulator.cpp

#

which I gather u've been working on

violet badger Nov 10, 2025, 5:31 AM

#

frosty imp <@713871252246495262> would it be possible to look into merging nnue-pytorch#370...

will be traveling today. I assume that needs some light testing at least?

frosty imp Nov 10, 2025, 5:31 AM

#

I would probably do that to be safe

#

although it's a simple reorganization. probably nothing will go wrong

rocky vigil Nov 10, 2025, 5:32 AM

#

https://tests.stockfishchess.org/tests/live_elo/69117499ec1d00d2c195c804

#

shawn

violet badger Nov 10, 2025, 5:32 AM

#

yeah, so will be a bit later.

rocky vigil Nov 10, 2025, 5:32 AM

#

u've done smth wrong

prime mica Nov 10, 2025, 5:32 AM

#

sss

#

:P

prime mica Nov 10, 2025, 5:33 AM

#

violet badger will be traveling today. I assume that needs some light testing at least?

ooh where are you going?

violet badger Nov 10, 2025, 5:33 AM

#

just work..

#

meanwhile, some results for 60+0.6t256.

   # PLAYER    :  RATING  ERROR  POINTS  PLAYED   (%)
   1 patch     :     6.3   10.7  1042.0    2048    51
   2 master    :     0.0   ----  1006.0    2048    49

A bit sss, but looks good.

rocky vigil Nov 10, 2025, 5:34 AM

#

looks decent yeah

#

no horrible regression at tcec conditions

violet badger Nov 10, 2025, 5:35 AM

#

well, likely quite reasonable progress

rocky vigil Nov 10, 2025, 5:36 AM

#

@frosty imp how did you requantize the smallnet?

frosty imp Nov 10, 2025, 5:38 AM

#

rocky vigil <@453859636890828802> how did you requantize the smallnet?

converted .nnue to .pt in master nnue-pytorch

#

then used https://github.com/xu-shawn/nnue-pytorch/tree/QA_255 to convert it back to NNUE

rocky vigil Nov 10, 2025, 5:41 AM

#

try just multiplying every weight by 2 in the nnue

frosty imp Nov 10, 2025, 5:41 AM

#

hmm isn't that 254 quant tho

naive comet Nov 10, 2025, 5:42 AM

#

doesn't matter practically speaking

frosty imp Nov 10, 2025, 5:42 AM

#

i'll try that later

frosty imp Nov 10, 2025, 6:38 AM

#

https://tests.stockfishchess.org/tests/view/69115a26ec1d00d2c195c7cd

#

wrong hash on this one oof

prime mica Nov 10, 2025, 6:39 AM

#

is it an SSE

#

yeah it is

#

so that's why, the test doesn't have my SSE patch

frosty imp Nov 10, 2025, 6:40 AM

#

oh I mean hash size

#

prime mica Nov 10, 2025, 6:40 AM

#

ohhhh

#

what's it supposed to be?

#

128?

frosty imp Nov 10, 2025, 6:41 AM

#

512

prime mica Nov 10, 2025, 6:41 AM

#

O

#

big boi

rocky vigil Nov 10, 2025, 6:45 AM

#

💀

#

eh

#

who cares

stray reef Nov 10, 2025, 7:04 AM

#

a new era of nnue just started, great job everyone peepoHappy

prime mica Nov 10, 2025, 7:04 AM

#

all thx to u

#

(and many others)

naive comet Nov 10, 2025, 7:06 AM

#

LFG bois LFG

#

I reckon many speedups to come too

stray reef Nov 10, 2025, 7:10 AM

#

so many good things coming from this at once. master net will be reproducable again, there will be new nets again after a long time, no spsa needed rn, probably some smart speedups incoming, SF 18 is coming 🚀

prime mica Nov 10, 2025, 7:13 AM

#

naive comet I reckon many speedups to come too

yessir I have a few ideas still...

#

and obviously others will find fruit

violet badger Nov 10, 2025, 7:20 AM

#

frosty imp <@713871252246495262> would it be possible to look into merging nnue-pytorch#370...

ok, gave it some testing, have a look at the PR for some copilot comments.

#

I assume that is a step towards getting threats into the main brach, right?

frosty imp Nov 10, 2025, 7:38 AM

#

should allow refactoring feature transformers in the next PR, which will make getting threats in main easy

lapis parrot Nov 10, 2025, 8:29 AM

#

prime mica 128?

there is a button

#

to set up VLTC+

#

you need to click ***

dark stream Nov 10, 2025, 8:34 AM

#

If the running test ends the way it is looking like, then threat inputs does indeed scale very well.

prime mica Nov 10, 2025, 8:36 AM

#

life is good

dark stream Nov 10, 2025, 8:37 AM

#

prime mica life is good

Thanks for your hard work, and everybody's in general.

foggy wind Nov 10, 2025, 10:02 AM

#

The LTC looked much more x86 friendly.

GROUPED BY ARCH

64bit AVX512ICL VNNI AVX512 BMI2 AVX2 SSE41 SSSE3 SSE2 POPCNT | Elo:     3.24 ±    2.91 | LOS:  98.6% | LLR:  1.11 | [9, 1429, 3509, 1528, 20]
64bit BMI2 AVX2 SSE41 SSSE3 SSE2 POPCNT                       | Elo:     2.63 ±    4.49 | LOS:  87.4% | LLR:  0.35 | [8, 590, 1469, 637, 5]
64bit AVX2 SSE41 SSSE3 SSE2 POPCNT                            | Elo:     2.47 ±    6.17 | LOS:  78.4% | LLR:  0.17 | [2, 304, 774, 320, 4]
64bit VNNI BMI2 AVX2 SSE41 SSSE3 SSE2 POPCNT                  | Elo:    10.76 ±    6.57 | LOS:  99.9% | LLR:  0.86 | [0, 242, 669, 314, 2]
64bit AVX512 BMI2 AVX2 SSE41 SSSE3 SSE2 POPCNT                | Elo:    -1.66 ±    8.16 | LOS:  34.5% | LLR: -0.13 | [1, 200, 444, 190, 2]
64bit VNNI AVX512 BMI2 AVX2 SSE41 SSSE3 SSE2 POPCNT           | Elo:    -2.00 ±    8.39 | LOS:  32.0% | LLR: -0.15 | [2, 183, 417, 178, 0]
64bit POPCNT NEON_DOTPROD                                     | Elo:    23.63 ±   10.79 | LOS: 100.0% | LLR:  0.71 | [1, 85, 248, 151, 1]

GROUPED BY x86

x86 | Elo:     3.11 ±    2.01 | LOS:  99.9% | LLR:  2.19 | [22, 2948, 7282, 3167, 33]
ARM | Elo:    23.63 ±   10.79 | LOS: 100.0% | LLR:  0.71 | [1, 85, 248, 151, 1]

green moat Nov 10, 2025, 11:40 AM

#

naive comet I reckon many speedups to come too

And net SPSA as well...
🙂

#

eventually

naive comet Nov 10, 2025, 11:53 AM

#

prime mica yessir I have a few ideas still...

me too btw

torn lagoon Nov 10, 2025, 11:55 AM

#

green moat And net SPSA as well... 🙂

I believe it was agreed this won't happen?

vestal gale Nov 10, 2025, 11:56 AM

#

sprt?

green moat Nov 10, 2025, 11:57 AM

#

torn lagoon I believe it was agreed this won't happen?

As I understand it will happen eventually, when no more Elo could be squeezed

green moat Nov 10, 2025, 11:58 AM

#

vestal gale sprt?

SPSA, sorry, I fixed it

#

Are there any preliminary results on L2=31 TI nets?

#

Tomorrow the nets produced with this job will be available:
https://gitlab.com/cscs-ci/ci-testing/webhook-ci/mirrors/5137461961076608/2926829081096545/-/pipelines/2149181872

torn lagoon Nov 10, 2025, 12:22 PM

#

green moat As I understand it will happen eventually, when no more Elo could be squeezed

You can always train a better net

#

So this point will never come

finite wind Nov 10, 2025, 12:24 PM

#

What is the current elo vs master?

lapis parrot Nov 10, 2025, 12:25 PM

#

2 stc 3,5 ltc

#

at least SPRT elo wise

finite wind Nov 10, 2025, 12:30 PM

#

Over 10k posts in this thread, long battle 👍

rocky vigil Nov 10, 2025, 12:30 PM

#

foggy wind The LTC looked much more x86 friendly. ``` GROUPED BY ARCH 64bit AVX512ICL VNNI...

ltc by ISA type

lapis parrot Nov 10, 2025, 2:39 PM

#

so make a PR? test will pass soon I guess

twilit oriole Nov 10, 2025, 2:40 PM

#

what branch is it again

#

like is the final branch going to be the shawn one or the one sscg13 has on the test

#

For the PR message need to decide how much detail to go in. Like do I discuss the alternate schemes that failed before this one (or things that were tried and failed in general) or stick to just explaining the final product

lapis parrot Nov 10, 2025, 2:43 PM

#

I think you should write failed alternates in a separate doc

#

and link it in PR

#

otherwise it's too much text

twilit oriole Nov 10, 2025, 2:43 PM

#

yep

#

if someone wants to make that doc feel free. and then i can just add the section talking about the other failed input schemes

#

it would be good to summarise the findings of this 10k messages i think

rocky vigil Nov 10, 2025, 2:51 PM

#

twilit oriole like is the final branch going to be the shawn one or the one sscg13 has on the ...

Shawn one probably with extra cleanups and stuff

rocky vigil Nov 10, 2025, 2:51 PM

#

twilit oriole it would be good to summarise the findings of this 10k messages i think

Yes this is important I think

regal steeple Nov 10, 2025, 2:59 PM

#

twilit oriole For the PR message need to decide how much detail to go in. Like do I discuss th...

I feel like sscg or shawn should make the pr if they wish to do so since they put in the vast majority of the work, not to discredit you but it feels a little weird that you are the one making the pr when you """only""" proposed the idea

rocky vigil Nov 10, 2025, 3:02 PM

#

I personally don’t care, I think it would make most sense to have Shawn make the pr since his branch is being actively updated with cleanup work

#

There are also corresponding PRs Shawn and I need to make to nnue-PyTorch

twilit oriole Nov 10, 2025, 3:04 PM

#

regal steeple I feel like sscg or shawn should make the pr if they wish to do so since they pu...

I don't think that is true, there was months of prior work to land on this scheme and do a first test in Stockfish

regal steeple Nov 10, 2025, 3:04 PM

#

But you did not do any stuff to get it to work in sf (which is where this pr is made)?

formal smelt Nov 10, 2025, 3:05 PM

#

i agree one of shawn/sscg should open the pr
is the pr not gonna have like 10 quadrillion coauthors anyway?

rocky vigil Nov 10, 2025, 3:05 PM

#

Yeah

formal smelt Nov 10, 2025, 3:06 PM

#

coolio dont forget me :p else i'll be briefly sad

prime mica Nov 10, 2025, 3:06 PM

#

ofc

#

shall we create a Google Doc

rocky vigil Nov 10, 2025, 3:06 PM

#

Don’t worry viren claims to have a big list

formal smelt Nov 10, 2025, 3:07 PM

#

its just his name in a very large font size

rocky vigil Nov 10, 2025, 3:07 PM

#

Might be time to reveal :P

formal smelt Nov 10, 2025, 3:08 PM

#

viren me lofty yoshie sscg shawn and then all the SF speedup gang?

#

in chronological order even

rocky vigil Nov 10, 2025, 3:08 PM

#

~~ in chronological order I think I come before yoshie~~

#

Disservin, vondele, linrock also need to be credited

formal smelt Nov 10, 2025, 3:09 PM

#

this will be the holiest PR in existence

rocky vigil Nov 10, 2025, 3:09 PM

#

Tbf I think viren should just reveal the list

formal smelt Nov 10, 2025, 3:10 PM

#

what are the current elo numbers at STC/LTC/SMP?

rocky vigil Nov 10, 2025, 3:10 PM

#

formal smelt this will be the holiest PR in existence

Can we make it larger than the original nnue pr

twilit oriole Nov 10, 2025, 3:10 PM

#

Yeah I will I'm on phone rn I don't have it on me lol

rocky vigil Nov 10, 2025, 3:10 PM

#

formal smelt what are the current elo numbers at STC/LTC/SMP?

2 / 3.5 / 6 so far

twilit oriole Nov 10, 2025, 3:12 PM

#

formal smelt its just his name in a very large font size

How else will I farm this for the resume Kappa tbh it probably doesn't matter all that much now

rocky vigil Nov 10, 2025, 3:13 PM

#

I’m happy now that when I say I’m a sf dev it doesn’t mean I just made a one line simp

rocky vigil Nov 10, 2025, 3:15 PM

#

rocky vigil 2 / 3.5 / 6 so far

This is huge w/o spsa, the comparable master arch number is -5

rocky vigil Nov 10, 2025, 3:16 PM

#

prime mica shall we create a Google Doc

Yeah if you can guarantee it’ll last

stray gyro Nov 10, 2025, 3:16 PM

#

Increased TP of the current VLTC test to 100%...

rocky vigil Nov 10, 2025, 3:16 PM

#

I don’t trust my personal acc with important stuff like this

#

That’ll get referenced many times in the future

twilit oriole Nov 10, 2025, 3:17 PM

#

It will not be referenced directly. It will be downloaded and attached through GitHub lol

#

It's only for the collab stage

rocky vigil Nov 10, 2025, 3:17 PM

#

Oh cool

stray gyro Nov 10, 2025, 3:17 PM

#

^

rocky vigil Nov 10, 2025, 3:17 PM

#

Yeah just make one then

prime mica Nov 10, 2025, 3:19 PM

#

kk

#

DM me your emails? (Or put them here)

rocky vigil Nov 10, 2025, 3:20 PM

#

Mine is the same one that appears on my github

prime mica Nov 10, 2025, 3:21 PM

#

🐻

twilit oriole Nov 10, 2025, 3:21 PM

#

Just share it publicly

rocky vigil Nov 10, 2025, 3:21 PM

#

That also works

#

I don’t think anyone will grief

twilit oriole Nov 10, 2025, 3:22 PM

#

Well it has version history anyways

prime mica Nov 10, 2025, 3:26 PM

#

https://docs.google.com/document/d/1ju8PKCBmDFT0JxhErczeN87XaOThWKZEobOEMWTpdZk/edit?usp=sharing

Google Docs

Stockfish threat inputs PR summary

#

Finally snowy egret has a chance ugh

#

The memcpys were pissing me off

#

Maybe we should add a proper move assignment operator to ValueList

#

which won’t fix the problem but at least it won’t copy the whole thing

#

I think finding an upper bound for the threats list size no longer matters tho with egret

#

Speedups don't seem terribly important for the PR description right

rocky vigil Nov 10, 2025, 3:47 PM

#

idk might as well highlight

#

the effort

prime mica Nov 10, 2025, 3:48 PM

#

maybe we briefly describe the most important ones?

rocky vigil Nov 10, 2025, 3:48 PM

#

like if we're gonna make a big doc

#

we have plenty of space for everything

prime mica Nov 10, 2025, 3:48 PM

#

I'm planning to write an in-depth blog post about it (bc some of the techniques are interesting imo) so we can also link that

rocky vigil Nov 10, 2025, 3:48 PM

#

ah nice

prime mica Nov 10, 2025, 3:52 PM

#

aura

rocky vigil Nov 10, 2025, 4:04 PM

#

wow writing this stuff is harder than I though

prime mica Nov 10, 2025, 4:05 PM

#

just stream of consciousness it!

#

and then we can reorganize

#

@foggy wind would u mind benching https://tests.stockfishchess.org/tests/live_elo/6911b37fec1d00d2c195c8f8

#

works ok locally

#

trolled by a loongarch worker lmaoo

#

I should do a loong vsx port some time

foggy wind Nov 10, 2025, 4:22 PM

#

prime mica <@398510765910523904> would u mind benching https://tests.stockfishchess.org/tes...

Result of 200 runs
==================
base (...fish.ostrich) =    2056887  +/- 4406
test (...grine-falcon) =    2051799  +/- 4516
diff                   =      -5087  +/- 2380

speedup        = -0.0025
P(speedup > 0) =  0.0000

prime mica Nov 10, 2025, 4:22 PM

#

ugh

rocky vigil Nov 10, 2025, 4:22 PM

#

oh yeah meanwhile

prime mica Nov 10, 2025, 4:22 PM

#

I wonder if it gets inlined, are you using clang?

rocky vigil Nov 10, 2025, 4:22 PM

#

the VVLTC with 1/2 hash for both sides

#

officially passed

prime mica Nov 10, 2025, 4:23 PM

#

🥳

rocky vigil Nov 10, 2025, 4:23 PM

#

so yep

#

need cleanups

#

and preparing of the PR

foggy wind Nov 10, 2025, 4:23 PM

#

Congratulations to everyone who put in a lot of hard work 🙂

prime mica Nov 10, 2025, 4:23 PM

#

thank u for all the hlep

rocky vigil Nov 10, 2025, 4:23 PM

#

there is still a bit more to come

#

in terms of cleanup work

foggy wind Nov 10, 2025, 4:24 PM

#

there is also still this warning:

position.cpp: In member function 'void Stockfish::Position::update_piece_threats(Stockfish::Piece, Stockfish::Square, Stockfish::DirtyThreats*)':
position.cpp:1104:18: warning: declaration of 'threatened' shadows a previous local [-Wshadow]
 1104 |         Bitboard threatened = ray & qAttacks & occupied;
      |                  ^~~~~~~~~~
position.cpp:1057:14: note: shadowed declaration is here
 1057 |     Bitboard threatened;
      |              ^~~~~~~~~~

prime mica Nov 10, 2025, 4:24 PM

#

yeah that'll be fixed in cleanup

#

not a code error, just slightly sloppy code

rocky vigil Nov 10, 2025, 4:28 PM

#

actually it probably wouldn't hurt to take a look at itnow

green moat Nov 10, 2025, 5:57 PM

#

Meanwhile Stage 4 net with L2=31 available.
https://gitlab.com/cscs-ci/ci-testing/webhook-ci/mirrors/5137461961076608/2926829081096545/-/jobs/12028659745/artifacts/download
Tomorrow morning (CET) Stage 5 net will be available

rocky vigil Nov 10, 2025, 6:15 PM

#

Btw on net format, let’s try to print i8 verbatim and then leb128 for i16

#

And update the trainer accordingly

prime mica Nov 10, 2025, 6:24 PM

#

is there a reason we use leb128 instead of something a bit simpler and more compact

#

the vast marjotiy of weights are in [-127,127] so

#

the format can just be 0x80 + (2 bytes) for i16s that don't fit, and the literal value otherwise (which will be sign extended)

#

should be like 10% smaller

rocky vigil Nov 10, 2025, 6:25 PM

#

I mean we could unironically just write the weights verbatim

prime mica Nov 10, 2025, 6:26 PM

#

based

rocky vigil Nov 10, 2025, 6:26 PM

#

The sad part is because of packus preprocessing it doesn’t simplify memory sharing

rocky vigil Nov 10, 2025, 6:27 PM

#

rocky vigil Btw on net format, let’s try to print i8 verbatim and then leb128 for i16

@frosty imp does this sound good?
when we agree on a new net format afterwards I can modify the bitcoin miner script

prime mica Nov 10, 2025, 6:29 PM

#

lol

green moat Nov 10, 2025, 7:00 PM

#

snowy-egret-2 passed
https://tests.stockfishchess.org/tests/live_elo/69108025ec1d00d2c195c5d6

rocky vigil Nov 10, 2025, 7:06 PM

#

cool

#

more merges

violet badger Nov 10, 2025, 7:18 PM

#

rocky vigil officially passed

congrats... please go ahead with the PR.

rocky vigil Nov 10, 2025, 7:19 PM

#

i think there are still many cleanups to make

violet badger Nov 10, 2025, 7:19 PM

#

yes sure

rocky vigil Nov 10, 2025, 7:19 PM

#

we haven't prepared the PR message

frosty imp Nov 10, 2025, 7:19 PM

#

is the doc supposed to be the PR message

rocky vigil Nov 10, 2025, 7:20 PM

#

i think we could also include like the most important parts

rocky vigil Nov 10, 2025, 7:20 PM

#

green moat snowy-egret-2 passed https://tests.stockfishchess.org/tests/live_elo/69108025ec1...

also what shall we do with this

lapis parrot Nov 10, 2025, 7:20 PM

#

idk just make PR message "new arch" with SPRT results

#

the end

rocky vigil Nov 10, 2025, 7:20 PM

#

append it to the pr?

prime mica Nov 10, 2025, 7:20 PM

#

lapis parrot idk just make PR message "new arch" with SPRT results

good

prime mica Nov 10, 2025, 7:20 PM

#

rocky vigil append it to the pr?

sure, up to you

#

we can also just merge it later into master

frosty imp Nov 10, 2025, 7:20 PM

#

https://github.com/official-stockfish/Stockfish/pull/5149

frosty imp Nov 10, 2025, 7:21 PM

#

frosty imp <https://github.com/official-stockfish/Stockfish/pull/5149>

can just imitate this

prime mica Nov 10, 2025, 7:21 PM

#

frosty imp can just imitate this

seems nice and concise lol

#

personally I'd just link to an external doc or Wiki entry?

#

for a more extensive explanation

rocky vigil Nov 10, 2025, 7:21 PM

#

I think in the actual message we just put the SPRT results, a brief description of threat inputs, and the contributors

#

we are also waiting on Viren's list of contributors over time

rocky vigil Nov 10, 2025, 7:23 PM

#

prime mica https://docs.google.com/document/d/1ju8PKCBmDFT0JxhErczeN87XaOThWKZEobOEMWTpdZk/...

viren also wanted to write down the exploration process here and other detailed stuff

#

so let's wait for that

#

i mean there's no rush

rocky vigil Nov 10, 2025, 7:23 PM

#

rocky vigil <@453859636890828802> does this sound good? when we agree on a new net format a...

in the meanwhile what are opinions on this

green moat Nov 10, 2025, 7:24 PM

#

guys, the PR description is not important. It can be modified/updated afterwards

frosty imp Nov 10, 2025, 7:24 PM

#

rocky vigil in the meanwhile what are opinions on this

I can try that after refactoring the feature modules

rocky vigil Nov 10, 2025, 7:24 PM

#

ok cool

#

i would also prefer that like we be able to make the prs to sf / nnue-pytorch at the same time

#

since they're companion prs

frosty imp Nov 10, 2025, 7:25 PM

#

what if we create an issue first with the list of tasks

#

https://github.com/official-stockfish/Stockfish/issues/2823

#

like this

rocky vigil Nov 10, 2025, 7:25 PM

#

yeah that seems good

violet badger Nov 10, 2025, 7:25 PM

#

rocky vigil in the meanwhile what are opinions on this

I'd avoid that, waste of compute, I also would really like the net to be the result of the ci pipeline, or fully captured by the nettest script.

rocky vigil Nov 10, 2025, 7:25 PM

#

ok cool

#

so just the random shas then

violet badger Nov 10, 2025, 7:25 PM

#

yes

rocky vigil Nov 10, 2025, 7:26 PM

#

as for the net format itself?

#

i think LEB128 on i8 weights is a waste

violet badger Nov 10, 2025, 7:26 PM

#

I would keep it, or we have to redo training?

frosty imp Nov 10, 2025, 7:26 PM

#

I would say threat weights and psqt weights be stored separately

violet badger Nov 10, 2025, 7:26 PM

#

which we can of course

frosty imp Nov 10, 2025, 7:26 PM

#

yeah

rocky vigil Nov 10, 2025, 7:26 PM

#

wouldn't need to redo training, is just a change in serializing net

#

i think

#

the weights themselves would remain the same

#

just formatted differently

violet badger Nov 10, 2025, 7:27 PM

#

well, I would at need fix the script ... new trainer sha would need rerun.

#

more after dinner 😉

green moat Nov 10, 2025, 7:27 PM

#

Is there any work/experiment to do on smallnet?

rocky vigil Nov 10, 2025, 7:27 PM

#

green moat Is there any work/experiment to do on smallnet?

yes, attempt to re-quantize it to QA=255

green moat Nov 10, 2025, 7:28 PM

#

also, sscg13, don't forget snowy-egret 2
🙂

#

https://tests.stockfishchess.org/tests/view/69108025ec1d00d2c195c5d6

prime mica Nov 10, 2025, 7:28 PM

#

meh

rocky vigil Nov 10, 2025, 7:28 PM

#

i think anematode can just tack it onto shawn's branch

prime mica Nov 10, 2025, 7:28 PM

#

We can do it later imo

#

Or that

#

it’s a small change

rocky vigil Nov 10, 2025, 7:29 PM

#

it's not functional

#

I think we should hold off on all functional changes (i.e. requantize smallnet) until after merge

prime mica Nov 10, 2025, 7:29 PM

#

great

green moat Nov 10, 2025, 7:30 PM

#

Also, tomorrow morning we will have Stage 5 net with L2=31.
Stage 4 net already available, should we test it?
🤔

rocky vigil Nov 10, 2025, 7:36 PM

#

rocky vigil I think we should hold off on all functional changes (i.e. requantize smallnet) ...

^^

frosty imp Nov 10, 2025, 7:47 PM

#

I think it's fine to merge if tested

violet badger Nov 10, 2025, 7:48 PM

#

we should now work towards a mergable PR. There should probably be still a final test to verify the cleanup didn't introduce a regression or so. However, larger change, I would keep for afterwards and use the normal process to improve stuff.

prime mica Nov 10, 2025, 7:49 PM

#

Agree

#

The flurry of improvements will last a while probably

#

So might as well get it in

twilit oriole Nov 10, 2025, 7:52 PM

#

i think if possible try to avoid links in the text itself, i think take them outside to the top of the page. otherwise it is a bit much

#

like just list out monty, yukari, plentychess prs, nnue trainer prs etc. before going into the text

rocky vigil Nov 10, 2025, 7:54 PM

#

ok

#

how to get a link to the original pdf in the first message

twilit oriole Nov 10, 2025, 8:07 PM

#

hm i think we can actually put that in the PR message itself instead. and attached through github. It seems to be the easiest way to understand the general concept quickly

#

A graphical version of an earlier scheme (with less refinement) that illustrates the core concepts can be found at: <Link the initial monty inputs v2 pdf> I copy it here for later

rocky vigil Nov 10, 2025, 8:14 PM

#

tbh I don't think the doc really needs to go into detail about speedups seeing as it's more to serve as an introduction into threat inputs

prime mica Nov 10, 2025, 8:15 PM

#

agree

#

it's tangential

twilit oriole Nov 10, 2025, 8:15 PM

#

hmm well if someone does write it it's still better even if not strictly necessary. otherwise those concepts will be lost to time

#

it's not like you can git blame the speedups itself

prime mica Nov 10, 2025, 8:16 PM

#

I'm planning to write a blog post

#

would be happy to include the others' work as well

rocky vigil Nov 10, 2025, 8:16 PM

#

prime mica I'm planning to write a blog post

yeah that's a good place to add it

#

and we can always link it later

prime mica Nov 10, 2025, 8:16 PM

#

great

rocky vigil Nov 10, 2025, 8:18 PM

#

what was the range of x86 speed loss again compared to master?

twilit oriole Nov 10, 2025, 8:18 PM

#

For the PR message itself I think the PR links of the 3 engines (Monty, Yukari, Plentychess) being included is also probably important. Maybe that PR link section can just move there itself

#

I think its just going to be a bunch of links and short summary. Only way to have it condense

rocky vigil Nov 10, 2025, 8:19 PM

#

yep

lapis parrot Nov 10, 2025, 8:20 PM

#

I would recommend to not overdo a PR

rocky vigil Nov 10, 2025, 8:20 PM

#

rocky vigil what was the range of x86 speed loss again compared to master?

i will just give a range of 15 to 5% if no one digs up more specific numbers

lapis parrot Nov 10, 2025, 8:21 PM

#

since new arch and stuff is probably good to speedup development in other areas

#

so as soon as you make it the better it is imho

#

even if it's not complete, relevant info can be put on github after it

rocky vigil Nov 10, 2025, 8:21 PM

#

i think we can get everything done within 1-2 days if we speed it

#

which should be fairly fast

lapis parrot Nov 10, 2025, 8:22 PM

#

yeah just saying, my last project at work is more or less finished 2 weeks ago and I'm still making docs to close it

#

(:

rocky vigil Nov 10, 2025, 8:22 PM

#

ah

#

I think the "full" doc looks solid now

#

so we can work on preparing PR message

#

and after that shawn and I need to lock in on the other things

twilit oriole Nov 10, 2025, 8:23 PM

#

Well the PR message can be done in 1 hour, the branch being ready is the main thing lol

rocky vigil Nov 10, 2025, 8:24 PM

#

@frosty imp what is remaining before threat inputs can be PR'd to nnue-pytorch

#

and I'll try to set up a tracking issue for the main sf PR

rocky vigil Nov 10, 2025, 8:26 PM

#

frosty imp <https://github.com/official-stockfish/Stockfish/issues/2823>

ok so attempting to create a new issue now enforces that it follows the "typical issue" format

#

so idk if this is actually the best approach now

rocky vigil Nov 10, 2025, 8:28 PM

#

rocky vigil and I'll try to set up a tracking issue for the main sf PR

yknow what let's just try to do it here

rocky vigil Nov 10, 2025, 8:31 PM

#

foggy wind there is also still this warning: ``` position.cpp: In member function 'void Sto...

one thing absolutely should try to do soon is fix this

#

most other stuff is optional and can be done after the initial PR

#

but this warning should definitely go

violet badger Nov 10, 2025, 8:32 PM

#

you can also make a PR and use the first PR comment to keep a list of items?

rocky vigil Nov 10, 2025, 8:32 PM

#

true, would need to wait for shawn to do that

violet badger Nov 10, 2025, 8:32 PM

#

I think you could create that?

#

not that it matters too much.

rocky vigil Nov 10, 2025, 8:33 PM

#

so pull shawn's branch and then create a pr with it

#

sure

violet badger Nov 10, 2025, 8:33 PM

#

yes, creating a PR would have the advantage of CI running.

#

let's see what it uncovers 😉

prime mica Nov 10, 2025, 8:34 PM

#

oh dear

violet badger Nov 10, 2025, 8:34 PM

#

🧟

prime mica Nov 10, 2025, 8:34 PM

#

like uncovering a rock w/ a bajillion roaches and worms underneath it

#

(maybe)

#

or hopefully it's a nicely mowed lawn

rocky vigil Nov 10, 2025, 8:34 PM

#

alright give me a few minutes to set it up

lapis parrot Nov 10, 2025, 8:39 PM

#

prime mica like uncovering a rock w/ a bajillion roaches and worms underneath it

don't worry

#

roaches are 2 supply

#

so maxing out on them is not good

rocky vigil Nov 10, 2025, 8:40 PM

#

speaking of vvltc results

#

I do wonder which pair master got double killed in...

#

https://github.com/official-stockfish/Stockfish/pull/6406
to be updated

GitHub

Update NNUE architecture to SFNNv10 with Threat Inputs by sscg13 ·...

Rest of message to be written...
Passed STC:
LLR: 2.93 (-2.94,2.94) <0.00,2.00>
Total: 63424 W: 16956 L: 16591 D: 29877
Ptnml(0-2): 276, 7522, 15797, 7795, 322
https://tests.stockfish...

lapis parrot Nov 10, 2025, 8:42 PM

#

rocky vigil I do wonder which pair master got double killed in...

you can search for it lol

#

with ctrl+f ,1

#

and downloading relevant pgn

#

but in general it's pretty meaningless

rocky vigil Nov 10, 2025, 8:43 PM

#

they can be downloaded by machine?

lapis parrot Nov 10, 2025, 8:43 PM

#

this pairs happen in dev vs sf 17 from both sides

rocky vigil Nov 10, 2025, 8:43 PM

#

interesting

#

ah sure

lapis parrot Nov 10, 2025, 8:43 PM

#

you can open the test

prime mica Nov 10, 2025, 8:43 PM

#

do the positions tend to be very sharp or something

#

or does one side just make a blunder early on

#

(or both)

lapis parrot Nov 10, 2025, 8:43 PM

#

click on any number below idx

#

and download all pgns played by the worker

rocky vigil Nov 10, 2025, 8:44 PM

#

yeah PT has them

lapis parrot Nov 10, 2025, 8:44 PM

#

usually neither

rocky vigil Nov 10, 2025, 8:44 PM

#

actually in higher frequency

lapis parrot Nov 10, 2025, 8:44 PM

#

just some time trouble

#

where one side shows 0,00 and other shows +2

#

or some tactical miss

#

where losing side lacked ike 1-2 plies of search to see it

violet badger Nov 10, 2025, 8:47 PM

#

first zombie identified 😉

rocky vigil Nov 10, 2025, 8:47 PM

#

a lot of checks failing

#

besides that

#

what else should I add to task list

prime mica Nov 10, 2025, 8:48 PM

#

merging in a couple speedups

#

https://tests.stockfishchess.org/tests/live_elo/6911b37fec1d00d2c195c8f8 this will probably pass if I'm understanding its effect correctly

#

and then snowy egret ofc

#

both are pretty straightforward and non functional

violet badger Nov 10, 2025, 8:48 PM

#

task list, PR to nnue-repo

twilit oriole Nov 10, 2025, 8:49 PM

#

Merge them after or redo the overall sprts I think

#

Otherwise the listed Elos will be wrong I guess

violet badger Nov 10, 2025, 8:49 PM

#

Task: verify correct .yaml is mentioned/linked for the training recipe

rocky vigil Nov 10, 2025, 8:49 PM

#

is that a task for this pr or for nnue-pytorch pr though

#

which I need to get confirmation from shawn

#

that everything is ready

violet badger Nov 10, 2025, 8:50 PM

#

just somewhere we can keep track 😉

green moat Nov 10, 2025, 8:53 PM

#

vondele, given how PRs work, wouldn't it be better to merge threats_input only when it will be difficult even for anematode to find some speedup?
My fear is that, if threats_input is merged soon, some next speedups might get lost in merge waves or could interfere with other gains....
Just saying...
🙂

prime mica Nov 10, 2025, 8:53 PM

#

nahhh

rocky vigil Nov 10, 2025, 8:53 PM

#

it is better to merge this and have it become master

prime mica Nov 10, 2025, 8:53 PM

#

^

rocky vigil Nov 10, 2025, 8:54 PM

#

well all other pending ones will need to get redone

#

that's unavoidable

#

but the sooner we do it the less time is wasted

green moat Nov 10, 2025, 8:54 PM

#

rocky vigil it is better to merge this and have it become master

ok, ok
🙂

rocky vigil Nov 10, 2025, 9:02 PM

#

it does look like we got quite the bugs in the walls

#

that need to be cleansed

prime mica Nov 10, 2025, 9:03 PM

#

mfw

rocky vigil Nov 10, 2025, 9:05 PM

#

we also gotta add all the other contributors still unlisted here, once viren gets his list

lofty cedar Nov 10, 2025, 9:05 PM

#

Relatively little for a patch that changes 30+ files I'd say.

rocky vigil Nov 10, 2025, 9:06 PM

#

oh bruh nobody removed this ahhhh

#

somebody remove it

prime mica Nov 10, 2025, 9:07 PM

#

msheart_eyes

violet badger Nov 10, 2025, 9:08 PM

#

rocky vigil it does look like we got quite the bugs in the walls

large number are due to not writing the network out correctly I think (from SF)

rocky vigil Nov 10, 2025, 9:08 PM

#

well that gives a new entry to task list

#

"remove unused code"

#

and "fix write NNUE"

rocky vigil Nov 10, 2025, 9:09 PM

#

rocky vigil and "fix write NNUE"

this is much simpler if we change the nnue format

#

though we could also hack it

#

it still irks me that read_leb128(a), read_leb128(b) is not equivalent to read_leb128(a+b)

lofty cedar Nov 10, 2025, 9:11 PM

#

But it seems no UB right?

prime mica Nov 10, 2025, 9:11 PM

#

that is funny

#

there's definitely something... bc it crashes for me locally with sanitizers on (but without a message)

rocky vigil Nov 10, 2025, 9:11 PM

#

lofty cedar But it seems no UB right?

the write functions probably actually have UB

#

since nobody touched them

lofty cedar Nov 10, 2025, 9:12 PM

#

Oh... yeah... but other than that...

#

Though I thought Stockfish was going to be slower to adopt threat input than this. It's pretty fast. Only Monty, Yukari, and Plentychess adopted it faster?

#

Impressive considering the baseline net is much better in Stockfish.

rocky vigil Nov 10, 2025, 9:16 PM

#

so, the hack fix for this is to declare a combined array, write the threat weights and normal weights into the combined array, and then write_leb_128 the combined array

prime mica Nov 10, 2025, 9:16 PM

#

great

rocky vigil Nov 10, 2025, 9:16 PM

#

but like

#

do we wanna do hack fix

#

or wait longer and attempt to do a better fix

twilit oriole Nov 10, 2025, 9:17 PM

#

I am out of date on this, is the issue the i8 weights are too hard to compress or smth?

rocky vigil Nov 10, 2025, 9:17 PM

#

no

#

the issue is that everything is still leb128

#

and it's screwing stuff

twilit oriole Nov 10, 2025, 9:17 PM

#

What is needed instead

rocky vigil Nov 10, 2025, 9:18 PM

#

basically I don't like the current format bc we have to declare these huge combined arrays

#

in roder to read and write

twilit oriole Nov 10, 2025, 9:18 PM

#

The hack fix doesn't sound too bad to me tbh

rocky vigil Nov 10, 2025, 9:18 PM

#

yeah

#

ideally what I would prefer is:
(i8 threat weights) (leb128 psq weights) (rest of network)

#

instead of (leb128 combined weights) (rest of network)

twilit oriole Nov 10, 2025, 9:19 PM

#

Why verbatim?

rocky vigil Nov 10, 2025, 9:19 PM

#

ok confusing

#

just the bytes directly

green moat Nov 10, 2025, 9:19 PM

#

Also, some poor guy (shawn_xu?) will have to update new SF NNUEv10 architecture scheme in nnue-pytorch....
🙂

prime mica Nov 10, 2025, 9:19 PM

#

basically just memcpy

rocky vigil Nov 10, 2025, 9:20 PM

#

rocky vigil instead of (leb128 combined weights) (rest of network)

this necessitates a leb128 read into a combined array (and similar for write)

#

when it shouldn't be necessary at all

#

yknow what I'll do hack fix for now

#

see if it fixes anything

violet badger Nov 10, 2025, 9:21 PM

#

Right now, hack fix, and work on a new format for another round... yeah

#

if anybody asks about the architectures that SF supports most solidly, refer to this picture please

rocky vigil Nov 10, 2025, 9:36 PM

#

btw @twilit oriole I've attached the PR links directly in the PR msg

#

so I think we can get rid of them in the other doc now

#

smallnet printing has been hacked correctly

#

still working on threatnet

#

alright

#

net printing fixed...

#

ok it seems like the next issue is the declaration shadows local variable

#

which turns into an error on some CI

violet badger Nov 10, 2025, 9:47 PM

#

-Werror

rocky vigil Nov 10, 2025, 9:48 PM

#

i think that can be easily fixed

#

let me try to do it as well...

#

what is this test checking?

#

I'll attempt to fix this as well if I understand what it wants

prime mica Nov 10, 2025, 9:57 PM

#

it tells you if you're including a header that's not needed for it to compile

#

or if you're not including a header that, didn't your compiler transitively include it by another header, would make the program fail to compile

rocky vigil Nov 10, 2025, 9:58 PM

#

well if u wannar ead the output

#

https://github.com/official-stockfish/Stockfish/actions/runs/19247091247/job/55023637038?pr=6406

#

and lemme know what it wants

violet badger Nov 10, 2025, 10:03 PM

#

last one to fix, don't worry

rocky vigil Nov 10, 2025, 10:03 PM

#

ok

#

I'll push the declaration shadows local variable fix now

#

https://github.com/official-stockfish/Stockfish/pull/6406/commits/17c4bab191ce5a53aee929d4861aac568dc98a1c

violet badger Nov 10, 2025, 10:04 PM

#

but it is explicit on what it wants : https://github.com/official-stockfish/Stockfish/actions/runs/19247091247/job/55023637038?pr=6406#step:7:180

rocky vigil Nov 10, 2025, 10:04 PM

#

this cannot cause any performance regression right

prime mica Nov 10, 2025, 10:04 PM

#

nah

rocky vigil Nov 10, 2025, 10:04 PM

#

i'll chalk up the abnormally slow bench to my laptop being weird then

prime mica Nov 10, 2025, 10:05 PM

#

yes

#

:P

violet badger Nov 10, 2025, 10:05 PM

#

laptop regression confirmed by remote diagnosis.

#

The matetrack error is more interesting.

rocky vigil Nov 10, 2025, 10:06 PM

#

btw what happened in https://github.com/official-stockfish/Stockfish/actions/runs/19247561380/job/55025208790?pr=6406

#

yeah and matetrack

prime mica Nov 10, 2025, 10:07 PM

#

lol wtf

#

invalid explicitly-specified argument for template parameter

violet badger Nov 10, 2025, 10:07 PM

#

maybe Apple clang version 15.0.0 (clang-1500.3.9.4) issue?

#

or feature?

rocky vigil Nov 10, 2025, 10:08 PM

#

at least nothing else erroring out so far

violet badger Nov 10, 2025, 10:08 PM

#

the error was there before, so nothing random

prime mica Nov 10, 2025, 10:08 PM

#

my apple clang is 17 unfortunately

#

maybe because it's declared inline for no reason?

rocky vigil Nov 10, 2025, 10:11 PM

#

ouch https://github.com/official-stockfish/Stockfish/actions/runs/19247561380/job/55025208765?pr=6406 also has an issue with the sse41 i8 conversio

#

yeah a lot of compilers having issues with that

#

interesting

prime mica Nov 10, 2025, 10:15 PM

#

ugh

#

_mm_cvtsi64x_si128 maybe

#

actually that's even worse

#

hm

#

we could do _mm_set_epi64x(0,x)

#

miserable

rocky vigil Nov 10, 2025, 10:20 PM

#

idk compiler diffs are weird

prime mica Nov 10, 2025, 10:20 PM

#

is there a way to check whether an identifier exists in the preprocessor

#

no right

rocky vigil Nov 10, 2025, 10:21 PM

#

yeah prob not

#

tough

#

i mean I wouldn't know tho

#

somehow master has none of these compilation issues

#

sigh

prime mica Nov 10, 2025, 10:22 PM

#

lmao

#

#

https://stackoverflow.com/a/38547909

Stack Overflow

Initializing an __m128 type from a 64-bit unsigned int

The _mm_set_epi64 and similar *_epi64 instructions seem to use and depend on __m64 types. I want to initialize a variable of type __m128 such that the upper 64 bits of it are 0, and the lower 64 bi...

#

average non portability

rocky vigil Nov 10, 2025, 10:22 PM

#

also i think someone who worked on the incremental threat can revisit this

prime mica Nov 10, 2025, 10:47 PM

#

@rocky vigil ok I think try replacing _mm_cvtsi64_si128(x) with _mm_loadu_si64(&x)

#

OH WAIT

#

it's because it's building on 32-bit

#

ughhhhhh

#

ok yeah then _mm_loadu_si64 should work

frosty imp Nov 10, 2025, 11:56 PM

#

@rocky vigil PR sent

warm thistle Nov 11, 2025, 12:16 AM

#

i may have a speedup ```
Result of 20 runs

base (./sf-old ) = 1374133 +/- 10832
test (./stockfish ) = 1383722 +/- 11199
diff = +9589 +/- 2380

speedup = +0.0070
P(speedup > 0) = 1.0000

CPU: 8 x AMD Ryzen 7 7700X 8-Core Processor
Hyperthreading: on


it's also a simp so we'll see

prime mica Nov 11, 2025, 12:17 AM

#

exciting

candid ivy Nov 11, 2025, 12:20 AM

#

twilit oriole For the PR message need to decide how much detail to go in. Like do I discuss th...

the bulk of the documentation can be added as a markdown file in the pytorch repo and the stockfish pr just quick summary and link to that

#

(if that hasn’t been said yet)

frosty imp Nov 11, 2025, 12:24 AM

#

huh @warm thistle it was added here for optimization

warm thistle Nov 11, 2025, 12:24 AM

#

oh hm

#

idk removing it seems to do better on my machine..

#

ig we see what fishtest says

prime mica Nov 11, 2025, 12:26 AM

#

seems reasonable to get rid of it if it doesn't help

#

maybe it should be an assert instead of an assume

#

but I don't see how it could be faster

frosty imp Nov 11, 2025, 1:12 AM

#

@rocky vigil made a few more PRs. will be back for more once those get merged

lofty cedar Nov 11, 2025, 1:53 AM

#

Great job everyone! This was a long journey, but now, after the cleanup finished, we could finally merge to master!

#

And then we can start search tune and so on to gain some more.

#

You're awesome!

prime mica Nov 11, 2025, 1:54 AM

#

Says u

#

The sama

rocky vigil Nov 11, 2025, 4:50 AM

#

frosty imp <@693549181838819338> made a few more PRs. will be back for more once those get ...

Aight lemme check

rocky vigil Nov 11, 2025, 5:07 AM

#

aight they're all merged

frosty imp Nov 11, 2025, 5:08 AM

#

https://github.com/official-stockfish/Stockfish/pull/6406#discussion_r2512105698

#

can resolve this now

rocky vigil Nov 11, 2025, 5:09 AM

#

yep

#

bruh IWYU came up with new compilation errors

frosty imp Nov 11, 2025, 5:26 AM

#

pr made

jolly tangle Nov 11, 2025, 5:42 AM

#

rocky vigil bruh IWYU came up with new compilation errors

lol yeah it can take a few iterations to make IWYU happy

rocky vigil Nov 11, 2025, 5:43 AM

#

frosty imp pr made

merged btw

#

matetrack issue seems to be related to tb

#

i have no idea how that happened

#

we ostensibly didn't touch any part of tb probing

#

btw @frosty imp how is progress on nnue-pytorch going

#

should I make a PR now for that

#

and you see what needs to be changed

frosty imp Nov 11, 2025, 5:48 AM

#

I'm working on a feature set refactor so best to wait after that

#

since there's prolly going to be heavy merge conflicts

rocky vigil Nov 11, 2025, 5:49 AM

#

yeah

#

the merge conflicts already there

#

cool

rocky vigil Nov 11, 2025, 5:49 AM

#

frosty imp I'm working on a feature set refactor so best to wait after that

i mean it won't be too hard to rebase stuff

#

manually

#

later

#

yeah i assume we are not gonna change the net format or anything

violet badger Nov 11, 2025, 5:59 AM

#

For the matetrack, I trying to find the reproducer, haven't extracted it yet, but seen this error message, which is probably the reason:
stockfish: syzygy/tbprobe.cpp:1148: void Stockfish::{anonymous}::set(T&, uint8_t*) [with T = TBTable<Stockfish::<unnamed>::WDL>; uint8_t = unsigned char]: Assertion `e.hasPawns == bool(*data & HasPawns)' failed.

#

unless that rings an immediate bell, I'll try to extract the testcase

#

syzygy/tbprobe.cpp:1073: uint8_t* Stockfish::{anonymous}::set_sizes(PairsData*, uint8_t*): Assertion `d->base64[i] * 2 >= d->base64[i + 1]' failed.

#

something is fishy 🙂

stray reef Nov 11, 2025, 6:20 AM

#

maybe salting the fish helps

rocky vigil Nov 11, 2025, 6:20 AM

#

and this doesn't trigger with master?

#

very very strange

stray reef Nov 11, 2025, 6:20 AM

#

i went to bed when sscg was fixing still, i woke up and sscg is still going kekgasm

rocky vigil Nov 11, 2025, 6:21 AM

#

stray reef i went to bed when sscg was fixing still, i woke up and sscg is still going <:ke...

nah I also went to bed

#

and woke up and started fixing more

stray reef Nov 11, 2025, 6:21 AM

#

guess i just need more sleep than most people here

split warren Nov 11, 2025, 6:22 AM

#

stray reef i went to bed when sscg was fixing still, i woke up and sscg is still going <:ke...

If this is true, u clearly need more sleep Kappa or sscg needs to speed up

rocky vigil Nov 11, 2025, 6:25 AM

#

btw does anyone else have cleanups they would like to propose to the pr

#

if so just pr it to my branch

stray reef Nov 11, 2025, 6:25 AM

#

planned to read the diff in a lecture later (3-4h from now)

rocky vigil Nov 11, 2025, 6:26 AM

#

fair

stray reef Nov 11, 2025, 6:28 AM

#

one minor thing that comes to mind tho: we no longer require safe_destination() in bitboard.h, it can be moved back to make the diff simpler

rocky vigil Nov 11, 2025, 6:28 AM

#

ah

#

btw on the x86-32-sse41-popcnt comp failure

#

LLM is suggesting to use _mm_cvtsi32_si128 instead

#

idk how trustworthy that is

#

i get the general issue of attempting to manipulate 64 bit stuff on 32 bit comp

#

but do we have a way to distinguish between 32 bit sse41 and 64 bit sse41

stray reef Nov 11, 2025, 6:33 AM

#

could FullThreats::append_active_indices be simplified for pawns using one of the newly introduced attacks_bb() functions in bitboard.h?

#

just throwing some ideas here, not sure how much cleanup should be done now vs. afterwards

rocky vigil Nov 11, 2025, 6:37 AM

#

stray reef could `FullThreats::append_active_indices` be simplified for pawns using one of ...

afaik for pawns they're done in bulk

#

it is faster for refreshing

#

though refreshing takes negligible amount of total time

violet badger Nov 11, 2025, 6:38 AM

#

$ cat test3.inp 
setoption name syzygyPath value ../../syzygy/3-4-5/
position fen 8/8/8/8/6b1/1N1P4/5K1p/7k b - - 0 1
go nodes 100000
$ cat test3.inp - |  ../Stockfish/src/stockfish
Stockfish dev-20251110-b5a26a84 by the Stockfish developers (see AUTHORS file)
info string Found 145 WDL and 145 DTZ tablebase files (up to 5-man).
info string Available processors: 0-31
info string Using 1 thread
info string NNUE evaluation using nn-49c1193b131c.nnue (125MiB, (102384, 1024, 15, 32, 1))
info string NNUE evaluation using nn-37f18f62d772.nnue (6MiB, (22528, 128, 15, 32, 1))
info string Network replica 1: Shared memory.
info depth 1 seldepth 3 multipv 1 score cp -40 nodes 11 nps 11000 hashfull 0 tbhits 0 time 1 pv g4f3
info depth 2 seldepth 3 multipv 1 score cp -33 nodes 26 nps 26000 hashfull 0 tbhits 0 time 1 pv g4f3
info depth 3 seldepth 4 multipv 1 score cp -27 nodes 138 nps 138000 hashfull 0 tbhits 0 time 1 pv g4e2
info depth 4 seldepth 5 multipv 1 score cp -97 nodes 811 nps 811000 hashfull 0 tbhits 0 time 1 pv g4h5 b3d2
info depth 5 seldepth 6 multipv 1 score cp -93 nodes 983 nps 491500 hashfull 0 tbhits 0 time 2 pv g4e2 d3d4 e2f3 b3d2 f3g2
info depth 6 seldepth 7 multipv 1 score cp -87 nodes 1024 nps 512000 hashfull 0 tbhits 0 time 2 pv g4e2 d3d4 e2f3 b3d2 f3g2
info depth 7 seldepth 9 multipv 1 score cp -96 nodes 1268 nps 634000 hashfull 0 tbhits 0 time 2 pv g4e2 d3d4 e2f3 b3d2 f3g2 d2c4 g2f3
terminate called after throwing an instance of 'std::length_error'
  what():  vector::_M_default_append

#

under valgrind

📎 message.txt

#

no idea what is going on there..

rocky vigil Nov 11, 2025, 6:39 AM

#

huh it looks like it's in the TB code

rocky vigil Nov 11, 2025, 6:40 AM

#

violet badger ```bash $ cat test3.inp setoption name syzygyPath value ../../syzygy/3-4-5/ pos...

can you use n-ary searching to figure out which node the crash occurs on?

violet badger Nov 11, 2025, 6:41 AM

#

educate me ....

rocky vigil Nov 11, 2025, 6:41 AM

#

i.e. repeat "go nodes x/ucinewgame" with increasing values of x

#

until crash

violet badger Nov 11, 2025, 6:41 AM

#

ah, I see.

#

I can probably just print the fen of the probing..

rocky vigil Nov 11, 2025, 6:42 AM

#

yeah to get an fen

#

essentially

#

how updated is probing code?

#

maybe some issue like https://github.com/syzygy1/probetool/commit/f3f8227dafdcd7039ad0da445fcf7bea20cf9bfe

violet badger Nov 11, 2025, 6:44 AM

#

I think it is more some corruption that just happens to trigger that.

rocky vigil Nov 11, 2025, 6:44 AM

#

in any case it's strange

violet badger Nov 11, 2025, 6:46 AM

#

if you happen to have TB around, can you test if you can reproduce?

rocky vigil Nov 11, 2025, 6:46 AM

#

i only have shatranj TB lol

violet badger Nov 11, 2025, 6:46 AM

#

ok dw

rocky vigil Nov 11, 2025, 6:46 AM

#

what endgame?

#

I can download the specific 5 man

violet badger Nov 11, 2025, 6:47 AM

#

setoption name syzygyPath value ../../syzygy/3-4-5/
position fen 8/8/8/8/6b1/1N1P4/5K1p/7k b - - 0 1
go nodes 100000```

#

but let me see if I get the fen

rocky vigil Nov 11, 2025, 6:48 AM

#

yeah this is 6 piece (root pos)

violet badger Nov 11, 2025, 6:50 AM

#

If I print out the fens it probs I get

...
Probe: 8/8/8/8/3P4/5KN1/8/6kr w - - 0 7
Probe: 8/8/8/8/3P4/5K2/8/6kN b - - 0 7
Probe: 8/8/8/8/3P4/5KN1/8/6kb w - - 0 7
==1805787== Thread 2:
==1805787== Invalid read of size 1

#

with that last fen triggering the error

rocky vigil Nov 11, 2025, 6:51 AM

#

ah

#

bishop underpromotion is strange

#

so this is KNPkb

violet badger Nov 11, 2025, 6:52 AM

#

but if I search that last fen nothing happens.

#

hmm that could be.

rocky vigil Nov 11, 2025, 6:53 AM

#

what if you try a precursor position like 8/8/8/8/3P4/5KN1/7p/6k1 b - - 0 1

violet badger Nov 11, 2025, 6:55 AM

#

no problem

rocky vigil Nov 11, 2025, 6:56 AM

#

huh

#

doesn't seem to be a probing problem

violet badger Nov 11, 2025, 6:56 AM

#

no something is strange..

rocky vigil Nov 11, 2025, 6:57 AM

#

underpromotion, it being a check, captures available in the position are all edge cases of TB idk

violet badger Nov 11, 2025, 6:58 AM

#

well, we've never had TB issues.

rocky vigil Nov 11, 2025, 6:58 AM

#

but why would it only crash when root pos is far away

rocky vigil Nov 11, 2025, 6:59 AM

#

violet badger but if I search that last fen nothing happens.

can you also get the internal data being passed to the TB probing, see if that differs somehow?

violet badger Nov 11, 2025, 7:03 AM

#

If I compile with sanitize=undefined I get:

Probe: 8/8/8/8/3P4/5KN1/8/6kb w - - 0 7
syzygy/tbprobe.cpp:1081:22: runtime error: shift exponent 64 is too large for 64-bit type 'long unsigned int'
syzygy/tbprobe.cpp:1042:31: runtime error: shift exponent 151 is too large for 64-bit type 'long long unsigned int'
syzygy/tbprobe.cpp:1043:31: runtime error: shift exponent 209 is too large for 64-bit type 'long long unsigned int'
terminate called after throwing an instance of 'std::length_error'
  what():  vector::_M_default_append

rocky vigil Nov 11, 2025, 7:07 AM

#

that is very strange

#

and I assume it doesn't occur when the position is probed directly?

violet badger Nov 11, 2025, 7:08 AM

#

it doesn't trigger on master... let me check on the branch

#

no the positions searches fine as rootpos

rocky vigil Nov 11, 2025, 7:12 AM

#

and no warnings on shift exponent

#

ok

violet badger Nov 11, 2025, 7:12 AM

#

right

rocky vigil Nov 11, 2025, 7:12 AM

#

i guess the issue must be in the internal data being passed somehow

violet badger Nov 11, 2025, 7:13 AM

#

I think so, but have to stop debugging now.. later today I can look into it again.

frosty imp Nov 11, 2025, 7:47 AM

#

Issue in make move maybe?

violet badger Nov 11, 2025, 8:04 AM

#

it is something rare, I'm currently playing games with syzygy enabled, and it is not triggering after a few 100 games.

#

but does trigger on that testcase.

violet badger Nov 11, 2025, 8:50 AM

#

OK, finally, have a setup where this triggers reliably while playing games (basically book of random 6men positions).

#

     34  0-1 {Black mates}
     12  0-1 {White disconnects}
      9  1-0 {Black disconnects}
     25  1-0 {White mates}
      3  1/2-1/2 {Draw by fifty moves rule}
      7  1/2-1/2 {Draw by insufficient mating material}

#

and it is specific to the branch, not happening for master.

violet badger Nov 11, 2025, 8:56 AM

#

prime mica it's because it's building on *32-bit*

probably this should be PRed to the branch?

#

there is something similar for armv7 neon https://github.com/official-stockfish/Stockfish/actions/runs/19256000717/job/55050601986?pr=6406#step:10:161

prime mica Nov 11, 2025, 9:17 AM

#

violet badger probably this should be PRed to the branch?

Yep I will in a bit

prime mica Nov 11, 2025, 9:17 AM

#

violet badger OK, finally, have a setup where this triggers reliably while playing games (basi...

Ai ya

#

Is there a field of Position or StateInfo that only tbprobe reads

#

I’m surprised address sanitizer isn’t catching anything

proper oxide Nov 11, 2025, 9:23 AM

#

is there a reason there's double_inc_update for threats?

#

it seems to me like there's no optimization there?

#

this is a slight speedup for me

--- src/nnue/nnue_accumulator.cpp
+++ src/nnue/nnue_accumulator.cpp
@@ -212,17 +212,6 @@ void AccumulatorStack::forward_update_incremental(
             DirtyPiece& dp1 = psq_accumulators[next].diff;
             DirtyPiece& dp2 = psq_accumulators[next + 1].diff;
 
-            if (std::is_same_v<FeatureSet, ThreatFeatureSet> && dp2.remove_sq != SQ_NONE
-                && ((threat_accumulators[next].diff.threateningSqs & square_bb(dp2.remove_sq))
-                    || (threat_accumulators[next].diff.threatenedSqs & square_bb(dp2.remove_sq))))
-            {
-                double_inc_update<Perspective>(featureTransformer, ksq, threat_accumulators[next],
-                                               threat_accumulators[next + 1],
-                                               threat_accumulators[next - 1], dp2);
-                next++;
-                continue;
-            }
-
             if (std::is_same_v<FeatureSet, PSQFeatureSet> && dp1.to != SQ_NONE
                 && dp1.to == dp2.remove_sq)
             {

prime mica Nov 11, 2025, 9:25 AM

#

lol

#

advanced

prime mica Nov 11, 2025, 9:39 AM

#

violet badger If I compile with sanitize=undefined I get: ``` Probe: 8/8/8/8/3P4/5KN1/8/6kb w ...

ok what's particularly demented is that these shift operands come from the TB file itself...

#

so I think data is getting misaligned somehow in the TB read logic

#

huh but the only usage of a possibly-bad pos in mapped is constructing the file name...

#

is there a consistency check utility function for Position anywhere?

prime mica Nov 11, 2025, 9:49 AM

#

rocky vigil but do we have a way to distinguish between 32 bit sse41 and 64 bit sse41

yeah

#

just check whether __i386__ or __x86_64__ is defined

#

I don't think the LLM's suggestion makes much sense

amber fern Nov 11, 2025, 9:58 AM

#

Is the threat inputs branch merged with the official SF branch yet? If not, when with that happen 🙂

prime mica Nov 11, 2025, 9:59 AM

#

patience

amber fern Nov 11, 2025, 10:02 AM

#

What this guy said

#

Also, you said the same thing to that guy 😂

prime mica Nov 11, 2025, 10:04 AM

#

lol

amber fern Nov 11, 2025, 10:05 AM

#

prime mica lol

Yes I just read through 400+ messages on this thread, the entire history of the last ~4 days, y'all have a lot to say

prime mica Nov 11, 2025, 10:06 AM

#

it's a complex change!

green moat Nov 11, 2025, 10:15 AM

#

By the way, the new L2=31 Stage 4-5 nets are now available. Has someone already tested them? Are they outdaded/superseded by other nets? 😐
Stage 5: https://gitlab.com/cscs-ci/ci-testing/webhook-ci/mirrors/5137461961076608/2926829081096545/-/jobs/12028659754/artifacts/download
Stage 4: https://gitlab.com/cscs-ci/ci-testing/webhook-ci/mirrors/5137461961076608/2926829081096545/-/jobs/12028659745/artifacts/download

amber fern Nov 11, 2025, 10:24 AM

#

Fish test time!

rocky vigil Nov 11, 2025, 10:41 AM

#

green moat By the way, the new L2=31 Stage 4-5 nets are now available. Has someone already ...

patience

#

we'll do it after the merge

#UE Threat Inputs for AB

i may have a speedup ``` Result of 20 runs

i may have a speedup ```
Result of 20 runs