#Alternative NNUE design
25 messages · Page 1 of 1 (latest)
thats doing what im doing manually, isn't it?
The pairwise term? Sure, but that boils down to simply summing up and taking a square.
mhm...
where are you headed
im not trying to approximate anything
Neural networks are seen as universal approximators.
you know what i meant
It needs to approximate something.
why not add a non-linearity then
What are you trying to do? Remember the compute efficiency.
Adding complete nonlinearity would be O(n^2)
Too expensive.
add query weights maybe
like with transformers
learnt ofc
Well, I already have my efficiently updatable transformer design that uses decoder.
But probably more suitable for llms than chess.
yeah
wait i just realized
that isnt the same thing
that only looks at tokens backwards
not forwards