#The NNUE is sparse

6 messages · Page 1 of 1 (latest)

tidal raft
#

Even the later layers are sparse. If you could use this information to speed up the NNUE eval somehow, that could be a good idea.

violet lodge
#

the later layers are so minimal that significant improvement is needed for a overall speedup can be measured

tidal raft
#

Maybe that could be the case...

#

But what about we could make the later layers bigger?

tidal raft
violet lodge
#

yes because the later layers are too small for even the entire L1 output to fit fully into an avx512 vector