small speedup ideas | Stockfish | Page 1

narrow plinth Dec 10, 2023, 12:49 AM

#

Template Thread based on main thread vs non main thread. Use std::array for Threads instead of vector.
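
A minimal sketch of what "template Thread based on main thread vs non main thread" could look like, assuming C++17. `SearchThread`, `iterate`, and `checked_time` are hypothetical names for illustration only and are not Stockfish's actual classes or members.

```cpp
#include <cassert>

// Hypothetical stand-in for a search thread. The bool template parameter
// selects main-thread-only work at compile time instead of via a runtime check.
template <bool IsMain>
struct SearchThread {
    int  iterations   = 0;
    bool checked_time = false;

    void iterate() {
        ++iterations;
        if constexpr (IsMain) {
            // Assumed main-only duty (e.g. time management); the branch is
            // not even instantiated for SearchThread<false>.
            checked_time = true;
        }
    }
};
```

One design consequence: `SearchThread<true>` and `SearchThread<false>` are distinct types, so they cannot share a single container without a common base class or a variant.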

tardy jasper Dec 10, 2023, 12:50 AM

#

is it possible to resize arrays on the fly like vectors?

narrow plinth Dec 10, 2023, 12:50 AM

#

no but it is limited at compile time anyway

#

ucioption.cpp it is limited to 1024 max right now
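
A rough sketch of the std::array idea, assuming the 1024 cap mentioned above. `FixedPool` and `Worker` are hypothetical stand-ins, not Stockfish code; the pool stores pointers because a later message in this thread mentions chasing a stored pointer.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <memory>

constexpr std::size_t kMaxThreads = 1024;  // cap quoted in the thread

struct Worker {          // hypothetical stand-in for a thread object
    std::size_t id;
};

// Fixed-capacity pool: the slots live inside the object, and a separate
// counter tracks how many are in use. "Resizing" never reallocates.
class FixedPool {
    std::array<std::unique_ptr<Worker>, kMaxThreads> slots_{};
    std::size_t count_ = 0;

public:
    // Returns false (and changes nothing) if n exceeds the fixed capacity.
    bool resize(std::size_t n) {
        if (n > kMaxThreads) return false;
        while (count_ > n) slots_[--count_].reset();      // destroy extras
        while (count_ < n) {                              // create missing
            slots_[count_] = std::make_unique<Worker>(Worker{count_});
            ++count_;
        }
        return true;
    }

    std::size_t size() const { return count_; }
    Worker& operator[](std::size_t i) { return *slots_[i]; }
};
```

Note the trade-off: the object always reserves `kMaxThreads` pointer slots, however few threads are actually in use.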

light cliff Dec 10, 2023, 12:54 AM

#

why would an array help over a vector?

narrow plinth Dec 10, 2023, 12:55 AM

#

faster to access

light cliff Dec 10, 2023, 12:55 AM

#

no?

narrow plinth Dec 10, 2023, 12:55 AM

#

wut?

light cliff Dec 10, 2023, 12:55 AM

#

identical operation

narrow plinth Dec 10, 2023, 12:55 AM

#

no it's not

#

vectors use heap memory and can move around

light cliff Dec 10, 2023, 12:56 AM

#

heap memory is still memory

#

no difference to the stack

#

moving around is irrelevant

narrow plinth Dec 10, 2023, 12:56 AM

#

yes, it means extra indirection....

light cliff Dec 10, 2023, 12:56 AM

#

accessing both is still just effectively a pointer deref

#

what indirection
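
A small sketch of the two points being argued here, under the usual implementation assumptions. Element `i` of either container sits at `data() + i`, so the indexing arithmetic is the same. The difference is where `data()` comes from: a `std::array` holds its elements inside the object, while a `std::vector` object holds a pointer to separately allocated storage. `points_inside` is a hypothetical helper written for this illustration.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: does p point into the bytes of obj itself?
template <typename T>
bool points_inside(const T& obj, const void* p) {
    auto begin = reinterpret_cast<std::uintptr_t>(&obj);
    auto addr  = reinterpret_cast<std::uintptr_t>(p);
    return addr >= begin && addr < begin + sizeof(T);
}

// Both containers index as base + i; only the source of the base differs.
bool same_index_arithmetic() {
    std::array<int, 8> a{};
    std::vector<int>   v(8);
    for (std::size_t i = 0; i < 8; ++i) {
        if (&a[i] != a.data() + i) return false;
        if (&v[i] != v.data() + i) return false;
    }
    return true;
}

// std::array: the elements are part of the object.
bool array_storage_is_inline() {
    std::array<int, 8> a{};
    return points_inside(a, a.data());
}

// std::vector: the object stores a pointer to storage allocated elsewhere.
bool vector_storage_is_inline() {
    std::vector<int> v(8);
    return points_inside(v, v.data());
}
```

Whether that one extra pointer load costs anything measurable depends on whether the compiler can keep the base in a register across the accesses, which this sketch does not measure.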

narrow plinth Dec 10, 2023, 12:57 AM

#

im not gonna debate basics with you lol....

#

array is faster by a bit

light cliff Dec 10, 2023, 12:57 AM

#

then don't be wrong? lmfao

narrow plinth Dec 10, 2023, 12:58 AM

#

it is well known that accessing arrays is faster than vector

maiden shard Dec 10, 2023, 12:58 AM

#

narrow plinth it is well known that accessing arrays is faster than vector

Vectors are arrays

narrow plinth Dec 10, 2023, 12:58 AM

#

nope

light cliff Dec 10, 2023, 12:59 AM

#

yep

#

it's not "well known"

#

cuz it's not true

#

wtf are you on

narrow plinth Dec 10, 2023, 12:59 AM

#

they are very different data types

#

https://www.geeksforgeeks.org/advantages-of-vector-over-array-in-c/

light cliff Dec 10, 2023, 1:00 AM

#

GEEKSFORGEEKS LMAO

#

you legit have to be trolling

narrow plinth Dec 10, 2023, 1:01 AM

#

Vectors in c++:

Vectors are sequence containers which utilize continuous storage locations to store elements. They can manage storage and grow dynamically in an efficient way. These abilities come at a price: vectors consume more memory in exchange for the ability to handle storage and growing dynamically in size. vector<int> v; where v is the variable of type Vector store integer elements.

Advantages of Vector:

Size of the vector can be changed
Multiple objects can be stored
Elements can be deleted from a vector

Disadvantages of Vector:

A vector is an object, memory consumption is more.

Declare an array in C++:

An array stores a fixed-size sequential collection of elements of the same type. It is used to store a collection of data, but the array can be considered as a collection of variables of the same type stored at contiguous memory locations. All arrays consist of contiguous memory locations, with the lowest address corresponds to the first element and the highest address to the last element.

Advantages of Arrays:

Arrays supports efficient random access to the members.
It is easy to sort an array.
They are more appropriate for storing fixed number of elements
Disadvantages of Arrays:
Elements can not be deleted
Dynamic creation of arrays is not possible
Multiple data types can not be stored

#

are you? wtf

#

no one with an iota of c++ knowledge thinks that vector and array have same cost to access

light cliff Dec 10, 2023, 1:02 AM

#

yeah you're trolling

#

o/

narrow plinth Dec 10, 2023, 1:03 AM

#

so your position is that a data structure that is variable sized is the same cost to index into as a fixed size one?

#

LMFAO

#

please stop, this is very embarrassing for you

tardy jasper Dec 10, 2023, 1:46 AM

#

If you think this is a possible speedup then maybe you can submit a test on fishtest?

narrow plinth Dec 10, 2023, 1:49 AM

#

i will soonish

#

might combine with 1 other thing

#

last one probably biggest gain, remove thread voting and no need for rootmoves for every thread

strange kettle Dec 10, 2023, 10:41 AM

#

Ciekce has a compulsive urge to correct others, taking everything into "you're wrong lol" territory in order to feel intellectually dominant. I guess something went wrong at some point in its development

tardy jasper Dec 10, 2023, 10:57 AM

#

wtf

#

you clearly haven't been in engine dev

#

ciekce is literally one of the most helpful for beginners

strange kettle Dec 10, 2023, 11:16 AM

#

Yeah, he is very helpful

#

I dont doubt that

terse crag Dec 10, 2023, 12:15 PM

#

narrow plinth no one with an iota of c++ knowledge thinks that vector and array have same cost...

oh my god, do you actually not know how indexing works?

#

incredible

strange kettle Dec 10, 2023, 12:16 PM

#

He doesnt know

narrow plinth Dec 10, 2023, 5:38 PM

#

do you? why do you think indexing into a fixed size array that has a fixed memory location has the same cost as a variable sized vector that can move?

hollow mortar Dec 10, 2023, 5:53 PM

#

vectors do vary in size but accessing an existing element is the exact same operation iirc

#

you might as well submit a test to fishtest to see the results

narrow plinth Dec 10, 2023, 5:55 PM

#

no, there are special instructions/addressing modes if you know base address and max size

#

it is certainly not the same cost

hollow mortar Dec 10, 2023, 5:56 PM

#

could you provide an example

narrow plinth Dec 10, 2023, 5:56 PM

#

can you do basic research ?

hollow mortar Dec 10, 2023, 5:56 PM

#

ah yes my google research would be like "array special instruction/addressing modes"

#

If i ask you to provide an example it's because i know researching it would take longer

#

since it's very vague

narrow plinth Dec 10, 2023, 5:59 PM

#

do you think that a constant base address that can be encoded into an immediate operand at compile time is the same as looking it up in a pointer at runtime?

#

please tell me

#

if you don't know basics like this, there is no helping you

#

it is very sad the complete lack of knowledge demonstrated by many people here

hollow mortar Dec 10, 2023, 6:02 PM

#

well if you really wanna dive into this topic i can tell you that usually common base addresses of stack arrays are not compiled into immediate operands each time, but rather put into registers upon use since it's faster and takes up less space. Even so, looking it up in a pointer (??) at runtime doesn't really sound like it would speed up Stockfish significantly compared to any other evaluation or search feature's speedups.

narrow plinth Dec 10, 2023, 6:03 PM

#

hollow mortar well if you really wanna dive into this topic i can tell you that usually common...

da hell? no shot you think that putting something into a register is faster than an immediate operand?

#

please just stop

hollow mortar Dec 10, 2023, 6:03 PM

#

reading an immediate operand in ram is slower than using an already-used and cached value in a cpu register

narrow plinth Dec 10, 2023, 6:04 PM

#

you literally don't even know what it is lol

#

an immediate is encoded into the instruction stream directly

#

there is no lookup

#

please stop before you embarrass yourself further

hollow mortar Dec 10, 2023, 6:06 PM

#

so how big would you estimate this very significant speedup to be

narrow plinth Dec 10, 2023, 6:06 PM

#

i didn't say it was significant, did i?

#

i said small

#

literally in the title

hollow mortar Dec 10, 2023, 6:06 PM

#

well clearly you thought it was significant enough to make a whole post about it

narrow plinth Dec 10, 2023, 6:07 PM

#

yeah, and i made many posts about removing tb

#

but i literally only estimated 0.5-1 elo

#

and it came in at 0.6

#

but next time, talk about what you understand instead of spouting nonsense

hollow mortar Dec 10, 2023, 6:09 PM

#

so how much elo do you think this would gain?

narrow plinth Dec 10, 2023, 6:09 PM

#

just the switch to std array?

#

maybe 0.1 at most

#

obviously at multi threaded test

#

probably closer to 0.05

hollow mortar Dec 10, 2023, 6:17 PM

#

now that i think about it, in order to compile a vector/array's base address as an immediate value, wouldn't that need to be a static address? I can't really figure how SF would ever use static addresses even with the use of arrays.

maiden shard Dec 10, 2023, 6:17 PM

#

narrow plinth do you think that a constant base address that can be encoded into an immediate ...

You’re also addressing with the array index into the thread array, along with the offset into the thread class, along with a potential offset into an array in the thread class like history
How is a constant base(assuming it even is a constant base) going to yield any significant increase in performance

narrow plinth Dec 10, 2023, 6:19 PM

#

no one said significant except other people

hollow mortar Dec 10, 2023, 6:20 PM

#

i think what he means is how is the speedup ever going to compensate for all the noise that comes from everything else

narrow plinth Dec 10, 2023, 6:20 PM

#

that is why i put multiple ideas

#

not just 1

#

i don't think only 1 is measurable

#

fixed size allocation, can go on stack, can know base address

maiden shard Dec 10, 2023, 6:22 PM

#

narrow plinth fixed size allocation, can go on stack, can know base address

The stack is not at a constant address
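
A sketch of the storage-duration distinction behind this exchange; all names are hypothetical. An object with static storage duration has one address for the whole run, and that address is a constant expression. A local object gets a fresh address per live activation, so its base can only be expressed relative to the stack pointer.

```cpp
#include <array>
#include <cassert>

// Static storage duration: one fixed location for the entire program run.
std::array<int, 1024> g_table{};

// The address of a static-storage object is a constant expression, which is
// what would let a toolchain encode it directly in the instruction stream.
constexpr std::array<int, 1024>* kTableAddr = &g_table;

bool global_address_is_stable() {
    return kTableAddr == &g_table;
}

// Automatic storage: two activations alive at once must hold distinct
// objects, so the same local variable ends up at different addresses.
bool local_addresses_differ(const int* outer = nullptr, int depth = 0) {
    std::array<int, 16> local{};
    if (depth == 0)
        return local_addresses_differ(local.data(), 1);
    return local.data() != outer;
}
```

Even for the global, whether an absolute immediate actually appears depends on compiler, flags, and target; position-independent x86-64 builds typically reach globals through RIP-relative addressing instead.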

hollow mortar Dec 10, 2023, 6:22 PM

#

yep

narrow plinth Dec 10, 2023, 6:23 PM

#

maiden shard The stack is not at a constant address

virtual memory means any process pretends it owns the whole address space, it can put stuff wherever it likes at compile time

hollow mortar Dec 10, 2023, 6:23 PM

#

compiler still won't use immediate values

narrow plinth Dec 10, 2023, 6:23 PM

#

yes it will

#

you literally don't even know what an immediate is

#

and you still have the nerve to say shit

hollow mortar Dec 10, 2023, 6:24 PM

#

an operand in the instruction itself

narrow plinth Dec 10, 2023, 6:24 PM

#

lmfao

hollow mortar Dec 10, 2023, 6:24 PM

#

in the form of a number

#

or address in this case

maiden shard Dec 10, 2023, 6:25 PM

#

narrow plinth yes it will

Do you have a godbolt link or something to show us?
I’ve never checked for this before

narrow plinth Dec 10, 2023, 6:25 PM

#

then why did you think a register lookup is faster?

hollow mortar Dec 10, 2023, 6:26 PM

#

narrow plinth then why did you think a register lookup is faster?

because:

it makes the instruction shorter which means easier decoding
it can easily be cached
cpus have other physical registers besides the architectural ones the machine code uses, so it wouldn't cause any problems with register availability

#

the difference in speed between the two, in absence of bottlenecks should be close to none

narrow plinth Dec 10, 2023, 6:29 PM

#

even if you were correct, which you are not, memory savings of not having to store and lookup base address would still be there

#

https://wolchok.org/posts/cxx-trap-1-constant-size-vector/

hollow mortar Dec 10, 2023, 6:31 PM

#

considering that using immediates still adds bytes to the code, we're literally talking about a bunch of bytes as memory saving, which would be irrelevant since they would still most likely be within an allocated memory page

narrow plinth Dec 10, 2023, 6:34 PM

#

so it's in a register, how did you load it into the register?

#

from memory, which requires another instruction or immediate

#

please stop this

hollow mortar Dec 10, 2023, 6:35 PM

#

just once tho lol

#

in your case it would use immediates every time

narrow plinth Dec 10, 2023, 6:35 PM

#

what do you mean just once, you allocate a whole register for it for the lifetime of the program? no shot

hollow mortar Dec 10, 2023, 6:36 PM

#

of course i mean each time it's needed, but then that register is used for addressing multiple times before being assigned to something else

narrow plinth Dec 10, 2023, 6:37 PM

#

yes, but that doesn't mean loading the register is free

#

the link has benchmarks near the bottom

hollow mortar Dec 10, 2023, 6:47 PM

#

i just took a look at the benchmark

#

so it seems like there are no immediate values being used for addressing

#

except jumps and calls of course

#

but that's obvious

#

It looks like the bench_array code is just better instead

#

even though it even has a nop instruction despite the O3 flag which is kinda funny

#

the actual cause for this difference in speed seems to be a bad use of xmm registers in bench_pushBack and bench_prealloc

#

this probably would never occur in the Stockfish executables that use AVX2 however in such artificial situation with a huge number of writes over a large array

#

actually

#

it looks like the bench_pushBack doesn't use xmm registers at all

#

only the general purpose ones

narrow plinth Dec 10, 2023, 7:07 PM

#

it doesn't because he made array at run time

#

with new

#

your theory would mean that compilers would just use register addressing and only use immediates to load the register, which doesn't happen

hollow mortar Dec 10, 2023, 7:15 PM

#

narrow plinth it doesn't because he made array at run time

yeah and that means the benchmark doesn't really fit with your proposal so there isn't any actual measurement

narrow plinth Dec 10, 2023, 8:16 PM

#

tell us what compiler turns an immediate into a register load to index?

#

you have no clue bro

hollow mortar Dec 10, 2023, 8:28 PM

#

at least i know what a benchmark actually measures

strange kettle Dec 10, 2023, 8:34 PM

#

Now this is intense

#

You guys should resolve your differences in an octagon

#

No rules

hollow mortar Dec 10, 2023, 8:37 PM

#

he's so hostile he would probably agree

strange kettle Dec 10, 2023, 8:48 PM

#

And you wouldnt?

hollow mortar Dec 10, 2023, 9:00 PM

#

strange kettle And you wouldnt?

I wouldn't agree to violence no, but i would win nonetheless

strange kettle Dec 10, 2023, 9:08 PM

#

Who doesn't sign up for a few good elbows to the nasal septum?

#

Come on

narrow plinth Dec 10, 2023, 9:27 PM

#

hollow mortar at least i know what a benchmark actually measures

uh huh, that's why you think register indexing is faster

#

because you sure demonstrated that bro

#

very good benchmark you posted

#

i see why zuppa had you blocked now

hollow mortar Dec 10, 2023, 9:37 PM

#

narrow plinth very good benchmark you posted

at least i didn't use one that doesn't have anything to do with my claim

narrow plinth Dec 10, 2023, 9:39 PM

#

better than nothing lmfao

#

blocked this fool until he shows da numbers

#

probably the heat death of the universe will come first

strange kettle Dec 10, 2023, 9:40 PM

#

Zuppa blocked you?

narrow plinth Dec 10, 2023, 9:40 PM

#

LMFAO

#

zuppa blocked yes a long time ago

strange kettle Dec 10, 2023, 9:40 PM

#

So he is a veteran

#

In the field of being blocked by zuppa

#

Im a novice

narrow plinth Dec 10, 2023, 9:43 PM

#

you can do it...just takes time

#

i really don't even know how you can even conceive of such shit

#

register indexing better than immediate lmfao

hollow mortar Dec 10, 2023, 9:54 PM

#

he unblocked me lol

hollow mortar Dec 10, 2023, 9:56 PM

#

narrow plinth i really don't even know how you can even conceive of such shit

indeed i also don't know how your parents conceived you

narrow plinth Dec 10, 2023, 9:57 PM

#

LOLOLOL

#

find any benchmarks yet?????

#

i see you posted 0

hollow mortar Dec 10, 2023, 9:57 PM

#

you're the one who made the first claim you should find them

narrow plinth Dec 10, 2023, 9:57 PM

#

your claim was that immediate is worse

#

post your benchmark

hollow mortar Dec 10, 2023, 9:57 PM

#

your claim was that immediate is better

#

so much to create a 0.1 elo gain

narrow plinth Dec 10, 2023, 9:57 PM

#

and?

hollow mortar Dec 10, 2023, 9:58 PM

#

and you didn't prove it yet

narrow plinth Dec 10, 2023, 9:58 PM

#

and?

#

gain is gain

#

if you stop trolling here

#

then maybe i could actually do it instead of replying to you

hollow mortar Dec 10, 2023, 9:58 PM

#

sounds like excuses

narrow plinth Dec 10, 2023, 9:59 PM

#

what productive thing have you done here?

#

where is your benchmark?

#

show me

hollow mortar Dec 10, 2023, 9:59 PM

#

where is yours?

narrow plinth Dec 10, 2023, 9:59 PM

#

name 1 single compiler which does it your way

#

because it is better

#

or are compiler writers all dumbasses?

hollow mortar Dec 10, 2023, 10:00 PM

#

i said that in general compilers use register indexing when using arrays, and gcc does that on your famous benchmark

#

so my statement isn't false

#

you genuinely thought compilers use immediate addressing all the time with arrays so

#

i'd say the one making shit up is you

narrow plinth Dec 10, 2023, 10:12 PM

#

lololol

#

nice benchmark you posted

#

in general, blah blah blah

#

cpu makers dumb af for even including it

#

if what you say is true

#

even if there are extra shadowed registers, that is still limited

#

and you still have to use one of the architectural registers when addressing it

#

read the synthesis os paper

#

regardless, an extra load is not free

#

unlike an immediate

#

whether you load it from memory or an immediate

#

and extra pointer still takes up space in data cache and memory regardless of if a tlb miss is caused or not

#

but sure, keep thinking that BS

#

with no evidence

#

it is quite unfortunate how much nonsense you believe

#

just say I, Yes. I believe that including extra code to grow and shrink a vector is free.

#

do it, lets see how smart you are.

#

even if your method was 'faster', which it isn't, the extra code would make a vector slower than an array overall for this use case

strange kettle Dec 10, 2023, 10:47 PM

#

What does smp stand for

narrow plinth Dec 10, 2023, 10:47 PM

#

symmetric multiprocessing

strange kettle Dec 10, 2023, 10:47 PM

#

Ohh

narrow plinth Dec 10, 2023, 10:47 PM

#

which basically means threads for sf usually

strange kettle Dec 10, 2023, 10:48 PM

#

Patches passing with the old master will be rescheduled ig

#

After the new merges I mean

narrow plinth Dec 10, 2023, 10:49 PM

#

sometimes i wonder about sf methodology

#

how many of these patches tested at same time interfere with each other

jovial juniper Dec 10, 2023, 10:50 PM

#

In theory maintainer says rebase and retest if they are likely to interfere. Although I suppose you could argue on "likely"

strange kettle Dec 10, 2023, 10:51 PM

#

If a given patch starts with a prev master and completes testing with an updated master, how you guys proceed

jovial juniper Dec 10, 2023, 10:52 PM

#

if they are likely to interfere

strange kettle Dec 10, 2023, 10:52 PM

#

Retest with the updated?

#

And how you measure that

hollow mortar Dec 10, 2023, 10:52 PM

#

so while you were here yapping i actually did a benchmark

#

by recycling some of the code that your dude used

strange kettle Dec 10, 2023, 10:53 PM

#

Like how you measure

hollow mortar Dec 10, 2023, 10:53 PM

#

to enforce the use of immediate addressing i had to allocate a global fixed size array and just guess a big enough size

#

but it worked

#

https://quick-bench.com/q/vgkBjn6SJxiO4HbDy5RxuTkLhqI
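
For readers who cannot open the link, here is a rough sketch of the kind of comparison described: a global fixed-size array against heap storage reached through a `std::vector`. This is not the linked benchmark's code. `kSize`, `g_fixed`, and every function name are assumptions, and it uses plain `std::chrono` rather than the quick-bench harness.

```cpp
#include <array>
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <vector>

constexpr std::size_t kSize = 1 << 16;  // hypothetical size, not from the thread

// Global with static storage: its base address is fixed at link/load time.
std::array<std::uint32_t, kSize> g_fixed{};

std::uint64_t fill_and_sum_fixed() {
    for (std::size_t i = 0; i < kSize; ++i)
        g_fixed[i] = static_cast<std::uint32_t>(i + 1);
    std::uint64_t sum = 0;
    for (std::uint32_t x : g_fixed) sum += x;
    return sum;
}

// Same work, but the base address is read from the vector object at runtime.
std::uint64_t fill_and_sum_vector(std::vector<std::uint32_t>& v) {
    for (std::size_t i = 0; i < v.size(); ++i)
        v[i] = static_cast<std::uint32_t>(i + 1);
    std::uint64_t sum = 0;
    for (std::uint32_t x : v) sum += x;
    return sum;
}

// Crude timer: average wall-clock nanoseconds per call over reps calls.
template <typename F>
double average_ns(F&& work, int reps) {
    volatile std::uint64_t sink = 0;  // keeps the result observable
    auto start = std::chrono::steady_clock::now();
    for (int r = 0; r < reps; ++r) sink = work();
    auto stop = std::chrono::steady_clock::now();
    (void)sink;
    return std::chrono::duration<double, std::nano>(stop - start).count() / reps;
}
```

Timings from a harness like this swing with the host CPU and with whether the compiler vectorizes each loop, so treat any single run with caution and check the disassembly before drawing conclusions.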

strange kettle Dec 10, 2023, 10:53 PM

#

If they are likely to interfere

hollow mortar Dec 10, 2023, 10:53 PM

#

keep in mind that i did this in 5 minutes at midnight

strange kettle Dec 10, 2023, 10:54 PM

#

I guess its entirely dependent on the patch

#

So the question is rather vague

jovial juniper Dec 10, 2023, 10:54 PM

#

strange kettle And how you measure that

If it modifies same part of code then definitely retest. And some things like net and big tunes probably get retested. But mostly intuition on what probably does AFAIK.

strange kettle Dec 10, 2023, 10:54 PM

#

Still Im curious

hollow mortar Dec 10, 2023, 10:54 PM

#

if you look at the disassembly you can see that bench_immediate does use immediate addressing

strange kettle Dec 10, 2023, 10:55 PM

#

jovial juniper If it modifies same part of code then definitely retest. And some things like ne...

Okay thank you for your explanation!

hollow mortar Dec 10, 2023, 10:56 PM

#

hollow mortar https://quick-bench.com/q/vgkBjn6SJxiO4HbDy5RxuTkLhqI

@narrow plinth have fun

#

it seems like you weren't that right after all

#

Now I'm going to sleep since i'm done talking to you

#

have a good day

strange kettle Dec 10, 2023, 10:58 PM

#

Have a good night

#

I hope you have sweet dreams

narrow plinth Dec 10, 2023, 11:02 PM

#

ah yes, testing array vs array when we were talking about array vs vector

#

what a genius

strange kettle Dec 10, 2023, 11:03 PM

#

Can you test it yourself in that page?

narrow plinth Dec 10, 2023, 11:03 PM

#

im not at keyboard right now

#

maybe later when I'm home

strange kettle Dec 10, 2023, 11:03 PM

#

Oko

narrow plinth Dec 10, 2023, 11:04 PM

#

but it looks like compiler is vectorizing everything, i will have to take closer look

#

at a quick glance, the compiler vectorized one loop but not the other

#

when i tried it

#

it got different results

#

from his run

#

in his run immediate was slightly slower, i cleared the results and got a huge difference

#

it probably depends on what cpu is assigned on server side

#

sometimes its under 1% difference, then sometimes it is 1.7x

#

i will try it locally when i get home

#

yeah something weird is going on

narrow plinth Dec 11, 2023, 2:10 AM

#

i will try changing it to an add or shift

#

using an expensive multiply seems wrong

#

to be clear, in sf this is only accessed when creating or deleting a thread and once per go command

#

and yes, chasing the pointer that is stored in it is probably more cost than the actual iteration

long glacier Dec 23, 2023, 8:50 PM

#

narrow plinth you literally don't even know what it is lol

you're wrong bro

ember canyon Dec 24, 2023, 1:16 PM

#

arguing in ideas channel is the most useless thing to do

#

everything can be responded to by "write patch and test it on fishtest"

#

but you keep hearing how they will test it soon(TM)

#

while defending the idea

narrow plinth Dec 25, 2023, 1:06 AM

#

no point if test won't get approved

long glacier Dec 26, 2023, 10:04 PM

#

best thing bro can do atm is delete this post fr

narrow plinth Dec 26, 2023, 10:09 PM

#

delete yourself

#

you contributed nothing LMFAO

viscid stag Dec 27, 2023, 5:21 AM

#

it's better to contribute nothing than to contribute utter bs and expose your own problems 😔

narrow plinth Dec 27, 2023, 6:30 AM

#

what problems?

narrow plinth Dec 27, 2023, 6:51 AM

#

do you believe that a fixed size array that is rarely accessed would be faster as a vector?

#

are you stupid?

tardy jasper Dec 27, 2023, 8:05 AM

#

narrow plinth do you believe that a fixed size array that is rarely accessed would be faster a...

I believe you mean the opposite here

#

#1183208858701811822 message

narrow plinth Dec 27, 2023, 8:10 AM

#

I was replying to someone who said i was full of BS and exposing my own problems

#

so that means he thinks the opposite for some reason

#

he must be really stupid

#

or maybe he is actually stupid enough to think testing array vs array is equivalent to array vs vector? LOL

strange kettle Dec 27, 2023, 10:08 AM

#

Calm down

narrow plinth Mar 19, 2024, 4:58 PM

#

https://wolchok.org/posts/cxx-trap-1-constant-size-vector/

maiden shard Mar 19, 2024, 7:18 PM

#

A single pointer is a performance trap?

narrow plinth Mar 19, 2024, 7:22 PM

#

maiden shard A single pointer is a performance trap?

why have an extra pointer and allocate on the heap?

#

what benefit does this give you exactly?

sick tiger Mar 20, 2024, 7:30 AM

#

Did anything happen out of this? As aggressive as he sounded I think he is correct though. An array that is allocated on the stack with a fixed size technically should be slightly faster, don't know if it matters too much though

rugged turtle Mar 20, 2024, 9:37 AM

#

the things are almost all premature optimizations

#

in non-hot paths

narrow plinth Mar 20, 2024, 10:52 PM

#

yes it's not a hot path, main impact would be less memory use not actual clock cycles saved directly

#

but still a small impact

#

i will probably implement it soon in my fork, been doing other stuff mostly
