thread_local cache | Together C & C++ | Page 1

narrow rampart Jan 15, 2024, 1:54 PM

#

ThreadSafeQueue<someBigAndCostlyShit> q;
class someBigAndCostlyShit {
someBigAndCostlyShit() {} //does this prevent eng from constructing?

void initialize()
{
eng.seed(currentTime());
}
~someBigAndCostlyShit
{
q.enqueue(this);
}
std::mt19937_64 eng;
}
void threadfunc()
{
  thread_local someBigAndCostlyShit a;
  std::optional<someBigAndCostlyShit> val = q.pop();
  if(val.has_value())
  {
    a = val.value();
  }
  else
  {
    a.initialize();
  }
}

I want to implement a class that will cache itself in a queue when a thread exits to prevent constructing it again when a new thread calls threadfunc, how does this look
I'm afraid that thread_local someBigAndCostlyShit a; will be expensive anyways because of default constructor to mt19937_64
I'm not sure which random number generator to use and if my solution even makes sense
I want to have a random function from x to y but my server is async and I guess a lot of new threads can spawn and often die, so creating these all the time would be costly
thats why I wanna cache them

distant briarBOT Jan 15, 2024, 1:54 PM

#

When your question is answered use !solved to mark the question as resolved.

Remember to ask specific questions, provide necessary details, and reduce your question to its simplest form. For tips on how to ask a good question use !howto ask.

radiant jewel Jan 15, 2024, 2:15 PM

#

Thread local sucks big on windows for some reason btw major performance diff between Linux

#

There are certainly faster random number generators as well, I also don’t like the use of a global variable

#

What matters is how expensive that object creation is compared to the execution time of the thread

narrow rampart Jan 15, 2024, 2:24 PM

#

I assume rng generators per thread would be expensive so I came up with this

#

and I plan to run on linux anyways

#

and for a global variable well I dont see any other option

radiant jewel Jan 15, 2024, 2:25 PM

#

You can inject a dependency?

#

A reference to the queue

narrow rampart Jan 15, 2024, 2:26 PM

#

sounds shitty

radiant jewel Jan 15, 2024, 2:26 PM

#

narrow rampart I assume rng generators per thread would be expensive so I came up with this

If your thread runs for 1sec then the creation of a random number generator is really no problem

narrow rampart Jan 15, 2024, 2:26 PM

#

I could also make the variable static in the function itself

narrow rampart Jan 15, 2024, 2:27 PM

#

radiant jewel If your thread runs for 1sec then the creation of a random number generator is r...

problem is, if 1000 people are sending packets all the time and asio async spawns threads constantly

#

and its a waste of time to make a new generator to get 1/2 numbers

#

so I could just reuse it

radiant jewel Jan 15, 2024, 2:28 PM

#

Your thread safe queue will probably just even it out

#

Because 1000 threads are trying to acquire the lock

#

Which I assume your queue has

narrow rampart Jan 15, 2024, 2:28 PM

#

yeah

#

but they wouldnt be all 1000 at once

radiant jewel Jan 15, 2024, 2:29 PM

#

Well I think it’s a lot of premature optimization, you still haven’t told me how long the thread actually runs for ?

narrow rampart Jan 15, 2024, 2:29 PM

#

I think making a new generator for each thread in case where async can spawn shit ton of threads that will die shortly after is a waste

narrow rampart Jan 15, 2024, 2:29 PM

#

radiant jewel Well I think it’s a lot of premature optimization, you still haven’t told me how...

its unknown

radiant jewel Jan 15, 2024, 2:29 PM

#

So anything from 1 nanosecond to 10 billion years?

narrow rampart Jan 15, 2024, 2:30 PM

#

yes

radiant jewel Jan 15, 2024, 2:30 PM

#

And your average is also unknown ?

narrow rampart Jan 15, 2024, 2:30 PM

#

Invocation of the handler will be performed in a manner equivalent to using boost::asio::io_service::post().

reef vault Jan 15, 2024, 2:30 PM

#

sounds like you'd benefit from having a thread pool as well tbh

reef vault Jan 15, 2024, 2:31 PM

#

radiant jewel Thread local sucks big on windows for some reason btw major performance diff bet...

maybe not related but last time I checked one reason for a huge decrease in perf of thread local on windows was making it compliant in some "edge cases" with dll stuff
just so happened that I personally am often in those "edge cases" and that the linux impl I tested at the time was bugged

#

might not matter here though

narrow rampart Jan 15, 2024, 2:32 PM

#

I use async_accept in loop then I use async_read in loop for connections

reef vault Jan 15, 2024, 2:33 PM

#

narrow rampart ```cpp ThreadSafeQueue<someBigAndCostlyShit> q; class someBigAndCostlyShit { som...

exluding the queue, is this more about how you "cache" the "thread local" instance or just about the queueing and caching in general?

narrow rampart Jan 15, 2024, 2:33 PM

#

well I think I got the how? so just more about the sense of this

#

if my class wrapper would even work and prevent the generator from running its shit on this line thread_local someBigAndCostlyShit a;

reef vault Jan 15, 2024, 2:34 PM

#

excluding premature optimization and whatnot caching instances and creating new instances on demand is fine-ish, just that if you really have the one instance per thread and have a ton of thread I'd really much rather have a thread pool and something somewhat similar to coroutines

narrow rampart Jan 15, 2024, 2:35 PM

#

reef vault sounds like you'd benefit from having a thread pool as well tbh

I guess I do have one

reef vault Jan 15, 2024, 2:35 PM

#

narrow rampart if my class wrapper would even work and prevent the generator from running its s...

I'd have to actually look at the type

narrow rampart Jan 15, 2024, 2:35 PM

#

I have a vector of std::thread and then I emplace back 8 threads that do io_context.run

reef vault Jan 15, 2024, 2:35 PM

#

narrow rampart I guess I do have one

but do you spawn a ton of thread to deal with all the async stuff

narrow rampart Jan 15, 2024, 2:35 PM

#

and from what I see that post will only invoke the handlers from the 8 threads

#

🤔

#

I actually didnt read into asio documentation

reef vault Jan 15, 2024, 2:35 PM

#

then you have only 8 instances max

narrow rampart Jan 15, 2024, 2:35 PM

#

I thought it can spawn threads on async operaitons

#

but it seems it just uses whatever thread I do io_context.run on

reef vault Jan 15, 2024, 2:37 PM

#

well depending on the exact setup the bottom line is you don't want too many concurrent instances and you want them only on the threads that will use it

#

depending on what exactly you do you can have one generator churn out packet of random numbers, push those packets in a queue, and grab packets in different threads

narrow rampart Jan 15, 2024, 2:37 PM

#

I never did such a thing that required a thread pool so idk how many threads to use

#

assuming 1500 people sending few packets a second of sizes about 500 bytes each

fringe hatch Jan 15, 2024, 2:39 PM

#

I think the real question here is: why are you spawning new threads at such high rates that creating an std::mt19937 once per thread becomes performance relevant?

narrow rampart Jan 15, 2024, 2:39 PM

#

fringe hatch I think the real question here is: why are you spawning new threads at such high...

turns out I'm not

sour plank Jan 15, 2024, 2:39 PM

#

wait are you generating a new instance of a rng mt19937 generator when you spawn a new thread? And you are scared that generating mt19937 will be expensive?

reef vault Jan 15, 2024, 2:39 PM

#

well apparently he might not be, but that's not confirmed?

narrow rampart Jan 15, 2024, 2:39 PM

#

I thought asio could spawn a thread to execute the handler

#

but it just uses the threads that I gave it

fringe hatch Jan 15, 2024, 2:40 PM

#

one would hope so ye ^^

narrow rampart Jan 15, 2024, 2:40 PM

#

so the main idea is out of the window I guess

#

now its just to decide how many threads I will need

reef vault Jan 15, 2024, 2:41 PM

#

if you don't know try hardware_concurrency or whatever

#

depends how many threads outside the pool you expect there to be

#

.cppref hardware_concurrency

#

is that not the bot command

distant briarBOT Jan 15, 2024, 2:42 PM

#

cppreference.com

std::thread::hardware_concurrency

static unsigned int
hardware_concurrency() noexcept;

narrow rampart Jan 15, 2024, 2:42 PM

#

is there a number that beyond which adding threads doesnt give performance

#

like above the number of cpu threads or cores

reef vault Jan 15, 2024, 2:43 PM

#

yes there is, but the "exact" number depends on what you're doing

narrow rampart Jan 15, 2024, 2:43 PM

#

and is there any way to approximate that number without trial and error

reef vault Jan 15, 2024, 2:43 PM

#

if most threads are asleep most of the time you can "afford" to have more but it's kinda weird

#

if your threads are always active and want to churn out computation then there's not much point to go beyond what your machine can actually do

narrow rampart Jan 15, 2024, 2:44 PM

#

idk to be honest what asio does under the hood with threads

#

I just know that io_context.run runs its event processing loop

reef vault Jan 15, 2024, 2:44 PM

#

or more like it's detrimental because then the os has to spend more time scheduling and switching between threads

narrow rampart Jan 15, 2024, 2:44 PM

#

how does windows not freeze then

reef vault Jan 15, 2024, 2:45 PM

#

what do you mean

#

like the OS?

#

because it's in control

#

and gets to decide what runs or not

narrow rampart Jan 15, 2024, 2:45 PM

#

from what I see in task manager I ahve 3536 threads running

reef vault Jan 15, 2024, 2:45 PM

#

sure

#

most of them are either sleeping or do not get to do anything

#

unless your cpu is able to concurrently have 3536 threads progress then some of them must be inactive, no question asked

#

whether they are inactive because they have nothing to do and are waiting to be woken up/signalled, or they were forcibly dragged away from the cpu to put a different thread in its place, is unspecified

narrow rampart Jan 15, 2024, 2:47 PM

#

so I should just use the number of CPU threads

#

or cores * threads

#

if I have 8 cores and 16 threads then it means I have 2 threads per core yeah?

radiant jewel Jan 15, 2024, 2:50 PM

#

Depends on what type of execution they run

narrow rampart Jan 15, 2024, 2:50 PM

#

so I should spawn 16 threads

radiant jewel Jan 15, 2024, 2:51 PM

#

If they do performance intensive tasks you should only spawn as many threads as you will have

#

Since these will takeover all your system resources

#

If they are just io or are stopping their execution regularly because they wait for data then spawning more is okay

narrow rampart Jan 15, 2024, 2:52 PM

#

yeah they are io

#

I guess I would have to do trial and error

#thread_local cache