Semaphore Recycler Pattern | Together C & C++ | Page 1

mighty forge Dec 6, 2024, 4:41 AM

#

I am using an llvm::ThreadPool to dispatch tasks that process source files for my compiler front-end. The processing tasks use memory buffers that I would like to reuse across tasks. My idea is to use the recycler pattern with a semaphore for synchronization, but I have never used a semaphore before. How would I use a semaphore from the C++ standard library to check out instances of a struct from a std::vector across multiple threads? After a task finishes on one thread, it should give its resources back to the recycler vector, and when that same thread picks up a new source processing task, it should ask the recycler vector for a resource object again (possibly the same one). Obviously, I want no data races. I have never done this kind of thing before. I can initially set the size of the recycler vector to thread_pool.getMaxConcurrency(), which will ensure that the vector always contains enough resource objects for all active tasks.

mighty forge Dec 6, 2024, 4:56 AM

#

Look here to see what I am talking about: https://github.com/Journeyman1337/requite_cxx/blob/471b728ebac74b7a9c0eefa9a5491ca5032db3f0/src/run.cpp#L37

magic falcon Dec 6, 2024, 5:58 AM

#

i've only used semaphores on windows... but i presume it's similar. a semaphore is like a bouncer at a nightclub with his little counter in hand. he uses it to check whether the club is full, then permits people (tasks) to enter. your vector is more like the one barman who now has 20 people to serve and can only serve one at a time. to access the bar (the vector) you must first acquire the barman's undivided attention (a lock).

#

the bouncer at the door and the barman have nothing to do with each other!

#

it's 2 separate problems

slate field Dec 6, 2024, 6:04 AM

#

mighty forge I am using a llvm::ThreadPool to dispatch tasks to process source files for my c...

It sounds like what you want is just a parallel queue

#

You can implement one with semaphores, but also maybe just use an existing implementation

past zephyr Dec 6, 2024, 6:07 AM

#

you can see an example of the simplest possible threadsafe/parallel ringbuffer here: https://gist.github.com/vorlac/8941f2f3549165d9a5e7db26e6d04fc6#file-producer_consumer_example-cpp-L61-L162

#

and an example of a thread pool that can use that same ring buffer to pass tasks (functors in this case) to the thread pool to process as they appear in the buffer: https://gist.github.com/vorlac/f3c1e4641d438e1856990a65269da2ba

#

that last snippet just uses some member functions that don't exist in the ring_buffer from the first link, but all they do is track the reader/writer threads working with the queue so it knows when it's safe to exit the loop

#

the only part missing is another thread, which would be the queue "writer(s)" pushing the tasks to the buffer to send them to the threadpool

mighty forge Dec 7, 2024, 4:22 AM

#

I made this thing https://github.com/Journeyman1337/requite_cxx/blob/8eb0e7b603fedfb6607543dcee4e805f51ec66c8/include/requite/scratch.hpp#L12

#

example of usage

#

https://github.com/Journeyman1337/requite_cxx/blob/8eb0e7b603fedfb6607543dcee4e805f51ec66c8/src/run.cpp#L26

#

Is it good? What do you think.

past zephyr Dec 7, 2024, 4:28 AM

#

the references might be an issue. it seems like that's the data that's planned to be shared across threads (which is what you're synchronizing around in _claimDataAtomic()), but at a quick glance i didn't see how it would be atomic. typically a safe way to do it is to transfer ownership of the data from thread -> queue -> some other thread. holding onto an unsynchronized reference/pointer into the shared data is probably going to be an issue unless i missed something.
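A minimal sketch of the ownership-transfer idea, with the pool playing the role of the queue. `Scratch` and `OwningPool` are hypothetical names for this illustration. The pool holds `std::unique_ptr`s, a task moves one out while it works, and moves it back when done, so no thread ever keeps a pointer into storage the pool still owns.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <mutex>
#include <utility>
#include <vector>

// Hypothetical resource type.
struct Scratch {
    std::vector<char> buffer;
};

class OwningPool {
public:
    explicit OwningPool(std::size_t count) {
        for (std::size_t i = 0; i < count; ++i) {
            idle_.push_back(std::make_unique<Scratch>());
        }
    }

    // Moves one object out of the pool; returns nullptr if none is idle.
    std::unique_ptr<Scratch> take() {
        std::lock_guard lock(mutex_);
        if (idle_.empty()) {
            return nullptr;
        }
        std::unique_ptr<Scratch> item = std::move(idle_.back());
        idle_.pop_back();
        return item;
    }

    // Moves the object back in; the caller's pointer is null afterwards.
    void give_back(std::unique_ptr<Scratch> item) {
        assert(item != nullptr);
        std::lock_guard lock(mutex_);
        idle_.push_back(std::move(item));
    }

    std::size_t idle_count() {
        std::lock_guard lock(mutex_);
        return idle_.size();
    }

private:
    std::mutex mutex_;
    std::vector<std::unique_ptr<Scratch>> idle_;
};
```

The trade-off compared with handing out references into a fixed vector is one heap allocation per object up front. In exchange, the type system makes it hard to touch an object that has already been given back.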

#

i should have a bit more time later so i can take a look at it then

mighty forge Dec 7, 2024, 4:29 AM

#

past zephyr i should have a bit more time later so i can take a look at it then

The vector is never resized. The Scratch type uses RAII to give it back automatically

#

it doesn't actually move out of the vector, but no other threads will be accessing the data at that position at the same time
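The approach described here (a fixed-size vector plus an RAII handle that gives the slot back) could be sketched roughly as follows. This is not the code from the linked repository. `LeasePool` and `Lease` are hypothetical names, and the sketch throws instead of waiting when every slot is taken.

```cpp
#include <cassert>
#include <cstddef>
#include <mutex>
#include <stdexcept>
#include <vector>

// Hypothetical resource type.
struct Scratch {
    std::vector<char> buffer;
};

class LeasePool {
public:
    // RAII handle: marks its slot as free again when it is destroyed.
    class Lease {
    public:
        Lease(LeasePool& pool, std::size_t index) : pool_(&pool), index_(index) {}
        Lease(Lease&& other) noexcept : pool_(other.pool_), index_(other.index_) {
            other.pool_ = nullptr;  // the moved-from lease must not release the slot
        }
        Lease(const Lease&) = delete;
        Lease& operator=(const Lease&) = delete;
        Lease& operator=(Lease&&) = delete;
        ~Lease() {
            if (pool_ != nullptr) {
                pool_->give_back(index_);
            }
        }
        Scratch* operator->() const { return &pool_->slots_[index_]; }

    private:
        LeasePool* pool_;
        std::size_t index_;
    };

    // The vectors are sized once here and never resized afterwards.
    explicit LeasePool(std::size_t count) : slots_(count), in_use_(count, false) {}

    // Claims the first free slot; throws if every slot is already leased.
    Lease claim() {
        std::lock_guard lock(mutex_);
        for (std::size_t i = 0; i < in_use_.size(); ++i) {
            if (!in_use_[i]) {
                in_use_[i] = true;
                return Lease(*this, i);
            }
        }
        throw std::logic_error("more leases requested than slots");
    }

    std::size_t in_use_count() {
        std::lock_guard lock(mutex_);
        std::size_t n = 0;
        for (bool used : in_use_) {
            n += used ? 1 : 0;
        }
        return n;
    }

private:
    void give_back(std::size_t index) {
        std::lock_guard lock(mutex_);
        assert(in_use_[index]);
        in_use_[index] = false;
    }

    std::vector<Scratch> slots_;
    std::vector<bool> in_use_;
    std::mutex mutex_;
};
```

Only the bookkeeping (`in_use_`) is locked. The `Scratch` object itself is accessed without a lock, which is fine as long as a slot is leased to one thread at a time and the lease does not outlive the pool.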

past zephyr Dec 7, 2024, 4:30 AM

#

that doesn't matter as much as another thread potentially touching that data at the same time

#

ok, as long as you can guarantee that, you probably don't even need synchronization if each thread will only ever be touching data unique to itself that no other thread can access
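One way to get "data unique to each thread" without any pool is `thread_local` storage. This is an alternative technique, not what the linked code does, and `thread_scratch` and `process_source` are hypothetical names for this sketch. Each worker thread lazily gets its own `Scratch` and keeps reusing it for every task it runs.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical resource type.
struct Scratch {
    std::vector<char> buffer;
};

// One instance per thread, constructed the first time that thread calls this.
Scratch& thread_scratch() {
    thread_local Scratch scratch;
    return scratch;
}

// Hypothetical task body: refills the calling thread's own buffer.
std::size_t process_source(std::size_t source_size) {
    Scratch& scratch = thread_scratch();
    scratch.buffer.assign(source_size, 'x');
    assert(scratch.buffer.size() == source_size);
    return scratch.buffer.size();
}
```

No locks or semaphores are needed because no two threads can reach the same object. The cost is that each buffer lives until its thread exits, and a reference returned by `thread_scratch()` must not be handed to another thread.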

mighty forge Dec 7, 2024, 4:31 AM

#

past zephyr ok as long as you can guarantee that you probably don't really even need synchro...

this->_scheduler_ptr is a thread pool

#

the number of threads in the pool is variable. the number of tasks is variable. This is a compiler, and at this stage there is one task per source and header file

#

the scratch pool has the same number of entries as there are threads in the pool, which is what the getMaxConcurrency() call returns.

past zephyr Dec 7, 2024, 4:33 AM

#

ok yeah i'll take a closer look at it a bit later then to check out what it's doing. i was only able to take a quick glance

mighty forge Dec 7, 2024, 4:33 AM

#

so there should never be more running tasks than scratch data available in the pool. If more tasks are submitted than getMaxConcurrency(), the extra tasks should wait in line until one of the threads in the pool finishes a different task.

past zephyr Dec 7, 2024, 4:36 AM

#

you should get in the habit of using managed pointers. there's no reason not to nowadays unless you specifically wanted to work with everything using raw pointers in this project

mighty forge Dec 7, 2024, 4:43 AM

#

past zephyr you should get in the habit of using managed pointers. there's no reason not to ...

I do

#

I could possibly use std::ref I guess

#

Done
