Speeding up accessing the resource partitioner and the topology info #3015
Conversation
hkaiser
commented
Nov 19, 2017
- this reduces the overheads in the scheduler significantly
@@ -47,7 +47,7 @@ namespace hpx { namespace threads { namespace detail
     ///////////////////////////////////////////////////////////////////////////
     mask_cref_type thread_pool_base::get_used_processing_units() const
     {
-        std::lock_guard<pu_mutex_type> l(used_processing_units_mtx_);
+        // std::lock_guard<pu_mutex_type> l(used_processing_units_mtx_);
That lock is needed for concurrent enabling/disabling of PUs
This function is called constantly from a hot loop inside the scheduler. If the lock is really needed we have to find a way not to use this function, as it accounts for about 50% of the speedup I was seeing (5% of the overall runtime).
Back when I first implemented it, I measured the same. The real question, however, is whether we support that use case or not. Leaving out the lock might speed things up, but it also eliminates a use case.
Well, the obvious thing to do is to have different flags set for the schedulers at creation time. If the user knows that they are not hot-swapping PUs from pools, then this functionality is not required; we can add it to the background_work and suchlike flags used at scheduler creation time.
We should add a 'dynamic_pool' flag (or similar) to the pool or scheduler when it is created, and if the user attempts to change it at runtime it can throw if necessary...
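The proposal above could look roughly like the following sketch. All names here (`pool_flags`, `dynamic_pool`, `enable_pu`) are illustrative assumptions, not HPX's actual API:

```cpp
#include <cstddef>
#include <cstdint>
#include <stdexcept>

// Hypothetical flag set checked at pool creation time.
enum pool_flags : std::uint32_t
{
    none         = 0,
    dynamic_pool = 1    // pool allows enabling/disabling PUs at runtime
};

struct thread_pool
{
    std::uint32_t flags_;

    explicit thread_pool(std::uint32_t flags) : flags_(flags) {}

    void enable_pu(std::size_t /*pu*/)
    {
        // Throw if the user attempts a runtime PU change on a pool
        // that was not created as dynamic.
        if (!(flags_ & dynamic_pool))
        {
            throw std::logic_error(
                "pool was not created with dynamic_pool; "
                "runtime PU changes are not supported");
        }
        // ... otherwise perform the locked update of the PU mask ...
    }
};
```

A static pool would then skip the locking entirely on the hot path, since PU changes are rejected up front.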
Alternatively, we could cache the mask inside the scheduler and update it on an as-needed basis, i.e. only whenever PUs have been enabled/disabled.
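The caching idea might be sketched as follows. The mask type and member names are assumptions for illustration (a `std::bitset` standing in for HPX's mask type): the expensive recomputation happens only on the cold enable/disable path, while the hot path just copies the cached value out under the lock.

```cpp
#include <bitset>
#include <cstddef>
#include <mutex>

using mask_type = std::bitset<64>;    // stand-in for the HPX mask type

class scheduler
{
    mutable std::mutex mtx_;
    mask_type used_pus_;    // cached copy, updated only on PU changes

public:
    // Hot path: no recomputation from the topology, just a copy of
    // the cached mask taken under the lock.
    mask_type get_used_processing_units() const
    {
        std::lock_guard<std::mutex> l(mtx_);
        return used_pus_;    // returned by value, safe after unlock
    }

    // Cold path: runs only when a PU is enabled or disabled.
    void set_pu_enabled(std::size_t pu, bool enabled)
    {
        std::lock_guard<std::mutex> l(mtx_);
        used_pus_.set(pu, enabled);
    }
};
```

Note that returning the mask by value (rather than by reference) is what makes the lock meaningful here; the caller never reads shared state after the lock is released.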
Good work. Thanks for spotting this
Force-pushed from d0e95d2 to 7afcce6:
- this reduces the overheads in the scheduler significantly
I wanted to test this, so I fixed the commented-out mutex and amended the commit.
@biddisco, @sithhell: the current code protecting the mask is broken anyway, as the function returns a reference, see here: https://github.com/STEllAR-GROUP/hpx/blob/master/src/runtime/threads/detail/thread_pool_base.cpp#L48-L52. The only option I see now is to implement what @biddisco suggested, namely to introduce a flag for the pools enabling dynamic behavior on demand (besides fixing the way the variable is protected). Thoughts?
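The breakage being pointed out can be illustrated with a minimal sketch (types are stand-ins, not the actual HPX declarations): the `lock_guard` is destroyed when the function returns, so the caller reads through the returned reference with no protection at all, and the only safe variant is to copy the mask out while the lock is held.

```cpp
#include <bitset>
#include <mutex>

using mask_type = std::bitset<64>;        // stand-in for the HPX mask type
using mask_cref_type = mask_type const&;

struct pool_sketch
{
    mutable std::mutex mtx_;
    mask_type used_processing_units_;

    // Broken: the lock is released on return, but the caller still
    // reads through the reference afterwards, unprotected.
    mask_cref_type get_used_processing_units_broken() const
    {
        std::lock_guard<std::mutex> l(mtx_);
        return used_processing_units_;    // lock gone before caller reads
    }

    // Fixed: copy the mask out while the lock is still held.
    mask_type get_used_processing_units() const
    {
        std::lock_guard<std::mutex> l(mtx_);
        return used_processing_units_;    // returned by value
    }
};
```

In other words, the mutex in the current code buys nothing: it protects the duration of the call, not the caller's subsequent reads.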
@sithhell can we go ahead with applying this patch now?
LGTM
I suspect that this was merged using my version, which had the mutex put back - this goes against what @hkaiser wanted, I believe.
@biddisco I think we can merge this, as the mutex problem was there before and the patch improves performance as is. We need to solve it, though, as the current code is both slow and broken.