
Suspend speedup #3137

Merged
merged 9 commits into from Feb 7, 2018

Conversation

msimberg (Contributor) commented Feb 2, 2018

This speeds up #3129.

Proposed Changes

  • Adds two member functions, suspend_direct and resume_direct, to thread_pool_base, which call suspend/resume_internal directly instead of spawning a new thread for suspending/resuming.
  • Uses suspend_direct and resume_direct in threadmanager::suspend/resume to speed up suspend/resume.
  • First notifies all threads that they should suspend/resume, and only then makes the blocking calls to suspend/resume_processing_unit_internal, which wait for the threads to finish suspending/resuming.

I will post some more detailed benchmarks later, but rough performance numbers on a dual-socket 18-core Xeon machine (depending, of course, on the number of threads used) are:

  • HPX start/stop: 50 ms
  • HPX suspend/resume: 500 µs
  • OpenMP: entering and exiting a #pragma omp parallel for region: 100 µs

@hkaiser hkaiser added this to the 1.1.0 milestone Feb 2, 2018
hkaiser (Member) commented Feb 2, 2018

Nice! Thanks!

    int runtime_impl::suspend()
    {
        std::uint32_t initial_num_localities = get_initial_num_localities();
        if (initial_num_localities > 1)
hkaiser (Member):
What is the rationale of this limitation? Wouldn't it be perfectly ok to suspend the runtime on one of the nodes at a time?

msimberg (Contributor, author):

Simply that I haven't tested it in those situations, and I did not think it would be useful. But you're right, it should be okay in those cases as well. Do you see a use case for it?

hkaiser (Member):

Providing a migration path for distributed MPI+X applications.

hkaiser (Member):

This does not have to be part of this PR, however.

msimberg (Contributor, author):

Ok, I'd like to keep it separate then, so that it can go in with appropriate tests/examples showing how it would be used (and to see if there are any corner cases to be aware of).

    hpx::util::detail::yield_while([rt]()
    {
        return rt->get_state() < hpx::state_running;
    }, "");
hkaiser (Member):
This tells me that a) yield_while shouldn't be in the detail namespace, and b) its second argument should have a default value (or it has to be exposed through a different API).

msimberg (Contributor, author):

Fair point. Would it be okay if I move only yield_while into hpx/util/yield_while.hpp and leave yield_k where it is? I see yield_k as being a lower level helper function.

hkaiser (Member):

Sounds good to me. Could be done in a separate PR, though.

msimberg (Contributor, author):

Ok, good. I'll do this in a separate PR as soon as this is merged.

hkaiser (Member) left a review comment:

LGTM, thanks!

@msimberg msimberg merged commit 4fb4a3d into STEllAR-GROUP:master Feb 7, 2018
msimberg (Contributor, author) commented Feb 15, 2018

For the record, this is what the timings look like. This is with the balanced thread binding, so one can see bumps at 19 and 37 threads, where execution spills over to the second NUMA domain and into hyperthreading, respectively.

The OpenMP benchmark is a #pragma omp parallel for region doing a simple addition to keep the work from being optimized away. For fairness, HPX spawns a number of empty tasks equal to the number of threads before suspending. Plain suspend/resume without any tasks running is at least a factor of two faster than shown in the graph.

[Figure: runtime-suspension]

There is most likely some room to improve further, but this is already good enough for blocks of work of at least 0.1 s.

Linear y-axis:
[Figure: runtime-suspension-linear]

Linear y-axis without HPX start/stop:
[Figure: runtime-suspension-linear-no-start-stop]

hkaiser (Member) commented Feb 15, 2018

@msimberg very nice results! Would you mind adding the benchmark itself to the tests/performance/local folder in the repo?

msimberg (Contributor, author):
Don't mind at all, will add them soon.
