
Fix lifo queue backend #3172

Merged
merged 5 commits on Feb 17, 2018

Conversation

@msimberg (Contributor)

This addresses the issue in #3166.

The issue in this case was that when shutdown_all waits for all threads to finish, it calls suspend(pending), which eventually calls schedule_last for the shutdown_all thread. For the lifo queue backend schedule_last is the same as schedule, so shutdown_all is scheduled as the next thread again, leaving no chance for the remaining threads to run. I suspect we haven't seen this with more than one OS thread because other OS threads can then steal the remaining threads, and/or most tests and examples wait for all futures before calling hpx::finalize.
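To make the distinction concrete, here is a minimal sketch (hypothetical types, not the actual HPX backends) of why a stack cannot honor schedule_last: a stack has a single push operation, so schedule and schedule_last necessarily coincide, whereas a deque can push to either end.

#include <deque>
#include <stack>

// Stack backend: the most recently pushed item is always popped first,
// so there is no way to place an item at the back of the line.
struct stack_backend
{
    std::stack<int> items;
    void schedule(int t)      { items.push(t); }
    void schedule_last(int t) { items.push(t); }  // identical to schedule
    int pop() { int t = items.top(); items.pop(); return t; }
};

// Deque backend: schedule keeps lifo behavior by pushing to the front,
// while schedule_last can genuinely append at the back.
struct deque_backend
{
    std::deque<int> items;
    void schedule(int t)      { items.push_front(t); }
    void schedule_last(int t) { items.push_back(t); }
    int pop() { int t = items.front(); items.pop_front(); return t; }
};

With the stack backend, a thread that reschedules itself via schedule_last (as shutdown_all does through suspend(pending)) is popped again immediately, which is exactly the starvation described above.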

Proposed Changes

  • Use deque for the non-abp lifo and fifo queue backends so that schedule_last can actually schedule a thread last in the queue.
  • Add a test which reproduces the issue (a sketch follows below).
  • Fix some naming in the documentation related to schedulers.

Note: from my preliminary tests deque is neither slower nor faster than stack or queue, but I need to check with more examples to see whether this holds more generally.
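For reference, a sketch of what such a reproducer can look like (structure assumed; this is not the exact test added in this PR): the point is that a task is still pending when hpx::finalize is reached, so shutdown_all has to let it run instead of rescheduling itself first.

#include <hpx/hpx_init.hpp>
#include <hpx/include/async.hpp>

int hpx_main()
{
    // Deliberately not waited on: the task is still pending when
    // finalize is called; with the broken lifo backend, shutdown_all
    // kept rescheduling itself ahead of it and the runtime hung.
    hpx::async([]{});
    return hpx::finalize();
}

int main(int argc, char* argv[])
{
    return hpx::init(argc, argv);
}

Run with something like --hpx:threads=1 --hpx:queuing=local-priority-lifo (the --hpx:queuing flag exists; the exact value name is assumed here); on a single OS thread there is no work stealing, so the hang is deterministic with the old backend.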

Use deque so that schedule_last can actually schedule threads last in the queue.
Both spellings are acceptable, but "queuing" is more common and is the one used for the command line arguments.
@@ -38,6 +38,9 @@ template class HPX_EXPORT hpx::threads::detail::scheduled_thread_pool<
template class HPX_EXPORT hpx::threads::detail::scheduled_thread_pool<
hpx::threads::policies::local_priority_queue_scheduler<hpx::compat::mutex,
hpx::threads::policies::lockfree_abp_fifo>>;
template class HPX_EXPORT hpx::threads::detail::scheduled_thread_pool<
hpx::threads::policies::local_priority_queue_scheduler<hpx::compat::mutex,
hpx::threads::policies::lockfree_abp_lifo>>;
#endif
@hkaiser (Member) Feb 13, 2018

If you add this specialization then you probably should add a command line option for it as well.

@msimberg (Contributor, Author)

Yes, you're right, will add that.

@hkaiser (Member)

OTOH, instead of adding more schedulers to HPX core, we should consider creating a better mechanism (using the thread-pools) allowing for all schedulers (except the default one) to live outside of HPX core. What do you think?

@msimberg (Contributor, Author)

I haven't thought about it too much, but yes, that would probably be a good thing. How far outside HPX core are you thinking? This probably goes hand in hand with having a nicer way to create thread pools in general.

@hkaiser (Member) commented Feb 13, 2018

Is the underlying assumption for this PR that boost::lockfree::stack is broken?

@msimberg (Contributor, Author)

No, not at all. It's simply that the scheduling loop relies on schedule_last to actually schedule a task last, but with a stack that is by definition not possible.

@hkaiser (Member) left a comment

LGTM, thanks!

@biddisco (Contributor)

Note that #2705 also mentions scheduler cleanup. Some schedulers should be removed and we should consider replacing the default scheduler.

@msimberg (Contributor, Author)

To follow up on the performance, from what I can tell there is no practical difference between using a deque, stack or queue (tested with lots of small tasks with for (...) hpx::async([](){}); and the stencil examples). I think this is safe to merge without causing a performance hit for the default scheduler.
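For reference, a minimal sketch of that kind of micro-benchmark (assumed setup, not the exact code behind these numbers): spawn a large number of empty tasks so that scheduling overhead dominates, then wait for all of them.

#include <hpx/hpx_main.hpp>
#include <hpx/include/async.hpp>
#include <hpx/include/lcos.hpp>

#include <vector>

int main()
{
    constexpr int num_tasks = 100000;

    std::vector<hpx::future<void>> futures;
    futures.reserve(num_tasks);

    // Empty tasks: any measurable difference between the deque, stack
    // and queue backends would come from queue operations alone.
    for (int i = 0; i != num_tasks; ++i)
        futures.push_back(hpx::async([]{}));

    hpx::wait_all(futures);
    return 0;
}

Comparing wall-clock times of such a run under each backend is the kind of check described above.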

I'm also taking the liberty of quoting @biddisco from IRC, who says there is no difference for his particular application:

13:15 <jbjnr> at the moment, abp_lifo
13:15 <heller_> deque is slower since the implementation uses dcas, which is not lockfree
13:15 <heller_> IIRC
13:15 <jbjnr> I vary them from time to time to experiment, but don't seem to get much difference either way

@hkaiser (Member) commented Feb 15, 2018

Just a side remark - I don't really trust the ABP schedulers (perhaps without reason), so we may not want to use one of those as the default scheduler.
