First steps towards implementing execution par_unseq #3063

hkaiser · 2017-12-08T20:28:40Z

This is related to #2271

sithhell · 2017-12-09T20:44:02Z

hpx/parallel/util/loop.hpp

+#elif defined(HPX_INTEL_VERSION) || defined(HPX_CLANG_VERSION)
+#pragma ivdep
+#pragma omp simd
+#endif


I'd like to leave omp out of the picture here since we'd require users to compile with -fopenmp then.

What about other compilers, should we issue a warning?

sithhell · 2017-12-09T20:49:08Z

Would be nice to have some confidence if it really has the expected effects (better code gen) before pushing it further.

hkaiser · 2017-12-09T21:30:50Z

Would be nice to have some confidence if it really has the expected effects (better code gen) before pushing it further.

I agree, but how can we get confidence without actually trying out things?

sithhell · 2017-12-10T05:58:31Z

> Would be nice to have some confidence if it really has the expected effects (better code gen) before pushing it further. I agree, but how can we get confidence without actually trying out things?

Trying it out doesn't mean it has be merged ;)

jeffhammond · 2018-01-21T19:41:28Z

Summary

You can't use OpenMP simd (or for) with a != comparison for loop termination, i.e. the following is invalid (see below), at least as of OpenMP 4.5.

+#if defined(HPX_HAVE_OPENMP_SIMD)
 +#pragma omp simd
 +#endif
 +                for (/**/; it != end; ++it)
 +                {
 +                    f(it);
 +                }

OpenMP 5 is expected to fix this. I don't know the precise status of that proposal but can look if you want.

I recall that Intel 18 compilers do not object to != in an OpenMP for loop (haven't checked SIMD explicitly) but GCC 7 and Clang 5 do.

I would not expect a compiler to attempt to vectorize container accesses if a random access iterator is not supported...

Details

The relevant portions of the OpenMP 4.5 specification are:

Section 2.8.1

The simd directive places restrictions on the structure of the associated for-loops. Specifically, all associated for-loops must have canonical loop form (Section 2.6 on page 53).

Section 2.6

A loop has canonical loop form if it conforms to the following:
for (init-expr; test-expr; incr-expr) structured-block
...
test-expr One of the following:
var relational-op b
b relational-op var
...
relational-op One of the following:
<
<=
>
>=

hkaiser · 2018-01-21T19:50:02Z

@jeffhammond thanks for this information. This branch is really meant to be used for experimentation towards implementing par_unseq, and I admit that I don't have too much experience with this.

jeffhammond · 2018-01-21T19:50:06Z

CMakeLists.txt

+    find_package(OpenMP QUIET)
+    if(OPENMP_FOUND)
+      if("${CMAKE_CXX_COMPILER_ID}" STREQUAL "Intel")
+        hpx_add_compile_flag_if_available(-openmp-simd)


-openmp-simd is deprecated (https://software.intel.com/en-us/node/693432). It was replaced by -qopenmp-simd in version 16 or 17.

I recommend that CMake test for -fopenmp-simd, then -qopenmp-simd then -openmp-simd. ICC supports many of the GCC flags for compatibility...

Ok, thanks.

jeffhammond · 2018-01-21T19:57:08Z

hpx/parallel/util/prefetching.hpp

@@ -253,9 +253,9 @@ namespace hpx { namespace parallel { namespace util
        HPX_FORCEINLINE void prefetch_addresses(T const& ... ts)
        {
            int const sequencer[] = {
-                (_mm_prefetch(
+                0, (_mm_prefetch(


For the non-x86 code path, you may want to use __builtin_prefetch, which both GCC and Clang support.

We can do that, MSVC however supports _mm_prefetch only.

jeffhammond · 2018-01-24T00:17:50Z

@hkaiser These are important experiments, in part because the OpenMP standard working group recognizes that support for C++ in OpenMP is lacking. While 5.0 is pretty close to finished, your experience in implementing PSTL will be useful in identifying gaps that should be addressed in the next iteration.

I don't know what code you allow yourself to look at (due to licensing), but both RAJA and Intel PSTL may be useful. In particular, RAJA supports a bunch of different pragmas for persuading compilers to vectorize inner loops.

msimberg · 2019-03-20T14:44:22Z

Closing this since it's not actively being worked on, but if someone feels like picking this up again, feel free to do so!

hkaiser added category: algorithms type: enhancement labels Dec 8, 2017

hkaiser added this to the 1.1.0 milestone Dec 8, 2017

hkaiser force-pushed the par_unseq branch from 6e09c6e to 7bae646 Compare December 9, 2017 15:09

sithhell reviewed Dec 9, 2017

View reviewed changes

hkaiser force-pushed the par_unseq branch 2 times, most recently from 85ae248 to 5409978 Compare December 15, 2017 15:29

First steps towards implementing execution par_unseq

bd16426

hkaiser force-pushed the par_unseq branch from 5409978 to bd16426 Compare December 15, 2017 17:36

jeffhammond reviewed Jan 21, 2018

View reviewed changes

msimberg removed this from the 1.1.0 milestone Mar 22, 2018

msimberg closed this Mar 20, 2019

msimberg added tag: up for grabs tag: help needed labels Mar 20, 2019

hkaiser deleted the par_unseq branch April 10, 2019 10:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

First steps towards implementing execution par_unseq #3063

First steps towards implementing execution par_unseq #3063

hkaiser commented Dec 8, 2017

sithhell Dec 9, 2017

sithhell Dec 9, 2017

sithhell commented Dec 9, 2017

hkaiser commented Dec 9, 2017

sithhell commented Dec 10, 2017 via email •

edited by hkaiser

jeffhammond commented Jan 21, 2018

hkaiser commented Jan 21, 2018

jeffhammond Jan 21, 2018

hkaiser Jan 21, 2018

jeffhammond Jan 21, 2018

hkaiser Jan 21, 2018

jeffhammond commented Jan 24, 2018

msimberg commented Mar 20, 2019

First steps towards implementing execution par_unseq #3063

First steps towards implementing execution par_unseq #3063

Conversation

hkaiser commented Dec 8, 2017

sithhell Dec 9, 2017

Choose a reason for hiding this comment

sithhell Dec 9, 2017

Choose a reason for hiding this comment

sithhell commented Dec 9, 2017

hkaiser commented Dec 9, 2017

sithhell commented Dec 10, 2017 via email • edited by hkaiser

jeffhammond commented Jan 21, 2018

Summary

Details

hkaiser commented Jan 21, 2018

jeffhammond Jan 21, 2018

Choose a reason for hiding this comment

hkaiser Jan 21, 2018

Choose a reason for hiding this comment

jeffhammond Jan 21, 2018

Choose a reason for hiding this comment

hkaiser Jan 21, 2018

Choose a reason for hiding this comment

jeffhammond commented Jan 24, 2018

msimberg commented Mar 20, 2019

sithhell commented Dec 10, 2017 via email •

edited by hkaiser