pythonPackages.tensorflow: add flags for efficent math on CPU #30786

jyp · 2017-10-25T08:58:41Z

Motivation for this change

Make tensorflow more efficient on CPU

Things done

Tested using sandboxing (nix.useSandbox on NixOS, or option build-use-sandbox in nix.conf on non-NixOS)
Built on platform(s)
- NixOS
- macOS
- other Linux distributions
Tested via one or more NixOS test(s) if existing and applicable for the change (look inside nixos/tests)
Tested compilation of all pkgs that depend on this change using nix-shell -p nox --run "nox-review wip"
Tested execution of all binary files (usually in ./result/bin/)
Fits CONTRIBUTING.md.

Mic92 · 2017-10-25T20:10:16Z

pkgs/development/python-modules/tensorflow/default.nix

@@ -7,6 +7,9 @@
 , cudaSupport ? false, nvidia_x11 ? null, cudatoolkit ? null, cudnn ? null
 # Default from ./configure script
 , cudaCapabilities ? [ "3.5" "5.2" ]
+, sse42Support ? false
+, avx2Support ? false
+, fmaSupport ? false


What happens if this is enabled, but not supported by the hardware? Would it still works?

jyp · 2017-10-26T05:52:54Z

Most likely it will crash with 'bus error' or somesuch. Cheers, JP.

…

On Wed, Oct 25, 2017 at 10:10 PM, Jörg Thalheim ***@***.***> wrote: ***@***.**** commented on this pull request. ------------------------------ In pkgs/development/python-modules/tensorflow/default.nix <#30786 (comment)>: > @@ -7,6 +7,9 @@ , cudaSupport ? false, nvidia_x11 ? null, cudatoolkit ? null, cudnn ? null # Default from ./configure script , cudaCapabilities ? [ "3.5" "5.2" ] +, sse42Support ? false +, avx2Support ? false +, fmaSupport ? false What happens if this is enabled, but not supported by the hardware? Would it still works? — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#30786 (review)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AABsYziGAzToQkugrpvoHJHRmkPMtb_Uks5sv5WrgaJpZM4QFqAn> .

abbradar · 2017-11-04T10:18:05Z

@Mic92 No, it wouldn't -- TensorFlow doesn't support dynamic code paths depending on runtime architecture.

Thanks!

pythonPackages.tensorflow: add flags for efficent math on CPU

824cfbb

FRidh approved these changes Oct 25, 2017

View reviewed changes

Mic92 reviewed Oct 25, 2017

View reviewed changes

abbradar merged commit 6269306 into NixOS:master Nov 4, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pythonPackages.tensorflow: add flags for efficent math on CPU #30786

pythonPackages.tensorflow: add flags for efficent math on CPU #30786

jyp commented Oct 25, 2017

Mic92 Oct 25, 2017

jyp commented Oct 26, 2017 via email

abbradar commented Nov 4, 2017

pythonPackages.tensorflow: add flags for efficent math on CPU #30786

pythonPackages.tensorflow: add flags for efficent math on CPU #30786

Conversation

jyp commented Oct 25, 2017

Motivation for this change

Things done

Mic92 Oct 25, 2017

Choose a reason for hiding this comment

jyp commented Oct 26, 2017 via email

abbradar commented Nov 4, 2017