[Truffle] New implementation of non-small hashes #2328

chrisseaton · 2014-12-16T06:11:36Z

There are two new things here. First, we now have a proper implementation of hash for when there are more than 3 key-value pairs. We need a custom data structure because we have to call the Ruby methods hash and eql?, and we need somewhere to store the state for each call site. We can't re-use JRuby's, for the usual reasons, and with the Truffle style of storage strategies we don't want encapsulation - we want the logic to be in the nodes.

Secondly, I've moved some of the hash logic into a node - the basic operation of finding the right bucket. That lets us store the state, and possibly do some branch profiling and value profiling, while still providing a nice method call interface.

I've added a test to stress hash implementations.

We aren't doing any rebalancing for overloaded indices yet - we never grow the number of slots.
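
To make the shape of the structure concrete, here is a minimal sketch of a bucket-chained hash with a fixed number of slots, written in plain Java. It is not the code from this pull request: the class, field, and method names are all hypothetical, and Objects.hashCode and Objects.equals stand in for the Ruby-level hash and eql? calls that the real nodes would dispatch.

```java
import java.util.Objects;

// Hypothetical sketch; none of these names come from the pull request.
final class ChainedHashSketch {
    // One link in a singly linked chain hanging off a slot.
    static final class Entry {
        final int hashed;
        final Object key;
        Object value;
        Entry next;

        Entry(int hashed, Object key, Object value) {
            this.hashed = hashed;
            this.key = key;
            this.value = value;
        }
    }

    // What a "find the right bucket" operation can return: the slot index,
    // the entry before the match (or the chain tail if there is no match),
    // and the matching entry (or null).
    static final class SearchResult {
        final int index;
        final Entry previous;
        final Entry entry;

        SearchResult(int index, Entry previous, Entry entry) {
            this.index = index;
            this.previous = previous;
            this.entry = entry;
        }
    }

    // Fixed number of slots: like the PR as described, this never grows.
    private final Entry[] slots = new Entry[16];
    private int size;

    SearchResult find(Object key) {
        int hashed = Objects.hashCode(key);                // stand-in for Ruby's hash
        int index = (hashed & 0x7fffffff) % slots.length;
        Entry previous = null;
        for (Entry e = slots[index]; e != null; e = e.next) {
            if (e.hashed == hashed && Objects.equals(e.key, key)) { // stand-in for eql?
                return new SearchResult(index, previous, e);
            }
            previous = e;
        }
        return new SearchResult(index, previous, null);
    }

    Object get(Object key) {
        Entry e = find(key).entry;
        return e == null ? null : e.value;
    }

    void put(Object key, Object value) {
        SearchResult r = find(key);
        if (r.entry != null) {
            r.entry.value = value;                         // overwrite existing key
            return;
        }
        Entry added = new Entry(Objects.hashCode(key), key, value);
        if (r.previous == null) {
            slots[r.index] = added;                        // first entry in this slot
        } else {
            r.previous.next = added;                       // append to the chain tail
        }
        size++;
    }

    int size() {
        return size;
    }
}
```

Because the slot count is fixed, chains simply get longer as entries are added, which is the missing rebalancing mentioned above. The sketch also ignores Ruby's insertion-order iteration.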

eregon · 2014-12-16T10:33:24Z

core/src/main/java/org/jruby/truffle/nodes/core/ArrayNodes.java

+
+        @Specialization
+        public RubyArray uniq(RubyArray array) {
+            notDesignedForCompilation();


MRI uses a temporary Hash to avoid O(n²) here.

So maybe we could have an overload of notDesignedForCompilation() that maps to CompilerAsserts.neverPartOfCompilation(String message), so as to keep track of what is not compilation-ready when that isn't obvious.

Good idea - we can go through and add a reason for all the notDesigneds when we do a spring clean after 0.6 is released.
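
For reference, the temporary-hash approach to uniq mentioned above can be sketched in plain Java as follows. This is only an illustration over java.util collections, not MRI's code or the Truffle node from this diff; the class and method names are hypothetical, and Java's equals/hashCode stand in for Ruby's eql?/hash.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;

// Hypothetical sketch; not the implementation under review.
final class UniqSketch {
    // A LinkedHashSet keeps the first occurrence of each element in encounter
    // order, so each element costs one expected O(1) hash lookup instead of a
    // scan over everything seen so far.
    static <T> List<T> uniq(List<T> items) {
        return new ArrayList<>(new LinkedHashSet<>(items));
    }
}
```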

eregon · 2014-12-16T11:10:25Z

Looks good overall.
Of course HashOperations should disappear, except maybe for some debug stuff, and so should DebugOps.send.

I am slightly uncomfortable with the naming of Bucket for what is a "hashtable entry".
But I understand the need to differentiate from a usual Map.Entry-like entry.
But maybe we don't want such a Map.Entry-like entry at all? Walking directly over what is called Buckets here sounds good (it might use an iterator), except maybe for data race issues.
In my intuition, a bucket is usually an element in the storage array, the array of buckets/slots. We likely don't have such a concept as an object in a practical implementation, as it's just a linked chain of hashtable entries.

eregon · 2014-12-16T11:20:29Z

core/src/main/java/org/jruby/truffle/nodes/core/HashGuards.java

-    public static boolean isOtherObjectArray(RubyHash hash, RubyHash other) {
-        return other.getStore() instanceof Object[];
+        // Arrays are covariant in Java!
+        return hash.getStore() instanceof Object[] && !(hash.getStore() instanceof Bucket[]);


So this means there is no good way, using instanceof alone, to check that the ObjectArray strategy is really using just an Object[]?
But getClass() should do it then, so should the assertions in the RubyHash constructor be adapted?
I also wonder whether 2 instanceof checks are better than 1 getClass().
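
A small plain-Java illustration of the covariance issue discussed here. The Entry class is a hypothetical stand-in for the Bucket class in the diff, and the helper names are made up for this sketch.

```java
// Hypothetical sketch; Entry stands in for the PR's Bucket class.
final class CovarianceSketch {
    static final class Entry {
    }

    // True for Object[] and for every array of a reference type, including
    // Entry[], because Java arrays are covariant.
    static boolean isObjectArrayByInstanceof(Object store) {
        return store instanceof Object[];
    }

    // True only when the runtime class is exactly Object[].
    static boolean isExactlyObjectArray(Object store) {
        return store != null && store.getClass() == Object[].class;
    }
}
```

With these helpers, an Entry[] passes the instanceof check but fails the exact-class check, which is why the guard in the diff needs the extra negated instanceof.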

nirvdrum · 2014-12-16T14:00:16Z

This needs to merge in the change from 7a4ab54.

nirvdrum · 2014-12-16T14:51:42Z

core/src/main/java/org/jruby/truffle/nodes/core/HashNodes.java


-            for (int n = 0; n < RubyHash.HASHES_SMALL; n++) {
+            for (int n = 0; n < HashOperations.SMALL_HASH_SIZE; n++) {
                if (n < size && eqlNode.call(frame, store[n * 2], "eql?", null, key)) {
                    return store[n * 2 + 1];


This code was already here, but maybe it'd be easier to follow if KEY_OFFSET and VALUE_OFFSET constants were used.

Good idea - I'll do that on the master branch later.
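
As a rough idea of what the suggestion could look like, here is a plain-Java sketch of the packed-array lookup with named offsets. KEY_OFFSET and VALUE_OFFSET are the names proposed above; ELEMENTS_PER_ENTRY, the class name, the value 3 for SMALL_HASH_SIZE (taken from the "more than 3 key-value pairs" wording in the description), and the use of Objects.equals in place of the eql? call node are assumptions for illustration.

```java
import java.util.Objects;

// Hypothetical sketch; only KEY_OFFSET and VALUE_OFFSET are names from the review.
final class PackedLookupSketch {
    static final int SMALL_HASH_SIZE = 3;      // assumed value
    static final int ELEMENTS_PER_ENTRY = 2;   // key and value stored side by side
    static final int KEY_OFFSET = 0;
    static final int VALUE_OFFSET = 1;

    // store is laid out as [k0, v0, k1, v1, k2, v2]; size is the number of
    // pairs actually in use.
    static Object get(Object[] store, int size, Object key) {
        for (int n = 0; n < SMALL_HASH_SIZE; n++) {
            if (n < size && Objects.equals(store[n * ELEMENTS_PER_ENTRY + KEY_OFFSET], key)) {
                return store[n * ELEMENTS_PER_ENTRY + VALUE_OFFSET];
            }
        }
        return null;
    }
}
```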

nirvdrum · 2014-12-16T15:15:01Z

core/src/main/java/org/jruby/truffle/nodes/literal/HashLiteralNode.java


-import java.util.LinkedHashMap;
+import java.util.*;


Minor, but JRuby core prefers these be expanded.

nirvdrum · 2014-12-16T15:16:02Z

I'll have to check out the branch to navigate the code, since I'm finding it too annoying in GitHub's web UI. But at first blush this looks pretty good.

chrisseaton · 2014-12-16T15:22:57Z

I renamed the buckets to entries and removed the backwards link in the bucket chain.

chrisseaton added 7 commits December 15, 2014 12:19

[Truffle] Basic new implementation of large hashes.

9810a50

[Truffle] Fix default block in buckets hash.

0eee052

[Truffle] Pull out some hash classes to the top level.

3e82545

[Truffle] Tidy up RubyHash.

579881f

[Truffle] Make finding a bucket a node.

4b6b34a

[Truffle] Some documentation of hash.

3c5c26d

[Truffle] Fix a couple of hash bugs.

18593e6

eregon reviewed Dec 16, 2014

nirvdrum reviewed Dec 16, 2014

[Truffle] Make the bucket chain singly linked.

35c1350

nirvdrum reviewed Dec 16, 2014

[Truffle] Rename buckets entries.

96c8da6

chrisseaton added 3 commits December 16, 2014 15:44

[Truffle] We don't need HashSearchResult as a DSL type at all.

b2a89e6

[Truffle] Change hash guard terminology to PackedArray or Buckets

639ecf6

[Truffle] Make the packed array guard for hash, which is complicated …

2bb0e15

…by covariance, just the negation of the other two.

chrisseaton added a commit that referenced this pull request Dec 17, 2014

Merge pull request #2328 from jruby/truffle-hash

8c9f381

[Truffle] New implementation of non-small hashes

chrisseaton merged commit 8c9f381 into master Dec 17, 2014

chrisseaton deleted the truffle-hash branch December 17, 2014 22:08

enebo added this to the JRuby 9.0.0.0 milestone Dec 22, 2014
