Reimplement NamedTuble#fetch(String) using a Trie #3174

lbguilherme · 2016-08-19T12:23:14Z

Instead of comparing each string key one by one, build a Prefix Tree at compile-time and make a single comparison per char. This improves performance by 16x on large named tuples and about 1.5x on smaller ones.

Large benchmark: https://gist.github.com/lbguilherme/c7249c408d2a226bdbb6892dda5e9ab2
Small benchmark: https://gist.github.com/lbguilherme/7dba63104c4b44d3adcd5bcc9f9d7c20

See #3143 and #2966

Instead of comparing each string key one by one, build a Prefix Tree at compile-time and make a single comparison per char. This improves performance by 16x on large named tuples and about 1.5x on smaller ones.

bcardiff · 2016-08-19T13:59:50Z

I like this 👍 . Since you have a branch in the logic for more than 16 keys of the same size I would say we need some specs to cover that. Just to be sure it does not break in the future.

Let's see if some else agree with this before.

asterite · 2016-08-19T15:01:30Z

I'm not sure about this, a named tuple is not to be used as a hash. They represent named arguments, to a method, and methods usually don't have many arguments. I'd rather have a short and simple implementation, even if a bit slower, than a complex, macro heavy one.

I'd need to find a real use case where big named tuples are used, and indexed with an arbitrary key.

straight-shoota · 2017-06-28T17:55:54Z

If you have such a big datastructure, I don't see a point in using a NamedTuple as well.
A Hash has about the same lookup speed anyway, in this example it's even faster. But I had to cut down the number of entries in the NamedTuple since the compiler maxes at 300.

check every key   2.99  (334.04ms) (±24.93%) 341.02× slower
    trie lookup  34.87  ( 28.68ms) (± 9.12%)  29.28× slower
           hash   1.02k (979.55µs) (±11.60%)        fastest

https://gist.github.com/straight-shoota/cb903575c6dd86d1c6c86c372d842666

akzhan · 2017-06-28T18:26:43Z

NamedTuple looks like returning value pattern, and eats less memory afair.

No more.

asterite · 2017-09-29T12:19:08Z

Closing because too complex, and NamedTuple shouldn't normally be used.

Reimplement NamedTuble#fetch(String) using a Trie

6a6b335

Instead of comparing each string key one by one, build a Prefix Tree at compile-time and make a single comparison per char. This improves performance by 16x on large named tuples and about 1.5x on smaller ones.

asterite closed this Sep 29, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reimplement NamedTuble#fetch(String) using a Trie #3174

Reimplement NamedTuble#fetch(String) using a Trie #3174

lbguilherme commented Aug 19, 2016 •

edited

Loading

bcardiff commented Aug 19, 2016 •

edited

Loading

asterite commented Aug 19, 2016

straight-shoota commented Jun 28, 2017

akzhan commented Jun 28, 2017

asterite commented Sep 29, 2017

Reimplement NamedTuble#fetch(String) using a Trie #3174

Reimplement NamedTuble#fetch(String) using a Trie #3174

Conversation

lbguilherme commented Aug 19, 2016 • edited Loading

bcardiff commented Aug 19, 2016 • edited Loading

asterite commented Aug 19, 2016

straight-shoota commented Jun 28, 2017

akzhan commented Jun 28, 2017

asterite commented Sep 29, 2017

lbguilherme commented Aug 19, 2016 •

edited

Loading

bcardiff commented Aug 19, 2016 •

edited

Loading