Add a macro to `collect` outputs of a yielding method to an array #4813

oprypin · 2017-08-09T00:30:50Z

Just having some fun reducing code duplication.

Obviously this can be improved and documented, I'm just curious about initial feedback.

Papierkorb · 2017-08-09T00:36:10Z

I like the idea, but I don't think this should be a global macro. #collect is also a fairly common name, which could end up confusing users. Maybe Array would be a good place, considering it "returns" an array?

Array.collect each_byte looks clearer to me than a mere collect each_byte.

oprypin · 2017-08-09T00:39:44Z

Yeah, I had a thought like that but assumed it wouldn't work (that macros can only be used unqualified or something).

jhass · 2017-08-09T06:21:44Z

Ideally this would be something like Iterator#to_a, I think; that is, we would have an iterator for everything and could just call to_a on it. Meanwhile, I'm not sure I'm a big fan of the collect name; maybe just Array.from or Array.of or so.

oprypin · 2017-08-09T07:05:27Z

Well sure, having an iterator for every yielding method would be amazing, but that topic has been neglected, and manually implementing those iterators is no fun.

Disagree on your naming ideas.

RX14 · 2017-08-09T09:12:40Z

src/macros.cr

+
+class Array(T)
+  macro collect(expr)
+    arr = [] of typeof(begin

Why no macro variables for the array? We don't want to leak the arr name.

oprypin (reply): Right...
The rest of this is such an eyesore that I didn't realize.
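
To make the leak concrete, here is a minimal sketch contrasting a plain local name with a fresh macro variable (`%arr`). The macro names `collect_leaky` and `collect_fresh` are hypothetical and exist only for this illustration, and the element type is hard-coded to Int32 to keep the sketch short; this is not the code from the PR.

```crystal
# Hypothetical macro: `arr` is an ordinary local, so it is written
# into the scope of whoever invokes the macro.
macro collect_leaky(expr)
  arr = [] of Int32
  {{expr}} do |x|
    arr << x
  end
  arr
end

# Hypothetical macro: `%arr` is a fresh variable, so each expansion
# gets its own generated name that cannot clash with the caller's.
macro collect_fresh(expr)
  %arr = [] of Int32
  {{expr}} do |x|
    %arr << x
  end
  %arr
end

arr = ["untouched"]

fresh = collect_fresh(3.times)
fresh # => [0, 1, 2]
arr   # => ["untouched"]

leaky = collect_leaky(3.times)
leaky # => [0, 1, 2]
arr   # => [0, 1, 2]  (the caller's variable was silently overwritten)
```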

straight-shoota · 2017-08-09T11:01:27Z

src/macros.cr

@@ -83,6 +83,25 @@ macro record(name, *properties)
  end
 end

+private record TypeSentinel

private record does not work; better to use struct directly.

oprypin (reply): You are right.

Sucks that it doesn't work and doesn't error.

konovod · 2017-08-09T11:18:48Z

The method is great, and Array.collect looks like the right name for it, but it will surely confuse people coming from Ruby (where collect is a synonym for map).

straight-shoota · 2017-08-09T11:19:17Z

I'd suggest adding an option to define the generic type of the array directly instead of inferring it as a union of the yielded types:

macro collect(type, expr)
  %arr = [] of {{type}}
  {{expr}} do |x|
    %arr << x
  end
  %arr
end

This makes it clearer what array type is returned and allows fine-grained control over it. Sometimes you don't want an array of unioned subtypes (or "plus type") but rather explicitly an array of a parent type.
I'd even argue that this might be preferred as the default variant (or even replace the current one altogether). Inferring is nice, but stating the type directly is more expressive. It also ensures the array type is independent of what the block yields.
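
As a rough usage sketch of the explicit-type variant: the macro below is the one proposed above, but renamed to a hypothetical top-level `collect_as` so the example is self-contained. The `Animal`/`Dog` classes and the `each_pet` method are also made up for illustration.

```crystal
# Hypothetical name; same body as the explicit-type macro proposed above.
macro collect_as(type, expr)
  %arr = [] of {{type}}
  {{expr}} do |x|
    %arr << x
  end
  %arr
end

class Animal
end

class Dog < Animal
end

# A hypothetical yielding method that only ever yields the subtype.
def each_pet
  yield Dog.new
  yield Dog.new
end

# The element type is stated by the caller, not inferred from the block.
pets = collect_as(Animal, each_pet)
pets.size    # => 2
typeof(pets) # => Array(Animal)

# Because the array is typed by the parent, other Animals can be added later.
pets << Animal.new
```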

oprypin · 2017-08-09T11:27:36Z

My previous implementation of this idea had the name "gather" but that's not what it means. It also was more opinionated and wrapped the function instead, so less flexibility, especially for docs. But I can't think of a way to specify the type explicitly like I did there, in a way that doesn't look goofy.

oprypin · 2017-08-09T11:29:32Z

Wonder if it would be possible to get the type from Array(T).collect(expr)

EDIT: Nope

Papierkorb · 2017-08-09T11:35:21Z

@konovod I think the name collect is perfectly fine and descriptive. And even in Ruby, one never writes Array.collect, so it shouldn't be confusing either.

@straight-shoota No idea honestly how common it would be that people would want to use a different type. Still, that explicit-type version macro is much nicer to read, and seeing what kind of Array will be returned is also nice. So 👍 from me.

ysbaddaden · 2017-08-09T15:49:40Z

I'm not sure I can like this. Creating an empty array, iterating and pushing values to it is boring and repetitive, but it's descriptive as to what's happening. This solution may look interesting but it's complex to understand and confusing.

I prefer alternative methods that return an iterator that implements #to_a, just like Array does:

[1, 2, 3].cycle(2).to_a
[1, 2, 3, 4, 5].each_cons(3).to_a
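
For comparison, a short sketch of the two styles under discussion, using each_cons because it has both a block form and an iterator form. The variable names are illustrative only.

```crystal
# Iterator form: without a block, each_cons returns an Iterator,
# and to_a collects everything it produces.
from_iterator = [1, 2, 3, 4].each_cons(2).to_a
from_iterator # => [[1, 2], [2, 3], [3, 4]]

# Block form: the caller creates the array and pushes into it by hand.
from_block = [] of Array(Int32)
[1, 2, 3, 4].each_cons(2) { |pair| from_block << pair }
from_block # => [[1, 2], [2, 3], [3, 4]]
```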

oprypin · 2017-08-09T15:56:34Z

The dream solution is automatically implementing Iterators based on yielding methods.
Without that, to make this a reality, one needs to write an Iterator (a ton of boilerplate) for every yielding method.
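
To give a sense of the boilerplate involved, here is a sketch of one yielding method next to a hand-written Iterator that produces the same values. `countdown` and `CountdownIterator` are hypothetical names, not part of the standard library.

```crystal
# The yielding method: a few lines.
def countdown(start : Int32)
  start.downto(1) { |i| yield i }
end

# The equivalent Iterator: a class, explicit state, and a `next`
# method that returns `stop` once the values are exhausted.
class CountdownIterator
  include Iterator(Int32)

  def initialize(@current : Int32)
  end

  def next
    return stop if @current < 1
    value = @current
    @current -= 1
    value
  end
end

CountdownIterator.new(3).to_a # => [3, 2, 1]
```

Every yielding method that should support to_a this way would need a class like this one.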

straight-shoota · 2017-08-09T17:49:37Z

Maybe it'd be easier to understand if you'd have to create the array manually but have a shortcut for adding yielded elements: Array(Int32).new.collect { each } where #collect would call the method in the block, adding all yielded values to self, and return self.
But I don't know if this could be implemented somehow.

Fryguy · 2017-08-13T01:37:44Z

I agree with @ysbaddaden. We already have Iterator#to_a, so I'm not sure why we need another style... unless this is providing something different?

Personally, I'm not a fan of top-level (or nearly top-level) functions over methods on objects. It reminds me of Python's built-in functions, which I don't really like (len(ary) as opposed to ary.len).

Fryguy · 2017-08-13T01:48:48Z

spec/std/string_spec.cr

@@ -1752,8 +1752,7 @@ describe "String" do
    end

    it "works with strings with block" do
-      res = [] of String
-      "bla bla ablf".scan("bl") { |s| res << s }
+      res = Array.collect "bla bla ablf".scan("bl")


This particular test's purpose is to test scan with a block, so I don't think this one should change. (Yeah, it's using a block via the collect macro, but that obscures the test itself.)

straight-shoota (reply): In this case it is better to be more expressive so the purpose is clear.

RX14 · 2017-08-13T08:25:17Z

The reason is that not every method with a block has a corresponding iterator. And even if they do, not all those iterators are the same speed as their block-based counterparts.

straight-shoota · 2017-08-13T14:04:05Z

Collecting with blocks should always be faster than Iterator#to_a because there is no need to initialize an Iterator instance.

Fryguy · 2017-08-14T16:38:56Z

Quoting @RX14:

The reason is that not every method with a block has a corresponding iterator.

Ah ok, I think what threw me is that nearly every change in this PR is on methods that already return an iterator (cycle, glob, each_slice, etc). The only one in these changes that doesn't return an iterator is the downcase method.

akzhan · 2017-08-14T19:23:10Z

It failed on Travis CI and has merge conflicts.

bew · 2017-09-26T14:36:42Z

Quoting @oprypin:

Wonder if it would be possible to get the type from Array(T).collect(expr)

EDIT: Nope

Maybe related to #5023

asterite · 2017-09-26T14:37:54Z

Even though I like this, and I'm glad it's possible to do, I'm not sure this should go into the std. I'd like to have general iterators related to yield but it's super hard. Maybe collecting with x = [] of T; x << elem is not that bad for now.

asterite · 2017-09-29T12:36:52Z

(closing because of the above comment)

oprypin force-pushed the collect branch from f58b39c to 2facf80 Compare August 9, 2017 00:38

oprypin force-pushed the collect branch from 2facf80 to 611c6e9 Compare August 9, 2017 00:47

RX14 reviewed Aug 9, 2017

Add a macro to collect outputs of a yielding method to an array

88147e4

oprypin force-pushed the collect branch from 611c6e9 to 88147e4 Compare August 9, 2017 09:17

straight-shoota requested changes Aug 9, 2017

oprypin added 2 commits August 9, 2017 15:52

Fix private struct

059ae93

Update more instances

e052632

Fryguy reviewed Aug 13, 2017

asterite closed this Sep 29, 2017

bew mentioned this pull request Jul 6, 2018

Generator functions #4438

Open

oprypin mentioned this pull request Feb 23, 2021

Add spec helper it_iterates for iteration methods #10158

Merged

straight-shoota mentioned this pull request Oct 22, 2022

Add Enumerable(T)#to_a(& : T -> U) forall U #12643

Closed
