Implement Zlib::GzipReader #ungetbyte and #ungetc #4636

haines · 2017-05-31T08:27:18Z

I've had a stab at implementing the unget* methods for Zlib::GzipReader. The implementation is a bit limited in that the maximum number of bytes that can be "ungot" is hardcoded, since I'm using a PushbackInputStream. Also, it doesn't attempt to do anything clever with encoding (although I don't see that as a problem because getc doesn't either).

Let me know what you think!

Closes #4631

haines · 2017-05-31T09:19:53Z

I think the MRI zlib test failure (here) is actually an issue with the implementation of gets("") -- MRI skips the leading newline but JRuby does not.

headius · 2017-05-31T14:32:10Z

@haines You may be right. Do you want to look into it? It may just be some missing logic in our ported GzipReader#gets implementation. I suppose the test passed before because it fell back on io#ungetc which didn't actually unget anything, so the subsequent gets worked properly.

Thanks for your help! I have some review comments I'll add.

headius · 2017-05-31T14:26:56Z

core/src/main/java/org/jruby/ext/zlib/JZlibRubyGzipReader.java

-import org.jruby.RubyIO;
-import org.jruby.RubyNumeric;
-import org.jruby.RubyString;
+import org.jruby.*;


We generally don't collapse imports unless there's >20 or something.

headius · 2017-05-31T14:33:42Z

test/jruby/test_zlib.rb

@@ -517,6 +517,24 @@ def test_error_input
    assert_equal("not in gzip format", e.message)
    assert_equal("foobarzothoge", e.input)
  end
+
+  def test_gzip_reader_ungetc


If these don't exist in either MRI's suite or ruby/spec, it would be nice to add them there. We usually just put JRuby-specific stuff in test/jruby.

Makes sense. There are placeholders for these methods in ruby/spec, so I'll add them there.

Should I submit them to ruby/spec first and then pull them in once merged upstream?

You can submit them to our spec/ruby subdir. We merge both ways.

Make it a separate commit for sure though...makes it easier to merge. 😄

haines · 2017-05-31T21:40:46Z

I've added the specs, which caught a bug in my implementation (I forgot to decrement pos), and a couple in MRI:

ungetting at the start of the stream causes pos to underflow (I've raised this: https://bugs.ruby-lang.org/issues/13616)
ungetting nil raises a TypeError: no implicit conversion of nil. I'm not sure if I should raise this as a bug in MRI or leave the behaviour unspecified. File accepts nil and silently ignores it, so I think GzipReader should behave in the same way for consistency. On the other hand, trying to unget nil is a strange thing to do, so raising a TypeError is fairly reasonable. What do you think?

I'll take a look at the #gets implementation tomorrow (UK time).

haines · 2017-06-01T22:51:00Z

I think this is good to go (assuming you're happy with it!). Turns out #gets("") is supposed to skip any number of leading or trailing newlines, so I've added a spec for that behaviour too.

headius · 2017-06-02T19:34:38Z

Looks good now...I'll merge!

The test suite requires jruby/jruby#4636

eregon · 2017-06-23T12:33:50Z

@headius @haines

ungetting nil raises a TypeError: no implicit conversion of nil. I'm not sure if I should raise this as a bug in MRI or leave the behaviour unspecified. File accepts nil and silently ignores it, so I think GzipReader should behave in the same way for consistency. On the other hand, trying to unget nil is a strange thing to do, so raising a TypeError is fairly reasonable. What do you think?

Either follow what MRI does, or raise a bug, but please not spec different behavior without a bug report.
I cannot keep these specs in ruby/spec since they don't pass on MRI and there is no matching MRI bug.
I'll write a bug report and put these specs in quarantine in the meantime.

eregon · 2017-06-23T12:42:23Z

https://bugs.ruby-lang.org/issues/13675

headius requested changes May 31, 2017

View reviewed changes

headius added this to the JRuby 9.1.11.0 milestone May 31, 2017

haines added 4 commits June 1, 2017 20:29

Implement Zlib::GzipReader#ungetbyte and #ungetc

8d5a5a4

Add specs for Zlib::GzipReader#ungetbyte and #ungetc

f4305a4

Strip newlines when reading paragraphs from Zlib::GzipReader

414fe36

Add spec for Zlib::GzipReader#gets("")

4bd6888

headius merged commit 15899bb into jruby:master Jun 2, 2017

haines deleted the gzip_reader_unget branch June 2, 2017 19:59

haines added a commit to haines/tar that referenced this pull request Jun 6, 2017

Stop building on JRuby until 9.1.11.0 is released

e90793b

The test suite requires jruby/jruby#4636

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GitHub Sponsors

Implement Zlib::GzipReader #ungetbyte and #ungetc #4636

Implement Zlib::GzipReader #ungetbyte and #ungetc #4636

haines commented May 31, 2017

haines commented May 31, 2017 •

edited

Loading

headius commented May 31, 2017

headius May 31, 2017

headius May 31, 2017

haines May 31, 2017

headius May 31, 2017

headius May 31, 2017

haines commented May 31, 2017

haines commented Jun 1, 2017

headius commented Jun 2, 2017

eregon commented Jun 23, 2017

eregon commented Jun 23, 2017

Implement Zlib::GzipReader #ungetbyte and #ungetc #4636

Implement Zlib::GzipReader #ungetbyte and #ungetc #4636

Conversation

haines commented May 31, 2017

haines commented May 31, 2017 • edited Loading

headius commented May 31, 2017

headius May 31, 2017

Choose a reason for hiding this comment

headius May 31, 2017

Choose a reason for hiding this comment

haines May 31, 2017

Choose a reason for hiding this comment

headius May 31, 2017

Choose a reason for hiding this comment

headius May 31, 2017

Choose a reason for hiding this comment

haines commented May 31, 2017

haines commented Jun 1, 2017

headius commented Jun 2, 2017

eregon commented Jun 23, 2017

eregon commented Jun 23, 2017

haines commented May 31, 2017 •

edited

Loading