Use a single write for IO#puts #3183

cheald · 2015-07-24T19:37:07Z

This is a fix for #3182

Avoids interleaved writes across threads:

require 'thread'
8.times.map do
  Thread.new do
    1000.times do
      $stdout.puts "line"
      $stdout.fsync
    end
  end
end.map(&:join)

This can currently produce output like:

lineline


linelineline


lineline

lineline

After patching, output is consistently:

line
line
line
line
line
line
line
line
line
line
line
line

…eads

headius · 2016-11-02T13:10:31Z

This one came up on IRC today.

I'm not opposed to this, but currently we implement it the same way MRI does (nearly line-for-line). I'd like to know if there's a reason it's implemented that way, since there are even specs and tests that check for the separate writes.

headius · 2016-11-02T13:26:23Z

Oh, I did think of one explanation: zero-allocation puts. If you have to stick the \n on a buffer to do it as one write, you'll probably have to allocate a new buffer.

Our puts logic currently isn't zero-alloc, but it could be.

enebo · 2016-11-02T14:55:00Z

@headius I think this has languished because of the fear of the unknown. In most of our optimizations of reducing dyncalls we cannot observe a change in behavior. The behavior @cheald fixes as was pointed out in original report is what MRI does as well (weird interleaved output). So even if this is faster it will change behaviorally. If we think no one will really want this behavior (and that is likely true), then we need to consider test suites mocking to make sure there are two write() calls. The only way of getting around that would be to check if this is being called from builtin version of the method. Seems like this is a ways beneath that point. We could pass that and then remove the cost?

headius · 2016-11-02T16:27:01Z

The cost of the two dynamic calls should be close to negligible at this point, since we added call-site caching. The only real benefit is performing the write in one go.

An alternative would be to lock the IO around the puts logic, but that would also need to check if write had been replaced (since there's no IO-locking semantics in MRI puts).

headius · 2016-11-02T18:59:34Z

I don't think we'll do this. If you are using an IO across threads, it guarantees only that its internal logic and buffering will be thread-safe, not that any given call will be atomic (and indeed, even write calls might get turned into multiple native calls). If you want atomicity of groups of writes, including implicit cases like puts, you can introduced a lock or use the JRuby::Synchronized module.

Use a single write for IO#puts to avoid interleaved writes across thr…

14dd591

…eads

enebo force-pushed the master branch from 2beed09 to 1ee9007 Compare November 23, 2015 22:48

headius closed this Nov 2, 2016

headius added this to the Invalid or Duplicate milestone Nov 2, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use a single write for IO#puts #3183

Use a single write for IO#puts #3183

cheald commented Jul 24, 2015

headius commented Nov 2, 2016

headius commented Nov 2, 2016 •

edited

Loading

enebo commented Nov 2, 2016

headius commented Nov 2, 2016

headius commented Nov 2, 2016

Use a single write for IO#puts #3183

Use a single write for IO#puts #3183

Conversation

cheald commented Jul 24, 2015

headius commented Nov 2, 2016

headius commented Nov 2, 2016 • edited Loading

enebo commented Nov 2, 2016

headius commented Nov 2, 2016

headius commented Nov 2, 2016

headius commented Nov 2, 2016 •

edited

Loading