Improve error messages for InvalidByteSequenceError #2814

jhass · 2016-06-12T16:28:52Z

No description provided.

asterite · 2016-06-12T16:37:40Z

It's a good idea, but I'm not sure we should do this. Try this benchmark:

require "benchmark"

io = MemoryIO.new
io << "foo bar"
io << "今日"
string = io.to_s

time = Time.now
a = 0
20_000_000.times do
  string.each_char do |char|
    a += char.ord
  end
end
puts a
puts Time.now - time

Before: 00:00:01.7733200
After: 00:00:02.1635580

Basically, doing string interpolation and other extra work, even when no exception is raised, keeps the optimizer from inlining some things and makes it generate more, slower code.

That's also the reason why Array#[] just raises IndexError without too much information.

jhass · 2016-06-12T16:39:14Z

Not being helpful is a terrible solution though. What if we make Exception.new take a block?

asterite · 2016-06-12T16:40:27Z

Hm, in fact, I tried adding @[AlwaysInline] in a few places in Char::Reader and the times improve. I get 0.85s now. However, without this change and also using @[AlwaysInline] I get 0.41s, so raising with more info is twice as slow. I'm not sure what we should do.

jhass · 2016-06-12T16:41:13Z

Actually, I'm not sure I even understand why it's slower. Unless the error condition is actually reached, that code shouldn't run, right?

asterite · 2016-06-12T16:46:39Z

Never mind. With @[AlwaysInline] I now get the same times. It was slow because I also had the chr and unsafe_chr change applied. Using unsafe_chr brings the performance back.

jhass · 2016-06-12T16:47:58Z

Cool, so merge and add the @[AlwaysInline] annotations afterwards? :)

And can we actually make Array#[] more verbose too?

asterite · 2016-06-12T16:58:34Z

You can try, yes, but run some benchmarks to make sure the performance doesn't degrade.

asterite · 2016-06-12T17:14:06Z

Let's merge this, then I'll rebase and merge #2816, and then add the attribute.

jhass · 2016-06-12T17:40:00Z

require "benchmark"

class Array
  @[AlwaysInline]
  def at_detail(index : Int)
    at(index) { raise IndexError.new("Index out of bounds: #{index} is bigger than #{size-1}") }
  end
end


a = [1, 2, 3]

Benchmark.ips do |x|
  x.report("Array#at(1)") { a.at(1) }
  x.report("Array#at_detail(1)") { a.at_detail(1) }
  x.report("Array#at(4)") { a.at(4) rescue nil }
  x.report("Array#at_detail(4)") { a.at_detail(4) rescue nil }
end

       Array#at(1) 219.24M (± 7.00%)    1.02× slower
Array#at_detail(1) 223.84M (±10.52%)         fastest
       Array#at(4) 132.95k (± 4.97%) 1683.59× slower
Array#at_detail(4) 121.14k (± 9.93%) 1847.74× slower

No significant difference in the average case. The verbose version is even a bit faster in this benchmark arrangement due to better cache locality; the results swap if the order of the benchmarks is swapped. It's all within the error margin anyway.

Do I still have to benchmark the other IndexError cases in array? :P

asterite · 2016-06-12T17:43:12Z

Looks really good! :-) 👍

Improve error messages for InvalidByteSequenceError

17f7165

asterite merged commit 5626e12 into crystal-lang:master Jun 12, 2016

jhass added the topic:stdlib label Jun 12, 2016

jhass deleted the better_utf8_errors branch June 16, 2016 23:59
