Spec String#unsafe_byte_at #5500

chastell · 2018-01-01T12:23:24Z

I noticed the pattern of calling String#unsafe_byte_at followed by #unsafe_chr used in a few places, so this adds String#unsafe_chr_at.

Question: should String#unsafe_byte_at indeed return 0_u8 on an out-of-bounds index?

oprypin · 2018-01-01T12:44:15Z

Sorry for the harsh message below. Nothing personal, I just don't think that more time should be spent in this direction. But also feel free to ignore me.

So the behavior is:
Get the n-th byte, and take the Unicode character corresponding to its value. Don't mind that for most characters this does not work and returns arbitrary gibberish.
The reason you found a use for it in the standard library is specific optimizations, but replacing the existing explicit actions with this magical method doesn't make things clearer. There is no use for this otherwise; it's clutter at best.

There is no precedent for naming things as chr_at.
There is precedent for char_at which means something completely different.
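
To illustrate the point above, here is a minimal sketch (the sample string is an assumption chosen for illustration, not taken from the PR) of how turning a single byte into a character differs from char_at once the string contains a multi-byte character:

```crystal
# "é" is U+00E9, which UTF-8 encodes as the two bytes 0xC3 0xA9.
s = "héllo"

# Second byte of the string, read without any bounds check.
byte = s.unsafe_byte_at(1)

# Reinterprets the byte value as a code point: U+00C3, not the original "é".
from_byte = byte.unsafe_chr

# Decodes UTF-8 and returns the second character of the string.
from_char = s.char_at(1)

puts byte      # 195
puts from_byte # Ã
puts from_char # é
```

For pure ASCII input the two approaches agree, which is presumably why the byte-based shortcut works in the optimized standard library paths mentioned above.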

oprypin · 2018-01-01T12:49:34Z

As for your question: the behavior of String#unsafe_byte_at is undefined for an out-of-bounds index.

In practice, it returns whatever is in memory after the string (also note that there is one guaranteed 0 byte at the end of the string).
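
A small sketch of the one out-of-bounds read that is predictable, assuming a three-byte sample string (chosen here for illustration). It only touches the trailing 0 byte; reading any further would be undefined behavior and is deliberately not shown:

```crystal
s = "abc"

size = s.bytesize                # 3 bytes of content
checked = s.byte_at?(3)          # bounds-checked accessor: nil past the end
terminator = s.unsafe_byte_at(3) # unchecked: reads the trailing 0 byte
via_pointer = s.to_unsafe[3]     # same read through the raw pointer

puts size            # 3
puts checked.inspect # nil
puts terminator      # 0
puts via_pointer     # 0
```

This also shows why index bytesize can appear to "return 0_u8": it is the null terminator, not a defined result for out-of-bounds access in general.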

chastell · 2018-01-01T15:04:08Z

I didn’t find this harsh at all, I fully understand your reasoning!

I cut this PR down to speccing the (defined…) String#unsafe_byte_at behaviour.

RX14 · 2018-01-01T16:05:47Z

Not sure about this: all the other string specs heavily test this internal method, so it already has coverage. unsafe_byte_at should probably be protected...

asterite · 2018-01-01T18:21:04Z

I agree here. I don't think we need to add general unsafe methods, especially when they are not used a lot. At most they should be protected. The same goes for unsafe_byte_at: it should be protected or undocumented, since the same thing can easily be achieved with to_unsafe[i].

chastell · 2018-01-01T18:22:29Z

Welllll, I tried making String#unsafe_byte_at protected, but it’s also used in IO#gets, HTTP::Params.parse, URI.unescape and URI.unescape_one. 😄 But I do agree with your arguments, closing!

chastell · 2018-01-01T18:25:10Z

(Let me know if you think adjusting IO#gets, HTTP::Params.parse, URI.unescape and URI.unescape_one to use String#to_unsafe and making String#unsafe_byte_at protected would be welcome, I can happily do that.)

straight-shoota · 2018-01-01T18:29:18Z

What purpose does unsafe_byte_at serve as a dedicated method anyway? It's just an alias for to_unsafe[], which is actually shorter...

RX14 · 2018-01-01T18:39:25Z

unsafe_byte_{slice,string} should also be made either :nodoc: or protected. Heavily prefer the latter.

straight-shoota · 2018-01-01T19:11:57Z

unsafe_byte_slice_string is already protected and unsafe_byte_slice is only used in file.cr.

RX14 added kind:refactor topic:stdlib labels Jan 1, 2018

Spec String#unsafe_byte_at

6d6f920

chastell force-pushed the String#unsafe_chr_at branch from aa06f60 to 6d6f920 January 1, 2018 15:04

chastell changed the title from "Add (and use) String#unsafe_chr_at" to "Spec String#unsafe_byte_at" Jan 1, 2018

chastell closed this Jan 1, 2018

chastell deleted the String#unsafe_chr_at branch January 1, 2018 18:23

chastell mentioned this pull request Jan 1, 2018

Protect internal String methods #5503

Closed
