Shellescaped utf-8 string misbehaving in backticks #3046

mmustala · 2015-06-15T05:18:57Z

Just tried my app with JRuby 9.0.0.0.rc1 and noticed that shellescape is returning utf-8 characters with a couple of backslashes in front of them. And because of this the escaped string does not work with the backticks.

This is almost the same situation as in #2258 but now I'm using shellescape to sanitize the input.

#encoding: utf-8
filename = "neliö" # Any string with multibyte chars should do
File.write(filename, "content")
mime = `file -b --mime #{filename.shellescape}`
# => "ERROR: cannot open `neliÃ¶' (No such file or directory)\n"

In the 1.7 JRuby the shellescape was working without issues.

The text was updated successfully, but these errors were encountered:

mmustala · 2015-06-15T06:24:28Z

The shellescape method seems to be the same in 1.7.20.1 and 9.0.0.0.rc1. So it must be the backticks that is behaving differently and not treating the backslashes correctly.

headius · 2015-06-15T15:10:29Z

Our shellwords library is identical to that in MRI, so I think you're right...the problem isn't in shellwords.

However, I was unable to reproduce your issue with my HEAD version of JRuby 9k. What platform are you on? Do you have an unusual system encoding (i.e. non-UTF-8)?

mmustala · 2015-06-15T20:51:51Z

I just installed jruby-head with rvm and tested that it reproduces. My test file content was:

#encoding: utf-8
require 'shellwords'

filename = "neliö" # Any string with multibyte chars should do
File.write(filename, "content")
mime = `file -b --mime #{filename.shellescape}`
puts mime

This will output

ERROR: cannot open `neliÃ¶' (No such file or directory)

My environment is Ubuntu 14.04.

The same test file run with JRuby 1.7.20.1 outputs

text/plain; charset=us-ascii

headius · 2015-06-15T21:32:53Z

Can you show me your LANG env var please? I ran on OS X and your script works ok.

mmustala · 2015-06-15T21:34:29Z

LANG=en_US.UTF-8

headius · 2015-06-15T21:34:58Z

Ok, I'll give it a shot on Linux and try to reproduce.

mmustala changed the title ~~utf-8 characters misbehaving in shellescape~~ Shellescaped utf-8 string misbehaving in backticks Jun 15, 2015

headius added this to the JRuby 9.0.0.0.rc2 milestone Jun 15, 2015

enebo closed this as completed in c9c2390 Jul 1, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GitHub Sponsors

Shellescaped utf-8 string misbehaving in backticks #3046

Shellescaped utf-8 string misbehaving in backticks #3046

mmustala commented Jun 15, 2015

mmustala commented Jun 15, 2015

headius commented Jun 15, 2015

mmustala commented Jun 15, 2015

headius commented Jun 15, 2015

mmustala commented Jun 15, 2015

headius commented Jun 15, 2015

Shellescaped utf-8 string misbehaving in backticks #3046

Shellescaped utf-8 string misbehaving in backticks #3046

Comments

mmustala commented Jun 15, 2015

mmustala commented Jun 15, 2015

headius commented Jun 15, 2015

mmustala commented Jun 15, 2015

headius commented Jun 15, 2015

mmustala commented Jun 15, 2015

headius commented Jun 15, 2015