Mega Code Archive

 
Categories / Ruby / String
 

Encoding and bytesize

euro1 = "\u20AC"                     # Start with the Unicode Euro character puts euro1                           # Prints "&#x20AC;" euro1.encoding                       # => <Encoding:UTF-8> euro1.bytesize                       # => 3 euro2 = euro1.encode("iso-8859-15")  # Transcode to Latin-15 puts euro2.inspect                   # Prints "\xA4" euro2.encoding                       # => <Encoding:iso-8859-15> euro2.bytesize                       # => 1 euro3 = euro2.encode("utf-8")        # Transcode back to UTF-8 euro1 == euro3                       # => true