Add support for non-ASCII character references and encode them as UTF-8