Unicorn Library: String Normalization

Unicode library for C++ by Ross Smith

This is a small module (really just one function), with the specific purpose of converting Unicode strings into the four standard normalization forms.

Normalization functions

The standard Unicode normalization forms.

Convert a string to one of the normalized forms. The normalize() function returns the normalized string, while normalize_in() updates the source string in place. As usual, these functions assume valid Unicode input, and will emit garbage if the input contains invalid UTF encoding.