Returns the KC normalization of the string by default. NFKC is considered
the best normalization form for passing strings to databases and
validations.
string - The string to perform normalization on.
form - The form you want to normalize in.
Should be one of the following: :c,:kc,:d, or
:kd. Default is
ActiveSupport::Multibyte.default_normalization_form.
# File activesupport/lib/active_support/multibyte/unicode.rb, line 275
def normalize(string, form=nil)
form ||= @default_normalization_form
# See http://www.unicode.org/reports/tr15, Table 1
codepoints = string.codepoints.to_a
case form
when :d
reorder_characters(decompose(:canonical, codepoints))
when :c
compose(reorder_characters(decompose(:canonical, codepoints)))
when :kd
reorder_characters(decompose(:compatibility, codepoints))
when :kc
compose(reorder_characters(decompose(:compatibility, codepoints)))
else
raise ArgumentError, "#{form} is not a valid normalization variant", caller
end.pack('U*')
end