Sunbelt Computer Software

description: Normalizes each UTF-8 string in the input tensor using the specified rule.

text.normalize_utf8

Normalizes each UTF-8 string in the input tensor using the specified rule.

text.normalize_utf8(
    input, normalization_form='NFKC', name=None
)

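See http://unicode.org/reports/tr15/ (Unicode Standard Annex #15, Unicode Normalization Forms) for the definition of each normalization form.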

Examples:

>>> # input: <string>[num_strings]
>>> normalize_utf8(["株式会社", "ＫＡＤＯＫＡＷＡ"])
>>> # output: <string>[num_strings]
<tf.Tensor: shape=(2,), dtype=string, numpy=
array([b'\xe6\xa0\xaa\xe5\xbc\x8f\xe4\xbc\x9a\xe7\xa4\xbe', b'KADOKAWA'],
      dtype=object)>

Args
`input`	A `Tensor` or `RaggedTensor` of type string. (Must be UTF-8.)
`normalization_form`	One of the following string values ('NFC', 'NFKC', 'NFD', 'NFKD'). Default is 'NFKC'.
`name`	The name for this op (optional).
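
To make the four `normalization_form` values concrete, here is a minimal sketch that applies the same Unicode normalization forms with Python's standard-library `unicodedata` module. It does not call `text.normalize_utf8`. The helper name `normalize_all_forms` is hypothetical and exists only for this illustration. The sketch assumes the op follows the forms as defined in the report linked above.

```python
import unicodedata


def normalize_all_forms(s):
    """Return the UTF-8 bytes of `s` under each Unicode normalization form.

    Hypothetical helper for illustration; uses the standard library,
    not text.normalize_utf8.
    """
    forms = ("NFC", "NFKC", "NFD", "NFKD")
    return {form: unicodedata.normalize(form, s).encode("utf-8") for form in forms}


# Full-width Latin letters are compatibility characters: only the
# compatibility forms (NFKC, NFKD) fold them to ASCII.
fullwidth = "\uff2b\uff21\uff24\uff2f\uff2b\uff21\uff37\uff21"  # "ＫＡＤＯＫＡＷＡ"
result = normalize_all_forms(fullwidth)

# A precomposed accented letter shows the composed/decomposed split:
# NFC and NFKC keep one code point, NFD and NFKD split it into a base
# letter plus a combining accent.
accented = normalize_all_forms("\u00e9")  # "é"

for form in ("NFC", "NFKC", "NFD", "NFKD"):
    print(form, result[form], accented[form])
```

The compatibility forms are the ones that change the full-width string, which matches the `b'KADOKAWA'` output in the example above under the default `'NFKC'`. The canonical forms (`'NFC'`, `'NFD'`) leave the full-width letters as they are.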
