Enter/paste text to replace special characters with correct escape codes:
The Unicode code space is divided into seventeen planes of 2^16 (65,536) code points each. Some of these code points have not yet been assigned character values, some are reserved for private use, and some are permanently reserved as non-characters. The code points in each plane have the hexadecimal values
xy is a hex value from
10, signifying which plane the values belong to.
Differences between UCS-2 and UTF-16
Both UCS-2 and UTF-16 are character encodings for Unicode.
UCS-2 (2-byte Universal Character Set) produces a fixed-length format by simply using the code point as the 16-bit code unit. This produces exactly the same result as UTF-16 for the majority of all code points in the range from
0xFFFF (i.e. the BMP).
UTF-16 (16-bit Unicode Transformation Format) is an extension of UCS-2 that allows representing code points outside the BMP. It produces a variable-length result of either one or two 16-bit code units per code point. This way, it can encode code points in the range from