Santekno/toolsCategoriesTutorials

Unicode Code Point Lookup

Inspect any character: codepoint, UTF-8 bytes, UTF-16 surrogates, HTML entity, JS escape.

Processed in your browserUpdated · Jan 2026
Input
0 chars
Output
0 chars

How to use Unicode Code Point Lookup

Paste your input on the left, choose the options you want, and the output appears instantly on the right. Everything runs in your browser — none of your data is sent to a server.

  • Paste or type your input in the INPUT panel
  • The output regenerates automatically as you type
  • Use Copy to put the result in your clipboard
  • Click Sample to load a working example

What is Unicode Code Point Lookup?

Unicode Lookup decomposes characters into their full description: Unicode codepoint (U+XXXX), decimal value, HTML numeric entity, UTF-8 byte sequence in hex, UTF-16 code units (with surrogate pair detection for supplementary plane), and JavaScript-compatible escape sequence (`\uXXXX` or `\u{XXXXX}` for ES2015+). In codepoint mode, enter `U+XXXX`, `0xXXXX`, or decimal to retrieve the character. This tool is part of santekno's developer toolbox — a curated collection of utilities built for engineers who care about speed, privacy, and simplicity.

Common use cases

  • Debugging API payloads and integration issues
  • Inspecting tokens, hashes, or encoded strings during development
  • Generating fixtures and sample data for tests
  • Sharing readable output with teammates in code reviews

FAQ

A code point is the abstract integer assigned to a character (U+1F389 = 🎉). A code unit is the storage unit of an encoding (UTF-16 stores 🎉 as 2 surrogate code units `D83C DF89`).