` becomes `<script>alert(1)</script>`, harmless text. **But context matters**. Inside a JavaScript string, an HTML attribute value, a URL, a CSS expression, each needs its own encoding. Modern frameworks (React, Vue, Svelte) handle this automatically when you use JSX or template syntax. **Manual encoding is only safe** for content you are putting directly into HTML text nodes. For anything more complex use a library like DOMPurify."}},{"@type":"Question","name":"When should I use a named entity like © versus a number like ©?","acceptedAnswer":{"@type":"Answer","text":"**Use named entities when you want humans to read the source.** `©` is obvious, `©` is a riddle. Named entities also age better: `€` is more readable than `€` in a CSS file you revisit five years later. **Use numeric entities when**: the target system does not parse named entities (some email templating engines, custom XML parsers, very old environments), you are not sure which named form is canonical (`'` versus `'`), or your file must be **strictly ASCII-only** (some old build pipelines). For modern HTML in modern browsers either form works identically."}},{"@type":"Question","name":"Why does the tool encode the apostrophe as ' instead of '?","acceptedAnswer":{"@type":"Answer","text":"**Compatibility.** `'` is officially part of HTML5 and works in every modern browser, but it was **not in HTML4** and Internet Explorer up to version 8 does not recognize it (those browsers render the literal text \"'\"). **\\'** is the numeric reference for the same character and works in **every browser ever made**, all the way back to Netscape 1.0. The difference matters for emails (where ancient rendering engines still show up), legacy intranet apps, and any HTML you cannot fully control. Cost of the safer choice: two extra characters per apostrophe."}},{"@type":"Question","name":"My text has nested entities like < and they decoded only once. 
Why?","acceptedAnswer":{"@type":"Answer","text":"Because **that is the correct behavior.** `<` is the encoded form of the literal text `<`. Decoding it once gives you `<`, which is the right answer: someone wanted to display the text `<` literally on a page and encoded it correctly. **Decoding twice** would give you `<`, which would erase the original author's intent. **If you want to fully decode nested encoding** (rare, usually a bug somewhere upstream), run the output through the decoder again manually. The tool deliberately does not auto-loop, otherwise it could break correctly encoded content."}},{"@type":"Question","name":"Can I encode JavaScript or JSON safely with this?","acceptedAnswer":{"@type":"Answer","text":"**No, and you should not try.** HTML entities are for HTML context only. **For JavaScript strings** use `JSON.stringify()` or backslash escaping (\\\\u00E9 for é). **For URLs** use `encodeURIComponent()` or our [URL encoder](/en/url-encoder). For binary or auth-header payloads, use the [Base64 encoder](/en/base64-encoder). **For CSS** use the hex backslash form (\\\\E9 for é). **Common mistake**: putting HTML-encoded text inside a JavaScript string literal: `var x = \"<div>\"` does NOT contain `

`, it contains the literal eight-character string `<div>`. Different contexts, different encoders."}},{"@type":"Question","name":"Why are some characters like emoji or Asian text encoded as a pair of numbers?","acceptedAnswer":{"@type":"Answer","text":"Because **emoji and many Asian characters live above the basic Unicode plane**, they use two **UTF-16 code units** internally in JavaScript. When the tool encodes them as numeric references, it correctly emits the full **codepoint** (one number, not two). **Example**: 🎉 encodes as `🎉` (a single codepoint), not two separate references. **Decoding** reverses this perfectly. If you see weird two-number sequences in your input, it means whoever encoded it earlier did NOT handle surrogate pairs correctly, run the input through Decode and the tool will repair it."}},{"@type":"Question","name":"Does this work offline? Is my text uploaded anywhere?","acceptedAnswer":{"@type":"Answer","text":"**Everything runs in your browser.** No server calls, no analytics on the input, no upload. You can open the **Network tab** in DevTools and confirm: zero requests when you type. Safe for **sensitive content**: legal docs, internal templates, customer data. **Works offline** too: once the page is loaded the encoding logic is fully client-side JavaScript, you can disconnect Wi-Fi and keep working. **The entity reference table** (top 30 named entities) is embedded directly in the page bundle."}},{"@type":"Question","name":"Why does my non-breaking space ( ) decode to a regular space?","acceptedAnswer":{"@type":"Answer","text":"**It does not, it decodes to a non-breaking space.** They look identical but they are different characters: regular space is **U+0020**, non-breaking space is **U+00A0**. Copy the decoded output and paste it into a Unicode inspector and you will see U+00A0. **Browsers render them the same way** which is why it looks like a regular space. 
**Important**: non-breaking spaces in HTML source can break layout calculations, regex `\\s` matches them but `\" \"` (literal space) does not. If you want to **convert them to real spaces** after decoding, run the output through our find and replace tool."}},{"@type":"Question","name":"What is the difference between A and A?","acceptedAnswer":{"@type":"Answer","text":"**Same character, different number bases.** `A` is the **decimal** numeric reference for codepoint 65 (the letter A). `A` is the **hexadecimal** form of the same number (0x41 = 65 in decimal). Both decode to 'A'. **Why both exist**: hex is more compact for high codepoints (`🎉` versus `🎉` for 🎉), and matches the way Unicode codepoints are usually written in documentation (U+1F389). Decimal is older and more readable for ASCII range. **Browsers accept either**, decode is identical. **For encoding**: this tool emits decimal by default, that is the more common convention in real-world HTML."}}]}

HTML Entities Encoder

Encode <, >, & and other special characters as HTML entities, or decode entities back into plain text.

Encodes only the 5 HTML-significant chars: & < > " '. Accented characters stay as-is.

Example output (Strict mode):

&#39;&lt;script&gt;&#39;alert(&quot;xss&quot;)&#39;&lt;/script&gt;&#39;
Price: 5 € (10% off), Café Joe&#39;s

What are HTML entities and when do I need them?

Some characters mean something special to a browser. Type a literal less-than sign in your CMS and the browser thinks a tag is starting. Paste an ampersand into a URL and a different parameter takes over. The fix is HTML entities: short codes like `&lt;` or `&amp;` that show the actual character without confusing the parser.

This tool converts text in both directions: plain text into entities (Encode) and entities back into plain text (Decode). Encode has three modes: Strict handles only the five characters that change HTML's meaning; Named also uses friendly names like `&copy;` for ©; Numeric all turns every non-ASCII character into a number (`&#233;` for é). Decode handles all three formats and the full named entity table.

Everything runs in your browser: nothing leaves the page, no sign-up, no quota. Pinned at the bottom is a reference table of the 30 most-used named entities; click any row to copy it.

How to use it

Pick a direction: Encode (text into entities) or Decode (entities back into text). Toggle bar at the top of the page.
If you picked Encode, choose a mode: Strict (just the five HTML-breaking characters), Named (use `&copy;`, `&euro;` and friends), Numeric all (every non-ASCII character as `&#NNN;`).
Paste your text on the left. The result appears on the right instantly: no Run button, no delay.
Hit Copy to grab the output. Or Swap to flip the result back through the opposite direction (useful for round-trip testing).
Need a specific entity? Open the reference table at the bottom (the top 30 named entities), click any row to copy that exact entity token.
Decode handles everything: named entities (`&copy;`), decimal numeric (`&#65;`), hex numeric (`&#x41;`). Mixed input works too.
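
The Decode behavior described in the steps above can be reproduced outside the browser. The following is a minimal sketch in Python, assuming the standard library's `html.unescape` as a stand-in for the tool's decoder (the tool's own implementation is not shown on this page). Note that Python's `html.escape` writes the apostrophe as `&#x27;` rather than `&#39;`; both decode to the same character.

```python
import html

# Mixed input: a named entity, a decimal reference, a hex reference.
mixed = "&copy; 2024 &#65;&#x41; caf&eacute; &lt;b&gt;"
decoded = html.unescape(mixed)
print(decoded)  # © 2024 AA café <b>

# Round trip: encode the five HTML-significant characters, then decode.
original = "Tom & \"Jerry\" <3 it's"
encoded = html.escape(original, quote=True)
round_trip = html.unescape(encoded)
print(round_trip == original)  # True

# A doubly encoded entity is decoded one level per call.
once = html.unescape("&amp;lt;")
print(once)  # &lt;
```

Decoding one level at a time preserves text that was deliberately encoded to show an entity literally; call the decoder a second time only when the input is known to be double-encoded.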

When this is useful

Six situations where this tool saves you from a broken page or a security hole:

Showing code samples on a blog. You want to display `<div class="foo">` as literal text in a tutorial. Without encoding the browser eats the tag and you get an empty box. Encode in Strict mode turns the angle brackets into `&lt;` and `&gt;` so the markup shows as text.
Pasting content from Word or Google Docs into a CMS. The doc has smart quotes (`“ ”`), em dashes (`—`), non-breaking spaces. Some CMSs choke on these. Encode in Named mode swaps them for readable entities like `&ldquo;` and `&mdash;` that every system handles.
Stopping a basic XSS attempt. A user submits `<script>alert("xss")</script>` to your comment form. Display it raw and the script runs in every visitor's browser. Encode in Strict mode turns it into harmless text. Real defense needs more than this, but encoding output is step one.
Cleaning up old HTML emails or scraped pages. You copy text from an old newsletter and it is full of `&eacute;` and `&#8230;` artifacts. Decode turns them back into proper é and …, ready to paste into a fresh document.
Sending email content through legacy systems. Some older mail servers and templating engines strip non-ASCII characters. Encode in Numeric all mode turns é into `&#233;`, which survives the trip and renders correctly in any mail client.
Building an iframe srcdoc attribute. The HTML inside `srcdoc="..."` needs its quotes escaped or the whole thing breaks. Encode in Strict mode is exactly what you need: it touches only the characters the parser cares about, leaves the rest readable.
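
Two of the situations above, the comment-form XSS attempt and the `srcdoc` attribute, come down to the same Strict-style escaping. Here is a rough sketch in Python using the standard library's `html.escape`; the helper names `render_comment` and `iframe_with_srcdoc` are hypothetical and exist only for this illustration.

```python
import html


def render_comment(user_text: str) -> str:
    """Wrap untrusted text in a paragraph, escaping & < > " ' first."""
    return "<p>" + html.escape(user_text, quote=True) + "</p>"


def iframe_with_srcdoc(inner_html: str) -> str:
    """Build an iframe whose srcdoc attribute carries inner_html.

    The inner markup is escaped so its quotes cannot end the attribute early;
    the browser decodes the entities when it reads the attribute value.
    """
    return '<iframe srcdoc="' + html.escape(inner_html, quote=True) + '"></iframe>'


payload = '<script>alert("xss")</script>'
print(render_comment(payload))
# <p>&lt;script&gt;alert(&quot;xss&quot;)&lt;/script&gt;</p>

print(iframe_with_srcdoc('<p class="note">Tom & Jerry</p>'))
# <iframe srcdoc="&lt;p class=&quot;note&quot;&gt;Tom &amp; Jerry&lt;/p&gt;"></iframe>
```

This covers HTML text nodes and quoted attribute values only. Text placed inside JavaScript, URLs, or CSS needs the encoder for that context instead.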

Questions and answers

Strict only escapes the five characters that have a special meaning in HTML: < > & " '. Everything else passes through unchanged. Use this for most everyday HTML, code samples, comment forms, blog content.
Named does the same five plus swaps non-ASCII characters for friendly named entities where one exists: © becomes `&copy;`, € becomes `&euro;`. Falls back to numeric for characters without a named form. Use this when you want human-readable entities.
Numeric all escapes the five specials AND every non-ASCII character as a decimal number: © becomes `&#169;`, € becomes `&#8364;`. Use this for maximum compatibility with legacy systems that do not trust named entities.
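
The three modes can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the tool's actual code: the function name `encode` and the mode strings are hypothetical, and the named-entity lookup uses Python's `html.entities.codepoint2name` (the HTML 4 entity set), which may differ from the table the tool ships.

```python
from html.entities import codepoint2name

# The five HTML-significant characters; the apostrophe uses the numeric form.
SPECIALS = {"&": "&amp;", "<": "&lt;", ">": "&gt;", '"': "&quot;", "'": "&#39;"}
MODES = ("strict", "named", "numeric")


def encode(text: str, mode: str = "strict") -> str:
    """Encode text as HTML entities in one of three hypothetical modes."""
    if mode not in MODES:
        raise ValueError(f"unknown mode: {mode!r}")
    out = []
    for ch in text:  # Python iterates by code point, so emoji stay whole
        cp = ord(ch)
        if ch in SPECIALS:
            out.append(SPECIALS[ch])
        elif cp < 128 or mode == "strict":
            out.append(ch)  # ASCII always passes; strict leaves the rest alone
        elif mode == "named" and cp in codepoint2name:
            out.append(f"&{codepoint2name[cp]};")
        else:
            out.append(f"&#{cp};")  # numeric mode, or named-mode fallback
    return "".join(out)


sample = "Price: 5 € (10% off), Café Joe's"
print(encode(sample, "strict"))   # Price: 5 € (10% off), Café Joe&#39;s
print(encode("© €", "named"))     # &copy; &euro;
print(encode("© € é", "numeric")) # &#169; &#8364; &#233;
print(encode("🎉", "named"))      # &#127881;
```

The last line shows the fallback: 🎉 has no named entity, so Named mode emits a single decimal reference for the full code point.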
