User Rating 0.0
Total Usage 0 times
0 characters
Is this tool helpful?

Your feedback helps us improve.

About

Mistyped diacritical marks corrupt data silently. A name like Müller indexed as Muller fails database lookups, breaks sorting algorithms, and can invalidate legal documents. This tool provides the full Unicode Latin Extended set - over 300 precomposed characters covering acute (é), grave (è), circumflex (ê), tilde (ñ), umlaut (ü), cedilla (ç), caron (š), and other diacritics used across European, Turkic, and Vietnamese orthographies. Characters are precomposed (NFC normalized), not combining sequences, so they render consistently across systems.

Pro tip: precomposed characters (e.g., U+00E9 é) are safer for filenames, URLs, and databases than combining sequences (U+0065 + U+0301). This tool outputs only precomposed forms. Note: some rare diacritical combinations have no precomposed Unicode codepoint and require combining marks - those cases are outside this tool's scope.

accent characters diacritical marks unicode characters special characters text formatting accented letters

Formulas

This tool performs direct Unicode character insertion, not mathematical transformation. The core logic maps a base letter to its precomposed diacritical variants using a lookup dictionary.

lookup(base) { c1, c2, …, cn }

where base A - Z and each ci is a precomposed NFC codepoint. Text insertion uses cursor-position slicing:

result = text[0..cursor] + ci + text[cursor..end]

where cursor is the current selectionStart index. Clipboard operations use the async navigator.clipboard.writeText API with document.execCommand("copy") as fallback for older browsers.

Reference Data

Diacritic NameSymbolExampleLanguagesUnicode Block
Acute´é, áFrench, Spanish, Portuguese, Hungarian, CzechLatin-1 Supplement
Grave`è, àFrench, Italian, Portuguese, CatalanLatin-1 Supplement
Circumflex^ê, âFrench, Portuguese, Romanian, VietnameseLatin-1 Supplement
Tilde~ñ, ãSpanish, Portuguese, Estonian, VietnameseLatin-1 Supplement
Umlaut / Diaeresis¨ü, öGerman, Swedish, Finnish, Turkish, HungarianLatin-1 Supplement
Cedilla¸ç, şFrench, Portuguese, Turkish, CatalanLatin-1 Supplement / Extended-A
Caron / Háčekˇš, čCzech, Slovak, Slovenian, Croatian, LithuanianLatin Extended-A
Ring Above˚å, ůSwedish, Norwegian, Danish, CzechLatin-1 Supplement / Extended-A
Ogonek˛ą, ęPolish, Lithuanian, NavajoLatin Extended-A
Macron¯ā, ōLatvian, Māori, Japanese Rōmaji, HawaiianLatin Extended-A
Breve˘ă, ğRomanian, Turkish, VietnameseLatin Extended-A
Dot Above˙ż, ġPolish, Lithuanian, Maltese, TurkishLatin Extended-A
Double Acute˝ő, űHungarianLatin Extended-A
Stroke / Bar/ø, đDanish, Norwegian, Vietnamese, SamiLatin-1 Supplement / Extended-A
Horn˛ơ, ưVietnameseLatin Extended-B
Eth - ð, ÐIcelandic, Faroese, Old EnglishLatin-1 Supplement
Thorn - þ, ÞIcelandic, Old EnglishLatin-1 Supplement
Eszett / Sharp S - ß, GermanLatin-1 Supplement / Extended Additional
Ligature AE - æ, ÆDanish, Norwegian, Icelandic, Old EnglishLatin-1 Supplement
Ligature OE - œ, ŒFrenchLatin Extended-A

Frequently Asked Questions

A precomposed character like é (U+00E9) is a single codepoint. A combining sequence uses two codepoints: the base letter e (U+0065) followed by combining acute accent (U+0301). Both may render identically, but precomposed forms are safer for string comparison, database storage, filenames, and URL slugs. This tool outputs only precomposed (NFC) characters.
Your file or database column is likely encoded in ASCII or Latin-1 (ISO 8859-1) rather than UTF-8. Characters outside the encoding's range get replaced with ? or mojibake (e.g., é appearing as é). Ensure your storage uses UTF-8 encoding. In MySQL, use utf8mb4 charset - the older utf8 only covers the Basic Multilingual Plane.
Yes. Vietnamese requires stacking of tone marks and vowel diacritics. This tool includes precomposed Vietnamese characters such as ơ (o-horn), ư (u-horn), and combinations with acute, grave, hook above, tilde, and dot below from the Latin Extended Additional block (U+1EA0 - U+1EF9).
On Windows, use Alt codes (e.g., Alt+0233 for é) or enable the US-International keyboard layout where ' + e produces é. On macOS, press Option+E then E for é. On Linux, use Compose key sequences. All methods require memorization per character. This tool eliminates that overhead by providing visual browsing.
Yes. Most CMS systems strip or transliterate accented characters in URLs (e.g., é becomes e). However, in page content and metadata, using correct diacritics improves relevance for queries in French, Spanish, German, etc. Google treats cafe and café as related but distinct terms. Use accented forms in body text and titles; use transliterated ASCII in URLs.
Unicode defines four normalization forms. NFC (Canonical Decomposition + Canonical Composition) produces precomposed characters where possible. NFD decomposes them into base + combining marks. Two strings that look identical can fail equality checks if one is NFC and the other NFD. Database indexes, password hashing, and digital signatures can all break. This tool outputs NFC characters to minimize such issues.