KOI8-R Character Table & Reference
Comprehensive KOI8-R encoding reference (RFC 1489). View the full 256-character map, phonetic bit-stripping logic, and hex/decimal values for legacy Russian Unix systems.
Name: Select a character
Category: -
Bit Strip Equivalent: -
The bit-strip equivalent shows the character resulting from Byte & 0x7F.
About
System administrators and digital archivists frequently encounter KOI8-R when recovering data from early Unix systems or analyzing pre-2000 Russian internet traffic. This 8-bit character encoding (defined in RFC 1489) assigns the upper 128 bytes to Cyrillic characters and box-drawing symbols. Unlike ISO-8859-5 or Windows-1251, the KOI8-R layout is not alphabetical. The designers arranged Cyrillic letters to map onto phonetically similar ASCII characters if the eighth bit is stripped.
Data corruption often occurs when 8-bit text passes through a 7-bit gateway (such as early email servers). In such cases, the phonetic property of KOI8-R acts as a fail-safe. The text degrades into a readable transliteration rather than random garbage. This tool provides a complete lookup table for debugging encoding errors (mojibake) and verifying byte sequences in hex editors.
Formulas
The core design principle of KOI8-R involves a bitwise relationship between the Cyrillic upper range and the Latin lower range. If the most significant bit (MSB) is dropped, the value correlates to a phonetic equivalent.
For example, the Cyrillic "Russian a" occupies a specific index. Applying the mask yields the Latin "a".
x ∧ 011111112 = 0x41 (ASCII 'A')
Reference Data
| Decimal | Hex | Binary | Char | Description |
|---|---|---|---|---|
| 193 | 0xC1 | 11000001 | a | CYRILLIC SMALL LETTER A |
| 194 | 0xC2 | 11000010 | б | CYRILLIC SMALL LETTER BE |
| 225 | 0xE1 | 11100001 | A | CYRILLIC CAPITAL LETTER A |
| 226 | 0xE2 | 11100010 | Б | CYRILLIC CAPITAL LETTER BE |
| 128 | 0x80 | 10000000 | ─ | BOX DRAWINGS LIGHT HORIZONTAL |
| 154 | 0x9A | 10011010 | NO-BREAK SPACE |