Knowledge (XXG)

UTF-7

Source 📝

4698: 2654: 2643: 561:, and which to place in blocks of Unicode characters. The expansion cost of UTF-7 can be high: for example, the character sequence U+10FFFF U+0077 U+10FFFF is 9 bytes in UTF-8, but 17 bytes in UTF-7. (At worst, treating every codepoint as a sequence in its own right produces the maximum expansion of 5x, e.g. when encoding 703:
UTF-7 allows multiple representations of the same source string. In particular, ASCII characters can be represented as part of Unicode blocks. As such, if standard ASCII-based escaping or validation processes are used on strings that may be later interpreted as UTF-7, then Unicode blocks may be used
345:
Space, tab, carriage return and line feed may also be represented directly as single ASCII bytes. However, if the encoded text is to be used in e-mail, care is needed to ensure that these characters are used in ways that do not require further content transfer encoding to be suitable for e-mail. The
307:
has been made obsolete by RFC 2152, an informational RFC which never became a standard. As RFC 2152 clearly states, the RFC "does not specify an Internet standard of any kind". Despite this, RFC 2152 is quoted as the definition of UTF-7 in the IANA's list of charsets. Neither is UTF-7 a Unicode
675:
A byte order mark (BOM) is an optional special byte sequence at the very start of a stream or file that, without being data itself, indicates the encoding used for the data that follows; it can be used in the absence of metadata that denotes the encoding. For a given encoding scheme, it's that
342:). Using the optional direct characters reduces size and enhances human readability but also increases the chance of breakage by things like badly designed mail gateways and may require extra escaping when used in encoded words for header fields. 633:
es followed by a dash) and nonempty Unicode blocks as mentioned in the description section. Once this is done, each Unicode block must be decoded with the following procedure (using the result of the encoding example above as our example)
196:
then it can (and probably does) encode each surrogate half as though it was a 16-bit code point, and thus can encode all code points. It is unclear if other UTF-7 software (such as translators to UTF-32 or UTF-8) support this.
273:
UTF-7 is generally not used as a native representation within applications as it is very awkward to process. Despite its size advantage over the combination of UTF-8 with either quoted-printable or Base64, the now defunct
270:, UTF-7 was designed to avoid using the = sign as an escape character to avoid double escaping when it is combined with quoted-printable (or its variant, the RFC 2047/1522 "Q"-encoding of headers). 4316: 956:
characters, except for "&", represent themselves…. The character "&" (0x26) is represented by the two-octet sequence "&-". All other characters… are represented in modified BASE64….
734:
UTF-7 is considered obsolete, at least for Microsoft software (.NET), with code paths previously supporting it intentionally broken (to prevent security issues) in .NET 5, in 2020.
3161: 127: 2998: 315:
Some characters can be represented directly as single ASCII bytes. The first group is known as "direct characters" and contains 62 alphanumeric characters and 9 symbols:
319:. The direct characters are safe to include literally. The other main group, known as "optional direct characters", contains all other printable characters in the range 1546: 1488: 258:, but still must be explicitly identified as the text character set. In addition, if used within e-mail headers such as "Subject:", UTF-7 must be contained in MIME 3106: 704:
to slip malicious strings past them. To mitigate this problem, systems should perform decoding before validation and should avoid attempting to autodetect UTF-7.
3181: 2695: 2611: 1101: 683:
While it's typically a single, fixed byte sequence, in UTF-7 four variations may appear, because the last 2 bits of the 4th byte of the UTF-7 encoding of
2596: 120: 381:) then it is consumed by the decoder and decoding resumes with the next character. Otherwise decoding resumes with the character after the Base64. 4725: 4408: 4162: 2616: 1406: 291:
e-mail retrieval protocol, version 4 rev 1, for "international" mailbox names. The following version, IMAP version 4 rev 2, uses UTF-8 instead.
4398: 1391: 652:
If there is an incomplete group at the end containing only zeros, discard it (if the incomplete group contains any ones, the code is invalid):
4147: 3101: 1635: 1314: 4281: 1630: 766: 691:
character, resulting in 4 possible bit patterns and therefore 4 different possible bytes in the 4th position. See the UTF-7 entry in the
113: 4675: 4186: 3989: 2733: 1201: 1164: 312:
only lists UTF-8, UTF-16 and UTF-32. There is also a modified version, specified in RFC 2060, which is sometimes identified as UTF-7.
4231: 3847: 3842: 3345: 3176: 2688: 3265: 1463: 1094: 288: 4483: 4418: 4172: 4152: 1468: 1383: 1284: 743: 4236: 2829: 2723: 373:
sign. The end is indicated by any character not in the modified Base64 set. If the character after the modified Base64 is a
1062:"ArticleUtf7 - doctype-mirror - UTF-7: the case of the missing charset - Mirror of Google Doctype - Google Project Hosting" 4350: 4321: 3971: 1765: 1579: 1563: 1526: 1373: 1309: 1129: 228: 181: 147: 52: 4413: 4301: 4261: 2681: 2504: 1740: 569:.) Each Unicode sequence must be encoded using the following procedure, then surrounded by the appropriate delimiters. 4720: 4655: 4266: 4182: 4167: 4071: 3984: 3956: 3922: 2623: 1551: 1351: 1087: 204:. It is known to have security issues, which is why software has been changed to disable its use. It is prohibited in 3932: 3927: 4629: 4574: 4495: 4276: 4271: 3280: 1790: 1625: 1620: 1269: 1174: 254:
Provided certain rules are followed during encoding, UTF-7 can be sent in e-mail without using an underlying MIME
4336: 4291: 4127: 3676: 3380: 3325: 3290: 2228: 1505: 1214: 3851: 3360: 3340: 3335: 3275: 3270: 2779: 827: 275: 248: 185: 64: 4701: 4685: 4612: 4607: 4569: 4540: 4505: 3937: 3671: 3370: 3255: 2658: 2559: 1915: 1820: 1169: 1157: 60: 4296: 4286: 4142: 4132: 3666: 3375: 2820: 2807: 2743: 2248: 1995: 1850: 1785: 1500: 1368: 1274: 1233: 809: 4473: 4311: 4246: 4122: 3661: 2815: 2444: 2198: 2193: 2070: 1483: 1248: 3691: 235:. Therefore, a non-trivial content transfer encoding has to be applied in case of doubt. Unfortunately 4634: 4306: 4196: 4066: 3686: 2429: 2374: 2343: 2263: 1985: 1950: 1830: 1319: 996: 895: 712: 304: 4107: 247:
produces a very size-inefficient format requiring 6–9 bytes for non-ASCII characters from the
4589: 4216: 3701: 3586: 3576: 3571: 2474: 2469: 2384: 2328: 1694: 1689: 1674: 1473: 1458: 1363: 1299: 1289: 1279: 659:
Each group of 16 bits is a character's Unicode (UTF-16) number and can be expressed in other forms:
223:
using byte values above the ASCII range. Although MIME allows encoding the message body in various
2494: 4670: 4518: 4331: 4326: 4251: 3250: 3224: 2748: 2704: 2575: 2464: 2434: 2414: 2050: 2030: 1780: 1346: 1223: 1219: 1124: 220: 201: 791: 4660: 4599: 4579: 4241: 4221: 4201: 3829: 3305: 3285: 2797: 2544: 2454: 2439: 2308: 2278: 2243: 2055: 1895: 1720: 1531: 1243: 1184: 988: 849: 708: 284:
has also been introduced, which reduces the need to encode message bodies in a 7-bit format.
4617: 4191: 4157: 3867: 3696: 2647: 2606: 2499: 2449: 2338: 2298: 2223: 2213: 2203: 2105: 2075: 2060: 1965: 1940: 1815: 1795: 1664: 1655: 1541: 1294: 1253: 978: 885: 263: 244: 174: 38: 4665: 4584: 3315: 3310: 3300: 3245: 2930: 2920: 2915: 2910: 2905: 2900: 2895: 2601: 2554: 2539: 2479: 2399: 2364: 2359: 2303: 2293: 2283: 2233: 2100: 2090: 2085: 2035: 2005: 1875: 1865: 1825: 1725: 1640: 1615: 1304: 1209: 1179: 692: 553:
First, an encoder must decide which characters to represent directly in ASCII form, which
425: 1810: 4117: 4112: 4102: 4097: 4092: 4087: 4051: 4046: 4039: 4034: 4029: 4024: 4019: 4014: 4009: 4004: 3999: 3994: 3862: 3819: 3814: 3809: 3804: 3799: 3794: 3789: 3784: 3779: 3774: 3769: 3764: 3759: 3754: 3749: 3656: 3651: 3646: 3641: 3636: 3631: 3626: 3621: 3616: 3611: 3606: 3601: 3385: 3365: 2970: 2890: 2885: 2880: 2875: 2870: 2865: 2860: 2855: 2850: 2718: 2509: 2489: 2409: 2389: 2379: 2288: 2135: 2065: 2040: 2020: 1975: 1960: 1935: 1885: 1860: 1805: 1760: 1061: 1037: 4714: 4437: 3857: 3744: 3739: 3734: 3729: 3724: 3719: 3596: 3591: 3581: 3566: 3561: 3556: 3551: 3546: 3541: 3536: 3531: 3526: 3521: 3516: 3511: 3506: 3501: 3496: 3491: 3486: 3481: 3476: 3471: 3466: 3461: 3456: 3451: 3446: 3441: 3436: 3431: 3426: 3421: 3416: 3411: 3406: 3401: 3320: 3295: 3260: 3219: 2965: 2529: 2514: 2394: 2333: 2218: 2160: 2155: 2150: 2120: 2095: 2045: 1945: 1930: 1920: 1770: 1715: 1558: 1356: 1152: 1012:
In IMAP4rev2, mailbox names are encoded in Net-Unicode (this differs from IMAP4rev1).
597:
0000 0000 1010 0011 and 0010 0000 0010 0000 → 0000 0000 1010 0011 0010 0000 0010 0000
224: 74: 4457: 4452: 4447: 4442: 4177: 3917: 3912: 3907: 3902: 3897: 3892: 3887: 3882: 3877: 3872: 3355: 3350: 3330: 3214: 3206: 2839: 2524: 2258: 2253: 2208: 2110: 2025: 2015: 1970: 1925: 1900: 1840: 1755: 1735: 1730: 1710: 1589: 1536: 378: 648:
000000 001010 001100 100000 001000 000000 → 0000000010100011 0010000000100000 0000
243:
characters unreadable in non-MIME clients. On the other hand, UTF-8 combined with
611:
000000 001010 001100 100000 001000 00 → 000000 001010 001100 100000 001000 000000
2772: 2755: 2580: 2419: 2404: 2318: 2188: 2165: 2130: 2000: 1980: 1955: 1775: 1684: 1238: 1228: 1008: 999: 968: 947: 943: 939: 935: 931: 927: 923: 919: 915: 911: 907: 898: 879: 629:
First an encoded data must be separated into plain ASCII text chunks (including
369:. The start of these blocks of modified Base64-encoded UTF-16 is indicated by a 232: 56: 42: 604:
0000 0000 1010 0011 0010 0000 0010 0000 → 000000 001010 001100 100000 001000 00
4622: 4563: 4530: 4383: 4061: 3156: 3126: 3121: 3116: 3111: 3076: 2960: 2955: 2945: 2940: 2738: 2728: 2459: 1910: 1800: 1436: 1144: 421: 89: 992: 365:(hence U+10000 and higher would be encoded into two surrogates), and then in 4510: 4488: 4393: 4206: 3235: 3166: 3146: 3141: 3066: 3061: 2323: 2238: 2170: 1905: 1679: 1599: 1594: 1495: 1478: 906:
sec. 5.1.3 "Mailbox International Naming Convention". Obsoleted by RFC 
339: 866:
Store mailbox names on disk using UTF-8 instead of modified UTF-7 (mUTF-7).
711:
can be tricked into interpreting the page as UTF-7. This can be used for a
428:
as in the table below. There are two bits left over, which are padded to 0.
4680: 4535: 4500: 4478: 4388: 4211: 3151: 3136: 3096: 3091: 3086: 3071: 3030: 3025: 3020: 3015: 3010: 3005: 2802: 2792: 2788: 2762: 2549: 2519: 2369: 2354: 2349: 2140: 1990: 1890: 1870: 1835: 1745: 1584: 1401: 953: 281: 163: 103: 4550: 4346: 4256: 4137: 3711: 3081: 3056: 3046: 2784: 2273: 2268: 2145: 2080: 2010: 1750: 1134: 1110: 1023: 645:
Regroup the binary into groups of sixteen bits, starting from the left:
321: 262:
identifying the character set. Since encoded words force use of either
159: 151: 78: 2673: 158:
characters. It was originally intended to provide a means of encoding
4555: 4545: 4523: 4403: 4378: 4373: 4226: 4056: 3947: 3837: 3196: 3186: 3171: 2988: 2534: 2313: 2125: 2115: 1845: 1431: 1426: 1396: 983: 890: 366: 362: 338:
being excluded due to being redefined in "variants of ASCII" such as
287:
A modified form of UTF-7 (sometimes dubbed 'mUTF-7') was used in the
267: 236: 205: 193: 192:
and many other characters). However if a UTF-7 translator is to/from
189: 166: 150:) is an obsolete variable-length character encoding for representing 835: 601:
Regroup the binary into groups of six bits, starting from the left:
231:, the main E-mail transfer standard) is still not guaranteed to be 4650: 4368: 4358: 3975: 3681: 3191: 3131: 2993: 2767: 2628: 2484: 2424: 1880: 1855: 1510: 1421: 1411: 299:
UTF-7 was first proposed as an experimental protocol in RFC 1642,
240: 227:(broader than ASCII), the underlying transmission infrastructure ( 170: 155: 99: 1038:"Clarify guidance for use of a BOM as a UTF-8 encoding signature" 3961: 3051: 974: 259: 255: 216: 2677: 1653: 1083: 572:
Using the £† (U+00A3 U+2020) character sequence as an example:
4428: 608:
If the last group has fewer than six bits, add trailing zeros:
219:, the modern standard for e-mail formats, forbids encoding of 144: 615:
Replace each group of six bits with a respective Base64 code:
188:(the first 65536 Unicode code points, which does not include 731:
in UTF-7, which most validators let through as simple text.
638:
Express each Base64 code as the bit sequence it represents:
577:
Express the character's Unicode numbers (UTF-16) in binary:
1079: 169:
messages that was more efficient than the combination of
184:", as the definition can only encode code points in the 970:
Internet Message Access Protocol (IMAP) - Version 4rev2
4643: 4598: 4466: 4427: 4345: 4080: 3970: 3946: 3828: 3710: 3394: 3233: 3205: 3039: 2981: 2838: 2711: 2589: 2568: 2179: 1703: 1663: 1608: 1572: 1519: 1449: 1382: 1339: 1332: 1262: 1200: 1193: 1143: 1117: 335: 331: 95: 85: 70: 48: 34: 26: 641:AKMgIA → 000000 001010 001100 100000 001000 000000 618:000000 001010 001100 100000 001000 000000 → AKMgIA 828:"Using International Characters in Internet Mail" 200:UTF-7 has never been an official standard of the 4630:Unicode control, format and separator characters 881:INTERNET MESSAGE ACCESS PROTOCOL - VERSION 4rev1 856:. 8 February 2023. Sec. "Mail Location Settings" 767:"Breaking change: UTF-7 code paths are obsolete" 1007:sec. 5.1. "Mailbox Naming". Obsoletes RFC  967:Melnikov, Alexey; Leiba, Barry (August 2021). 676:scheme's representation of Unicode code point 2689: 1095: 251:and 12 bytes for characters outside the BMP. 121: 8: 301:A Mail-Safe Transformation Format of Unicode 21: 2696: 2682: 2674: 2597:Cultural, political, and religious symbols 1650: 1336: 1197: 1102: 1088: 1080: 128: 114: 982: 889: 761: 759: 536: 533: 530: 522: 519: 516: 446: 443: 440: 437: 431: 1130:ISO/IEC 10646 (Universal Character Set) 1024:"FAQ – UTF-8, UTF-16, UTF-32 & BOM" 755: 289:Internet Message Access Protocol (IMAP) 180:UTF-7 (according to its RFC) isn't a " 20: 7: 1631:International Components for Unicode 1580:Common Locale Data Repository (CLDR) 361:Other characters must be encoded in 544:Algorithm for encoding and decoding 4040:Norwegian and Danish (alternative) 2612:Mathematical operators and symbols 662:0000 0000 1010 0011 ≡ 0x00A3 ≡ 163 420:". The Unicode code point for the 239:has a disadvantage of making even 14: 693:table of Unicode byte order marks 655:0000000010100011 0010000000100000 594:Concatenate the binary sequences: 4697: 4696: 2653: 2652: 2642: 2641: 2624:Phonetic symbols (including IPA) 4484:Digital encoding of APL symbols 4419:Comparison of Unicode encodings 2937:Proposed but not approved 834:. 1 August 1998. Archived from 744:Comparison of Unicode encodings 4726:Unicode Transformation Formats 810:"12.2.3.3 Character encodings" 792:"8.2.2.3. Character encodings" 424:is U+00A3 which converts into 1: 1564:International Ideographs Core 1374:International Ideographs Core 1315:Alias names and abbreviations 952:In modified UTF-7, printable 278:recommended against its use. 182:Unicode Transformation Format 148:Unicode Transformation Format 53:Unicode Transformation Format 1786:CJK Unified Ideographs (Han) 1636:People involved with Unicode 878:Crispin, Mark (March 2003). 587:0x2020 → 0010 0000 0010 0000 582:0x00A3 → 0000 0000 1010 0011 4656:Character encodings in HTML 3990:National Replacement (NRCS) 3957:Japanese language in EBCDIC 317:' ( ) , - . / : ? 4742: 1626:Ideographic Research Group 1621:ConScript Unicode Registry 330:and space (the characters 4694: 2637: 1649: 1506:Regional indicator symbol 1215:Combining grapheme joiner 884:. Network Working Group. 449: 320: 109: 4686:Variable-length encoding 4467:Miscellaneous code pages 3225:Extended Unix Code / EUC 2916:-15 (New Western Europe) 2712:Early telecommunications 2659:Category: Unicode blocks 1464:Compatibility characters 832:Internet Mail Consortium 723:marks can be encoded as 310:The Unicode Standard 5.0 276:Internet Mail Consortium 4613:C0 and C1 control codes 1384:Comparison of encodings 1310:Halfwidth and fullwidth 1165:Universal Character Set 154:text using a stream of 61:variable-width encoding 2861:-3 (Maltese/Esperanto) 2812:World System Teletext 2309:Inscriptional Parthian 1996:Nyiakeng Puachue Hmong 1658:and symbols in Unicode 1275:CJK Unified Ideographs 910:. Updated by RFC  850:"Configuration Manual" 557:have to be escaped as 4635:Whitespace characters 4312:Ventura International 2445:Old Persian cuneiform 2304:Inscriptional Pahlavi 2199:Ancient North Arabian 2194:Anatolian hieroglyphs 1484:Precomposed character 1320:Whitespace characters 1249:Zero-width non-joiner 946:. Obsoletes RFC  854:Dovecot Documentation 326:–U+007E except 4030:Norwegian and Danish 2264:Egyptian hieroglyphs 1469:Duplicate characters 1285:Duplicate characters 838:on 7 September 2015. 814:HTML Living Standard 713:cross-site scripting 71:Transforms / Encodes 4590:Unified Hangul Code 4262:PostScript Standard 3985:Multinational (MCS) 2856:-2 (Central Europe) 2851:-1 (Western Europe) 2705:Character encodings 2329:Khitan small script 1766:Canadian Aboriginal 1501:Variation sequences 1459:Combining character 1369:Variation sequences 1280:Combining character 23: 4721:Character encoding 4671:Hardware code page 4431:typesetting system 4267:PostScript Latin 1 3923:Cyrillic + Finnish 3830:Windows code pages 3712:IBM AIX code pages 3040:National standards 2971:Ukrainian Cyrillic 2569:Notational scripts 2520:Tagalog (Baybayin) 2229:Caucasian Albanian 1552:numeric references 1527:Domain names (IDN) 1347:Bidirectional text 1224:Right-to-left mark 1220:Left-to-right mark 1175:Character property 1125:Unicode Consortium 1005:Proposed Standard. 904:Proposed Standard. 771:docs.microsoft.com 707:Older versions of 202:Unicode Consortium 16:Character encoding 4708: 4707: 4661:Charset detection 4600:Control character 4282:Sharp calculators 4153:Casio calculators 4081:Platform specific 3933:Cyrillic + German 3928:Cyrillic + French 3346:Maltese/Esperanto 2982:Bibliographic use 2866:-4 (North Europe) 2798:T.51/ISO/IEC 6937 2756:Baudot and Murray 2671: 2670: 2667: 2666: 2648:Category: Unicode 1685:Punctuation marks 1667:inherited scripts 1573:Related standards 1547:entity references 1445: 1444: 1328: 1327: 1244:Zero-width joiner 1064:. 14 October 2011 796:HTML 5.1 Standard 709:Internet Explorer 541: 540: 416:" is encoded as " 405:" is encoded as " 396:Hello, World+ACE- 394:" is encoded as " 256:transfer encoding 138: 137: 65:stateful encoding 4733: 4700: 4699: 4192:DG International 4067:Special Graphics 3868:Extended Latin-8 3266:Central European 3256:Barents Cyrillic 2961:Barents Cyrillic 2931:-12 (Devanagari) 2927:Abandoned parts 2698: 2691: 2684: 2675: 2656: 2655: 2645: 2644: 2607:Control Pictures 2560:Zanabazar Square 2299:Imperial Aramaic 2182:historic scripts 1651: 1511:Emoji skin color 1337: 1254:Zero-width space 1198: 1185:Private Use Area 1170:Character charts 1104: 1097: 1090: 1081: 1074: 1073: 1071: 1069: 1058: 1052: 1051: 1049: 1047: 1042: 1034: 1028: 1027: 1020: 1014: 1003: 986: 984:10.17487/RFC9051 964: 958: 902: 893: 891:10.17487/RFC3501 875: 869: 868: 863: 861: 846: 840: 839: 824: 818: 817: 806: 800: 799: 788: 782: 781: 779: 777: 763: 730: 726: 722: 718: 686: 679: 666: 656: 649: 642: 619: 612: 605: 598: 588: 583: 568: 564: 560: 556: 432: 419: 415: 408: 404: 397: 393: 376: 372: 357: 349: 337: 333: 329: 325: 318: 264:quoted-printable 245:quoted-printable 175:quoted-printable 162:text for use in 130: 123: 116: 24: 4741: 4740: 4736: 4735: 4734: 4732: 4731: 4730: 4711: 4710: 4709: 4704: 4690: 4666:Han unification 4639: 4594: 4462: 4423: 4341: 4163:Compucolor 8001 4076: 4072:Technical (TCS) 3995:French Canadian 3966: 3942: 3938:Polytonic Greek 3824: 3706: 3390: 3376:Turkic Cyrillic 3291:Font X (Kermit) 3286:Farsi (Persian) 3238: 3229: 3201: 3035: 2977: 2847:Approved parts 2834: 2707: 2702: 2672: 2663: 2633: 2617:List by subject 2590:Symbols, emojis 2585: 2564: 2480:Psalter Pahlavi 2181: 2175: 2036:Pracalit (Newa) 1851:Hanifi Rohingya 1699: 1675:Combining marks 1666: 1659: 1645: 1641:Han unification 1604: 1568: 1515: 1451: 1441: 1378: 1324: 1258: 1202:Special purpose 1189: 1139: 1113: 1108: 1078: 1077: 1067: 1065: 1060: 1059: 1055: 1045: 1043: 1040: 1036: 1035: 1031: 1022: 1021: 1017: 966: 965: 961: 877: 876: 872: 859: 857: 848: 847: 843: 826: 825: 821: 808: 807: 803: 790: 789: 785: 775: 773: 765: 764: 757: 752: 740: 728: 724: 720: 716: 701: 684: 677: 673: 671:Byte order mark 665: 661: 660: 654: 653: 647: 646: 640: 639: 627: 622: 617: 616: 610: 609: 603: 602: 596: 595: 591: 586: 581: 566: 562: 558: 554: 551: 546: 528:Base64-Encoded 426:modified Base64 417: 413: 406: 402: 395: 391: 387: 374: 370: 367:modified Base64 355: 347: 327: 316: 297: 214: 134: 17: 12: 11: 5: 4739: 4737: 4729: 4728: 4723: 4713: 4712: 4706: 4705: 4702:Character sets 4695: 4692: 4691: 4689: 4688: 4683: 4678: 4673: 4668: 4663: 4658: 4653: 4647: 4645: 4644:Related topics 4641: 4640: 4638: 4637: 4632: 4627: 4626: 4625: 4620: 4610: 4608:Morse prosigns 4604: 4602: 4596: 4595: 4593: 4592: 4587: 4582: 4577: 4572: 4567: 4560: 4559: 4558: 4553: 4548: 4538: 4533: 4528: 4527: 4526: 4521: 4513: 4508: 4503: 4498: 4493: 4492: 4491: 4481: 4476: 4470: 4468: 4464: 4463: 4461: 4460: 4455: 4450: 4445: 4440: 4434: 4432: 4425: 4424: 4422: 4421: 4416: 4411: 4406: 4401: 4396: 4391: 4386: 4381: 4376: 4371: 4366: 4361: 4355: 4353: 4343: 4342: 4340: 4339: 4334: 4329: 4324: 4319: 4314: 4309: 4304: 4302:TI calculators 4299: 4294: 4289: 4284: 4279: 4274: 4269: 4264: 4259: 4254: 4249: 4244: 4239: 4234: 4229: 4224: 4219: 4214: 4209: 4204: 4199: 4194: 4189: 4180: 4175: 4170: 4165: 4160: 4155: 4150: 4145: 4140: 4135: 4130: 4125: 4120: 4115: 4110: 4105: 4100: 4095: 4090: 4084: 4082: 4078: 4077: 4075: 4074: 4069: 4064: 4059: 4054: 4049: 4044: 4043: 4042: 4037: 4032: 4027: 4022: 4017: 4012: 4010:United Kingdom 4007: 4002: 3997: 3987: 3981: 3979: 3968: 3967: 3965: 3964: 3959: 3953: 3951: 3944: 3943: 3941: 3940: 3935: 3930: 3925: 3920: 3915: 3910: 3905: 3900: 3895: 3890: 3885: 3880: 3875: 3870: 3865: 3860: 3855: 3845: 3840: 3834: 3832: 3826: 3825: 3823: 3822: 3817: 3812: 3807: 3802: 3797: 3792: 3787: 3782: 3777: 3772: 3767: 3762: 3757: 3752: 3747: 3742: 3737: 3732: 3727: 3722: 3716: 3714: 3708: 3707: 3705: 3704: 3699: 3694: 3689: 3684: 3679: 3674: 3669: 3664: 3659: 3654: 3649: 3644: 3639: 3634: 3629: 3624: 3619: 3614: 3609: 3604: 3599: 3594: 3589: 3584: 3579: 3574: 3569: 3564: 3559: 3554: 3549: 3544: 3539: 3534: 3529: 3524: 3519: 3514: 3509: 3504: 3499: 3494: 3489: 3484: 3479: 3474: 3469: 3464: 3459: 3454: 3449: 3444: 3439: 3434: 3429: 3424: 3419: 3414: 3409: 3404: 3398: 3396: 3395:DOS code pages 3392: 3391: 3389: 3388: 3383: 3378: 3373: 3368: 3363: 3358: 3353: 3348: 3343: 3341:Latin (Kermit) 3338: 3333: 3328: 3323: 3318: 3313: 3308: 3303: 3298: 3293: 3288: 3283: 3278: 3273: 3268: 3263: 3258: 3253: 3248: 3242: 3240: 3231: 3230: 3228: 3227: 3222: 3217: 3211: 3209: 3203: 3202: 3200: 3199: 3194: 3189: 3184: 3179: 3174: 3169: 3164: 3159: 3154: 3149: 3144: 3139: 3134: 3129: 3124: 3119: 3114: 3109: 3104: 3099: 3094: 3089: 3084: 3079: 3074: 3069: 3064: 3059: 3054: 3049: 3043: 3041: 3037: 3036: 3034: 3033: 3028: 3023: 3018: 3013: 3008: 3003: 3002: 3001: 2996: 2985: 2983: 2979: 2978: 2976: 2975: 2974: 2973: 2968: 2963: 2958: 2950: 2949: 2948: 2943: 2941:KOI-8 Cyrillic 2935: 2934: 2933: 2925: 2924: 2923: 2921:-16 (Romanian) 2918: 2913: 2908: 2903: 2898: 2893: 2888: 2883: 2878: 2873: 2868: 2863: 2858: 2853: 2844: 2842: 2836: 2835: 2833: 2832: 2827: 2826: 2825: 2824: 2823: 2818: 2810: 2805: 2800: 2782: 2777: 2776: 2775: 2765: 2760: 2759: 2758: 2753: 2752: 2751: 2746: 2741: 2736: 2726: 2719:Telegraph code 2715: 2713: 2709: 2708: 2703: 2701: 2700: 2693: 2686: 2678: 2669: 2668: 2665: 2664: 2662: 2661: 2650: 2638: 2635: 2634: 2632: 2631: 2626: 2621: 2620: 2619: 2609: 2604: 2599: 2593: 2591: 2587: 2586: 2584: 2583: 2578: 2572: 2570: 2566: 2565: 2563: 2562: 2557: 2552: 2547: 2542: 2537: 2532: 2527: 2522: 2517: 2512: 2507: 2502: 2497: 2492: 2487: 2482: 2477: 2472: 2467: 2462: 2457: 2452: 2447: 2442: 2437: 2432: 2427: 2422: 2417: 2412: 2407: 2402: 2397: 2392: 2387: 2382: 2377: 2372: 2367: 2362: 2357: 2352: 2347: 2341: 2336: 2331: 2326: 2321: 2316: 2311: 2306: 2301: 2296: 2291: 2286: 2281: 2276: 2271: 2266: 2261: 2256: 2251: 2246: 2241: 2236: 2231: 2226: 2221: 2216: 2211: 2206: 2201: 2196: 2191: 2185: 2183: 2177: 2176: 2174: 2173: 2168: 2163: 2158: 2153: 2148: 2143: 2138: 2133: 2128: 2123: 2118: 2113: 2108: 2103: 2098: 2093: 2088: 2083: 2078: 2073: 2071:Sorang Sompeng 2068: 2063: 2058: 2053: 2048: 2043: 2038: 2033: 2028: 2023: 2018: 2013: 2008: 2003: 1998: 1993: 1988: 1983: 1978: 1973: 1968: 1963: 1961:Miao (Pollard) 1958: 1953: 1948: 1943: 1938: 1933: 1928: 1923: 1918: 1913: 1908: 1903: 1898: 1893: 1888: 1883: 1878: 1873: 1868: 1863: 1858: 1853: 1848: 1843: 1838: 1833: 1828: 1823: 1818: 1813: 1808: 1803: 1798: 1793: 1788: 1783: 1778: 1773: 1768: 1763: 1758: 1753: 1748: 1743: 1738: 1733: 1728: 1723: 1718: 1713: 1707: 1705: 1704:Modern scripts 1701: 1700: 1698: 1697: 1692: 1687: 1682: 1677: 1671: 1669: 1661: 1660: 1654: 1647: 1646: 1644: 1643: 1638: 1633: 1628: 1623: 1618: 1612: 1610: 1609:Related topics 1606: 1605: 1603: 1602: 1597: 1592: 1587: 1582: 1576: 1574: 1570: 1569: 1567: 1566: 1561: 1556: 1555: 1554: 1549: 1539: 1534: 1529: 1523: 1521: 1517: 1516: 1514: 1513: 1508: 1503: 1498: 1493: 1492: 1491: 1481: 1476: 1471: 1466: 1461: 1455: 1453: 1447: 1446: 1443: 1442: 1440: 1439: 1434: 1429: 1424: 1419: 1414: 1409: 1404: 1399: 1394: 1388: 1386: 1380: 1379: 1377: 1376: 1371: 1366: 1361: 1360: 1359: 1349: 1343: 1341: 1334: 1330: 1329: 1326: 1325: 1323: 1322: 1317: 1312: 1307: 1302: 1297: 1292: 1287: 1282: 1277: 1272: 1266: 1264: 1260: 1259: 1257: 1256: 1251: 1246: 1241: 1236: 1231: 1226: 1217: 1212: 1206: 1204: 1195: 1191: 1190: 1188: 1187: 1182: 1177: 1172: 1167: 1162: 1161: 1160: 1149: 1147: 1141: 1140: 1138: 1137: 1132: 1127: 1121: 1119: 1115: 1114: 1109: 1107: 1106: 1099: 1092: 1084: 1076: 1075: 1053: 1029: 1015: 959: 870: 841: 819: 801: 783: 754: 753: 751: 748: 747: 746: 739: 736: 715:attack as the 700: 697: 687:belong to the 672: 669: 668: 667: 663: 657: 650: 643: 626: 623: 621: 620: 613: 606: 599: 592: 590: 589: 584: 578: 574: 550: 547: 545: 542: 539: 538: 535: 532: 529: 525: 524: 521: 518: 515: 511: 510: 507: 504: 501: 498: 495: 492: 489: 486: 483: 480: 477: 474: 471: 468: 465: 462: 459: 456: 452: 451: 448: 445: 442: 439: 436: 430: 429: 410: 407:1 +- 1 +AD0- 2 399: 386: 383: 354:be encoded as 296: 293: 225:character sets 213: 210: 136: 135: 133: 132: 125: 118: 110: 107: 106: 97: 93: 92: 87: 83: 82: 72: 68: 67: 50: 49:Classification 46: 45: 36: 32: 31: 28: 15: 13: 10: 9: 6: 4: 3: 2: 4738: 4727: 4724: 4722: 4719: 4718: 4716: 4703: 4693: 4687: 4684: 4682: 4679: 4677: 4674: 4672: 4669: 4667: 4664: 4662: 4659: 4657: 4654: 4652: 4649: 4648: 4646: 4642: 4636: 4633: 4631: 4628: 4624: 4621: 4619: 4616: 4615: 4614: 4611: 4609: 4606: 4605: 4603: 4601: 4597: 4591: 4588: 4586: 4583: 4581: 4578: 4576: 4573: 4571: 4568: 4566: 4565: 4561: 4557: 4554: 4552: 4549: 4547: 4544: 4543: 4542: 4539: 4537: 4534: 4532: 4529: 4525: 4522: 4520: 4517: 4516: 4514: 4512: 4509: 4507: 4504: 4502: 4499: 4497: 4494: 4490: 4487: 4486: 4485: 4482: 4480: 4477: 4475: 4472: 4471: 4469: 4465: 4459: 4456: 4454: 4451: 4449: 4446: 4444: 4441: 4439: 4436: 4435: 4433: 4430: 4426: 4420: 4417: 4415: 4412: 4410: 4407: 4405: 4402: 4400: 4397: 4395: 4392: 4390: 4387: 4385: 4382: 4380: 4377: 4375: 4372: 4370: 4367: 4365: 4362: 4360: 4357: 4356: 4354: 4352: 4351:ISO/IEC 10646 4348: 4344: 4338: 4335: 4333: 4330: 4328: 4325: 4323: 4320: 4318: 4315: 4313: 4310: 4308: 4305: 4303: 4300: 4298: 4295: 4293: 4290: 4288: 4285: 4283: 4280: 4278: 4275: 4273: 4270: 4268: 4265: 4263: 4260: 4258: 4255: 4253: 4250: 4248: 4245: 4243: 4240: 4238: 4235: 4233: 4230: 4228: 4225: 4223: 4220: 4218: 4215: 4213: 4210: 4208: 4205: 4203: 4200: 4198: 4195: 4193: 4190: 4188: 4184: 4181: 4179: 4176: 4174: 4171: 4169: 4168:Compucolor II 4166: 4164: 4161: 4159: 4156: 4154: 4151: 4149: 4146: 4144: 4141: 4139: 4136: 4134: 4131: 4129: 4126: 4124: 4123:Acorn RISC OS 4121: 4119: 4116: 4114: 4111: 4109: 4106: 4104: 4101: 4099: 4096: 4094: 4091: 4089: 4086: 4085: 4083: 4079: 4073: 4070: 4068: 4065: 4063: 4060: 4058: 4055: 4053: 4052:8-bit Turkish 4050: 4048: 4045: 4041: 4038: 4036: 4033: 4031: 4028: 4026: 4023: 4021: 4018: 4016: 4013: 4011: 4008: 4006: 4003: 4001: 3998: 3996: 3993: 3992: 3991: 3988: 3986: 3983: 3982: 3980: 3977: 3973: 3969: 3963: 3960: 3958: 3955: 3954: 3952: 3949: 3945: 3939: 3936: 3934: 3931: 3929: 3926: 3924: 3921: 3919: 3916: 3914: 3911: 3909: 3906: 3904: 3901: 3899: 3896: 3894: 3891: 3889: 3886: 3884: 3881: 3879: 3876: 3874: 3871: 3869: 3866: 3864: 3861: 3859: 3856: 3853: 3849: 3846: 3844: 3841: 3839: 3836: 3835: 3833: 3831: 3827: 3821: 3818: 3816: 3813: 3811: 3808: 3806: 3803: 3801: 3798: 3796: 3793: 3791: 3788: 3786: 3783: 3781: 3778: 3776: 3773: 3771: 3768: 3766: 3763: 3761: 3758: 3756: 3753: 3751: 3748: 3746: 3743: 3741: 3738: 3736: 3733: 3731: 3728: 3726: 3723: 3721: 3718: 3717: 3715: 3713: 3709: 3703: 3700: 3698: 3695: 3693: 3690: 3688: 3685: 3683: 3680: 3678: 3675: 3673: 3670: 3668: 3665: 3663: 3660: 3658: 3655: 3653: 3650: 3648: 3645: 3643: 3640: 3638: 3635: 3633: 3630: 3628: 3625: 3623: 3620: 3618: 3615: 3613: 3610: 3608: 3605: 3603: 3600: 3598: 3595: 3593: 3590: 3588: 3585: 3583: 3580: 3578: 3575: 3573: 3570: 3568: 3565: 3563: 3560: 3558: 3555: 3553: 3550: 3548: 3545: 3543: 3540: 3538: 3535: 3533: 3530: 3528: 3525: 3523: 3520: 3518: 3515: 3513: 3510: 3508: 3505: 3503: 3500: 3498: 3495: 3493: 3490: 3488: 3485: 3483: 3480: 3478: 3475: 3473: 3470: 3468: 3465: 3463: 3460: 3458: 3455: 3453: 3450: 3448: 3445: 3443: 3440: 3438: 3435: 3433: 3430: 3428: 3425: 3423: 3420: 3418: 3415: 3413: 3410: 3408: 3405: 3403: 3400: 3399: 3397: 3393: 3387: 3384: 3382: 3379: 3377: 3374: 3372: 3369: 3367: 3364: 3362: 3359: 3357: 3354: 3352: 3349: 3347: 3344: 3342: 3339: 3337: 3334: 3332: 3329: 3327: 3324: 3322: 3319: 3317: 3314: 3312: 3309: 3307: 3304: 3302: 3299: 3297: 3294: 3292: 3289: 3287: 3284: 3282: 3279: 3277: 3274: 3272: 3269: 3267: 3264: 3262: 3259: 3257: 3254: 3252: 3249: 3247: 3244: 3243: 3241: 3237: 3232: 3226: 3223: 3221: 3220:ISO/IEC 10367 3218: 3216: 3213: 3212: 3210: 3208: 3204: 3198: 3195: 3193: 3190: 3188: 3185: 3183: 3180: 3178: 3175: 3173: 3170: 3168: 3165: 3163: 3160: 3158: 3155: 3153: 3150: 3148: 3145: 3143: 3140: 3138: 3135: 3133: 3130: 3128: 3125: 3123: 3120: 3118: 3115: 3113: 3110: 3108: 3105: 3103: 3100: 3098: 3095: 3093: 3090: 3088: 3085: 3083: 3080: 3078: 3075: 3073: 3070: 3068: 3065: 3063: 3060: 3058: 3055: 3053: 3050: 3048: 3045: 3044: 3042: 3038: 3032: 3029: 3027: 3024: 3022: 3019: 3017: 3014: 3012: 3009: 3007: 3004: 3000: 2997: 2995: 2992: 2991: 2990: 2987: 2986: 2984: 2980: 2972: 2969: 2967: 2964: 2962: 2959: 2957: 2954: 2953: 2951: 2947: 2944: 2942: 2939: 2938: 2936: 2932: 2929: 2928: 2926: 2922: 2919: 2917: 2914: 2912: 2909: 2907: 2904: 2902: 2899: 2897: 2894: 2892: 2889: 2887: 2884: 2882: 2879: 2877: 2874: 2872: 2871:-5 (Cyrillic) 2869: 2867: 2864: 2862: 2859: 2857: 2854: 2852: 2849: 2848: 2846: 2845: 2843: 2841: 2837: 2831: 2828: 2822: 2819: 2817: 2814: 2813: 2811: 2809: 2806: 2804: 2801: 2799: 2796: 2795: 2794: 2790: 2786: 2783: 2781: 2778: 2774: 2771: 2770: 2769: 2766: 2764: 2761: 2757: 2754: 2750: 2747: 2745: 2742: 2740: 2737: 2735: 2732: 2731: 2730: 2727: 2725: 2722: 2721: 2720: 2717: 2716: 2714: 2710: 2706: 2699: 2694: 2692: 2687: 2685: 2680: 2679: 2676: 2660: 2651: 2649: 2640: 2639: 2636: 2630: 2627: 2625: 2622: 2618: 2615: 2614: 2613: 2610: 2608: 2605: 2603: 2600: 2598: 2595: 2594: 2592: 2588: 2582: 2579: 2577: 2574: 2573: 2571: 2567: 2561: 2558: 2556: 2553: 2551: 2548: 2546: 2543: 2541: 2540:Tulu Tigalari 2538: 2536: 2533: 2531: 2528: 2526: 2523: 2521: 2518: 2516: 2515:Sylheti Nagri 2513: 2511: 2508: 2506: 2505:South Arabian 2503: 2501: 2498: 2496: 2493: 2491: 2488: 2486: 2483: 2481: 2478: 2476: 2473: 2471: 2468: 2466: 2463: 2461: 2458: 2456: 2453: 2451: 2448: 2446: 2443: 2441: 2438: 2436: 2433: 2431: 2430:Old Hungarian 2428: 2426: 2423: 2421: 2418: 2416: 2413: 2411: 2408: 2406: 2403: 2401: 2398: 2396: 2393: 2391: 2388: 2386: 2383: 2381: 2378: 2376: 2373: 2371: 2368: 2366: 2363: 2361: 2358: 2356: 2353: 2351: 2348: 2345: 2342: 2340: 2337: 2335: 2332: 2330: 2327: 2325: 2322: 2320: 2317: 2315: 2312: 2310: 2307: 2305: 2302: 2300: 2297: 2295: 2292: 2290: 2287: 2285: 2282: 2280: 2277: 2275: 2272: 2270: 2267: 2265: 2262: 2260: 2257: 2255: 2252: 2250: 2247: 2245: 2242: 2240: 2237: 2235: 2232: 2230: 2227: 2225: 2222: 2220: 2217: 2215: 2212: 2210: 2207: 2205: 2202: 2200: 2197: 2195: 2192: 2190: 2187: 2186: 2184: 2178: 2172: 2169: 2167: 2164: 2162: 2159: 2157: 2154: 2152: 2149: 2147: 2144: 2142: 2139: 2137: 2134: 2132: 2129: 2127: 2124: 2122: 2119: 2117: 2114: 2112: 2109: 2107: 2104: 2102: 2099: 2097: 2094: 2092: 2089: 2087: 2084: 2082: 2079: 2077: 2074: 2072: 2069: 2067: 2064: 2062: 2059: 2057: 2054: 2052: 2049: 2047: 2044: 2042: 2039: 2037: 2034: 2032: 2029: 2027: 2024: 2022: 2019: 2017: 2014: 2012: 2009: 2007: 2004: 2002: 1999: 1997: 1994: 1992: 1989: 1987: 1984: 1982: 1979: 1977: 1974: 1972: 1969: 1967: 1964: 1962: 1959: 1957: 1954: 1952: 1951:Mende Kikakui 1949: 1947: 1946:Masaram Gondi 1944: 1942: 1939: 1937: 1934: 1932: 1931:Lisu (Fraser) 1929: 1927: 1924: 1922: 1919: 1917: 1914: 1912: 1909: 1907: 1904: 1902: 1899: 1897: 1894: 1892: 1889: 1887: 1884: 1882: 1879: 1877: 1874: 1872: 1869: 1867: 1864: 1862: 1859: 1857: 1854: 1852: 1849: 1847: 1844: 1842: 1839: 1837: 1834: 1832: 1831:Gunjala Gondi 1829: 1827: 1824: 1822: 1819: 1817: 1814: 1812: 1809: 1807: 1804: 1802: 1799: 1797: 1794: 1792: 1789: 1787: 1784: 1782: 1779: 1777: 1774: 1772: 1769: 1767: 1764: 1762: 1759: 1757: 1754: 1752: 1749: 1747: 1744: 1742: 1739: 1737: 1734: 1732: 1729: 1727: 1724: 1722: 1719: 1717: 1714: 1712: 1709: 1708: 1706: 1702: 1696: 1693: 1691: 1688: 1686: 1683: 1681: 1678: 1676: 1673: 1672: 1670: 1668: 1662: 1657: 1652: 1648: 1642: 1639: 1637: 1634: 1632: 1629: 1627: 1624: 1622: 1619: 1617: 1614: 1613: 1611: 1607: 1601: 1598: 1596: 1593: 1591: 1588: 1586: 1583: 1581: 1578: 1577: 1575: 1571: 1565: 1562: 1560: 1557: 1553: 1550: 1548: 1545: 1544: 1543: 1540: 1538: 1535: 1533: 1530: 1528: 1525: 1524: 1522: 1518: 1512: 1509: 1507: 1504: 1502: 1499: 1497: 1494: 1490: 1487: 1486: 1485: 1482: 1480: 1477: 1475: 1472: 1470: 1467: 1465: 1462: 1460: 1457: 1456: 1454: 1448: 1438: 1435: 1433: 1430: 1428: 1425: 1423: 1420: 1418: 1415: 1413: 1410: 1408: 1405: 1403: 1400: 1398: 1395: 1393: 1390: 1389: 1387: 1385: 1381: 1375: 1372: 1370: 1367: 1365: 1362: 1358: 1357:ISO/IEC 14651 1355: 1354: 1353: 1350: 1348: 1345: 1344: 1342: 1338: 1335: 1331: 1321: 1318: 1316: 1313: 1311: 1308: 1306: 1303: 1301: 1298: 1296: 1293: 1291: 1288: 1286: 1283: 1281: 1278: 1276: 1273: 1271: 1268: 1267: 1265: 1261: 1255: 1252: 1250: 1247: 1245: 1242: 1240: 1237: 1235: 1232: 1230: 1227: 1225: 1221: 1218: 1216: 1213: 1211: 1208: 1207: 1205: 1203: 1199: 1196: 1192: 1186: 1183: 1181: 1178: 1176: 1173: 1171: 1168: 1166: 1163: 1159: 1156: 1155: 1154: 1151: 1150: 1148: 1146: 1142: 1136: 1133: 1131: 1128: 1126: 1123: 1122: 1120: 1116: 1112: 1105: 1100: 1098: 1093: 1091: 1086: 1085: 1082: 1063: 1057: 1054: 1039: 1033: 1030: 1025: 1019: 1016: 1013: 1010: 1006: 1001: 998: 994: 990: 985: 980: 976: 972: 971: 963: 960: 957: 955: 949: 945: 941: 937: 933: 929: 925: 921: 917: 913: 909: 905: 900: 897: 892: 887: 883: 882: 874: 871: 867: 855: 851: 845: 842: 837: 833: 829: 823: 820: 815: 811: 805: 802: 797: 793: 787: 784: 772: 768: 762: 760: 756: 749: 745: 742: 741: 737: 735: 732: 714: 710: 705: 698: 696: 694: 690: 681: 670: 658: 651: 644: 637: 636: 635: 632: 624: 614: 607: 600: 593: 585: 580: 579: 576: 575: 573: 570: 548: 543: 527: 526: 513: 512: 508: 505: 502: 499: 496: 493: 490: 487: 484: 481: 478: 475: 472: 469: 466: 463: 460: 457: 454: 453: 434: 433: 427: 423: 411: 400: 392:Hello, World! 389: 388: 384: 382: 380: 368: 364: 359: 353: 343: 341: 323: 313: 311: 306: 302: 294: 292: 290: 285: 283: 279: 277: 271: 269: 265: 261: 260:encoded words 257: 252: 250: 246: 242: 238: 234: 230: 226: 222: 218: 211: 209: 207: 203: 198: 195: 191: 187: 183: 178: 176: 172: 168: 165: 161: 157: 153: 149: 146: 142: 131: 126: 124: 119: 117: 112: 111: 108: 105: 101: 98: 94: 91: 88: 84: 80: 76: 75:ISO/IEC 10646 73: 69: 66: 62: 58: 54: 51: 47: 44: 40: 37: 33: 30:International 29: 25: 19: 4618:ISO/IEC 6429 4575:Stanford/ITS 4562: 4496:ARIB STD-B24 4363: 4277:Sega SC-3000 4178:DEC RADIX 50 3215:ISO/IEC 8859 3207:ISO/IEC 2022 2952:Adaptations 2911:-14 (Celtic) 2906:-13 (Baltic) 2896:-10 (Nordic) 2891:-9 (Turkish) 2840:ISO/IEC 8859 2395:Meetei Mayek 2346:(Chorasmian) 2249:Cypro-Minoan 2026:Pahawh Hmong 1841:Gurung Khema 1590:ISO/IEC 8859 1432:UTF-32/UCS-4 1427:UTF-16/UCS-2 1416: 1234:Variant form 1066:. Retrieved 1056: 1044:. Retrieved 1032: 1018: 1011: 1004: 969: 962: 951: 903: 880: 873: 865: 858:. Retrieved 853: 844: 836:the original 831: 822: 813: 804: 795: 786: 774:. Retrieved 770: 733: 706: 702: 688: 682: 674: 630: 628: 571: 552: 455:Bit pattern 379:hyphen-minus 360: 351: 344: 314: 309: 300: 298: 286: 280: 272: 253: 215: 199: 179: 140: 139: 96:Succeeded by 18: 4337:ZX Spectrum 4292:Sinclair QL 4128:Amstrad CPC 4047:8-bit Greek 3974:terminals ( 3687:Iran System 3239:("scripts") 2886:-8 (Hebrew) 2876:-6 (Arabic) 2773:ISO/IEC 646 2581:SignWriting 2450:Old Sogdian 2420:Nandinagari 2344:Khwarezmian 2254:Dives Akuru 2180:Ancient and 2166:Warang Citi 2031:Pau Cin Hau 1986:New Tai Lue 1981:Nag Mundari 1956:Medefaidrin 1665:Common and 1474:Equivalence 1452:code points 1450:On pairs of 1364:Equivalence 1239:Word joiner 1229:Soft hyphen 1145:Code points 860:28 February 346:plus sign ( 308:Standard. 295:Description 233:8-bit clean 86:Preceded by 57:ASCII armor 27:Language(s) 4715:Categories 4623:JIS X 0211 4531:ISO-IR-169 4384:UTF-EBCDIC 3950:code pages 3677:CSX+ Indic 3281:Devanagari 3236:Code pages 3157:LST 1590-4 3127:JIS X 0213 3122:JIS X 0212 3117:JIS X 0208 3112:JIS X 0201 3077:GOST 10859 2999:CCCII/EACC 2901:-11 (Thai) 2881:-7 (Greek) 2816:background 2739:Wabun/Kana 2475:Phoenician 2460:Old Uyghur 2455:Old Turkic 2440:Old Permic 2435:Old Italic 2385:Manichaean 2279:Glagolitic 2056:Saurashtra 1801:Devanagari 1680:Diacritics 1437:UTF-EBCDIC 1340:Algorithms 1333:Processing 1270:Characters 1194:Characters 1046:17 January 750:References 567:+AEA-+AEA- 435:Hex digit 422:pound sign 212:Motivation 90:HZ-GB-2312 4676:MICR code 4511:IEC-P27-1 4489:ISO-IR-68 4394:DIN 91379 4272:SAM Coupé 4207:GSM 03.38 4197:Galaksija 3692:Kamenický 3672:CSX Indic 3381:Ukrainian 3167:Shift JIS 3147:KS X 1002 3142:KS X 1001 3067:DIN 66003 3062:CNS 11643 2830:Transcode 2808:ITU T.101 2734:Non-Latin 2470:ʼPhags-pa 2465:Palmyrene 2415:Nabataean 2339:Khudawadi 2324:Kharosthi 2239:Cuneiform 2214:Bhaiksuki 2209:Bassa Vah 2076:Sundanese 2051:Samaritan 1966:Mongolian 1941:Malayalam 1906:Kirat Rai 1616:Anomalies 1600:ISO 15924 1595:DIN 91379 1496:Z-variant 1479:Homoglyph 1352:Collation 993:2070-1721 816:. WHATWG. 776:8 January 689:following 403:1 + 1 = 2 340:JIS-Roman 4681:Mojibake 4536:ISO 2033 4501:Fieldata 4479:ASMO 449 4389:GB 18030 4349: / 4297:Teletext 4287:Sharp MZ 4217:HP FOCAL 4212:HP Roman 4143:Atari ST 4133:Apple II 3667:CS Indic 3361:Romanian 3336:Keyboard 3316:Gurmukhi 3311:Gujarati 3301:Georgian 3276:Cyrillic 3271:Croatian 3246:Armenian 3152:LST 1564 3137:KPS 9566 3097:GB 18030 3092:GB 12052 3087:GB 12345 3072:ELOT 927 3006:ISO 5426 2966:Estonian 2803:ITU T.61 2793:Teletext 2789:Videotex 2763:Fieldata 2749:Cyrillic 2602:Currency 2576:Duployan 2550:Vithkuqi 2545:Ugaritic 2400:Meroitic 2370:Mahajani 2355:Linear B 2350:Linear A 2141:Tifinagh 2106:Tai Viet 2101:Tai Tham 2091:Tagbanwa 2006:Ol Chiki 1896:Kayah Li 1891:Katakana 1876:Javanese 1871:Hiragana 1861:Hanunuoo 1836:Gurmukhi 1826:Gujarati 1816:Georgian 1791:Cyrillic 1781:Cherokee 1746:Bopomofo 1726:Balinese 1721:Armenian 1585:GB 18030 1402:Punycode 1290:Numerals 1222: / 1135:Versions 954:US-ASCII 738:See also 699:Security 625:Decoding 549:Encoding 385:Examples 303:. This 282:8BITMIME 164:Internet 104:8BITMIME 35:Standard 4570:SEASCII 4564:Mojikyō 4551:KOI8-RU 4474:ABICOMP 4347:Unicode 4257:PETSCII 4247:NEC APC 4183:DEC MCS 4138:ATASCII 4035:Swedish 4020:Finnish 4005:Spanish 3697:Mazovia 3662:ABICOMP 3371:Turkish 3326:Iceland 3234:Mac OS 3177:TIS-620 3082:GB 2312 3057:BraSCII 3047:ArmSCII 2785:Teletex 2744:Chinese 2510:Soyombo 2500:Sogdian 2495:Siddham 2490:Sharada 2410:Multani 2390:Marchen 2380:Mandaic 2375:Makasar 2289:Grantha 2274:Elymaic 2269:Elbasan 2244:Cypriot 2204:Avestan 2146:Tirhuta 2136:Tibetan 2081:Sunuwar 2066:Sinhala 2061:Shavian 2041:Ranjana 2021:Osmanya 2011:Ol Onal 1936:Lontara 1886:Kannada 1796:Deseret 1761:Burmese 1751:Braille 1741:Bengali 1695:Numbers 1656:Scripts 1305:Symbols 1295:Scripts 1118:Unicode 1111:Unicode 1068:29 June 450:  377:(ASCII 221:headers 160:Unicode 152:Unicode 79:Unicode 4580:Symbol 4556:KOI8-U 4546:KOI8-R 4414:TACE16 4404:CESU-8 4399:BOCU-1 4379:UTF-32 4374:UTF-16 4317:WISCII 4307:TRS-80 4227:SQUOZE 4222:HP RPL 4062:Hebrew 4057:SI 960 4025:French 3948:EBCDIC 3838:CER-GS 3321:Hebrew 3296:Gaelic 3261:Celtic 3251:Arabic 3197:YUSCII 3187:VISCII 3172:SI 960 3162:PASCII 3011:5426-2 2989:MARC-8 2724:Needle 2657:  2646:  2555:Yezidi 2535:Todhri 2530:Tangut 2365:Lydian 2360:Lycian 2334:Khojki 2314:Kaithi 2294:Hatran 2284:Gothic 2234:Coptic 2224:Carian 2219:Brāhmī 2161:Wancho 2126:Thaana 2121:Telugu 2116:Tangsa 2096:Tai Le 2086:Syriac 2046:Rejang 1921:Lepcha 1866:Hebrew 1846:Hangul 1771:Chakma 1716:Arabic 1690:Spaces 1397:CESU-8 1392:BOCU-1 1300:Spaces 991:  798:. W3C. 685:U+FEFF 678:U+FEFF 514:Index 418:+AKM-1 363:UTF-16 268:Base64 237:Base64 206:HTML 5 194:UTF-16 190:emojis 167:E-mail 41:  4651:CCSID 4524:8-bit 4519:7-bit 4515:INIS 4369:UTF-8 4364:UTF-7 4359:UTF-1 4237:LMBCS 4173:CP/M+ 4015:Dutch 4000:Swiss 3682:CWI-2 3386:VT100 3356:Roman 3351:Ogham 3331:Inuit 3306:Greek 3192:VSCII 3182:TSCII 3132:KOI-7 3107:ISCII 3102:HKSCS 2994:ANSEL 2956:Welsh 2780:BCDIC 2768:ASCII 2729:Morse 2629:Emoji 2525:Takri 2485:Runic 2425:Ogham 2259:Dogra 2111:Tamil 2016:Osage 1991:Nüshu 1926:Limbu 1916:Latin 1901:Khmer 1881:Kanji 1856:Hanja 1821:Greek 1811:Geʽez 1806:Garay 1756:Buhid 1736:Batak 1731:Bamum 1711:Adlam 1559:Input 1537:Fonts 1532:Email 1520:Usage 1422:UTF-8 1417:UTF-7 1412:UTF-1 1263:Lists 1180:Plane 1153:Block 1041:(PDF) 729:+AD4- 725:+ADw- 328:~ \ + 241:ASCII 173:with 171:UTF-8 156:ASCII 141:UTF-7 102:over 100:UTF-8 22:UTF-7 4585:TRON 4438:Cork 4409:SCSU 4332:ZX81 4327:ZX80 4322:XCCS 4252:NeXT 4232:LICS 4187:NRCS 4148:BICS 4118:1058 4113:1057 4108:1056 4103:1055 4098:1054 4093:1053 4088:1052 3962:DKOI 3918:1270 3913:1258 3908:1257 3903:1256 3898:1255 3893:1254 3888:1253 3883:1252 3878:1251 3873:1250 3863:1169 3820:1133 3815:1124 3810:1046 3805:1019 3800:1018 3795:1017 3790:1016 3785:1015 3780:1014 3775:1013 3770:1012 3765:1010 3760:1009 3755:1008 3750:1006 3657:3846 3652:1127 3647:1118 3642:1117 3637:1116 3632:1115 3627:1098 3622:1044 3617:1043 3612:1042 3607:1040 3602:1034 3366:Sámi 3052:Big5 3031:6862 3026:6438 3021:5428 3016:5427 2946:Sámi 2821:sets 2787:and 2405:Modi 2319:Kawi 2189:Ahom 2151:Toto 2131:Thai 2001:Odia 1976:N'Ko 1776:Cham 1542:HTML 1489:list 1407:SCSU 1158:List 1070:2012 1048:2024 1009:3501 1000:9051 989:ISSN 975:IETF 948:2060 944:5738 942:and 940:5032 936:4466 932:5182 928:4469 924:4551 920:8474 916:8437 912:7817 908:9051 899:3501 862:2023 778:2021 727:and 721:> 719:and 717:< 334:and 324:0020 229:SMTP 217:MIME 43:2152 4541:KOI 4458:OT1 4453:OMS 4448:OML 4443:LY1 4429:TeX 4242:MSX 4202:GEM 4158:CDC 3976:VTx 3972:DEC 3858:950 3852:GBK 3848:936 3843:932 3745:922 3740:921 3735:915 3730:912 3725:896 3720:895 3702:MIK 3597:951 3592:950 3587:949 3582:942 3577:936 3572:932 3567:904 3562:903 3557:899 3552:897 3547:869 3542:868 3537:867 3532:866 3527:865 3522:864 3517:863 3512:862 3507:861 3502:860 3497:859 3492:858 3487:857 3482:856 3477:855 3472:853 3467:852 3462:851 3457:850 3452:778 3447:777 3442:776 3437:775 3432:773 3427:770 3422:737 3417:720 3412:708 3407:668 3402:437 2156:Vai 1971:Mru 1911:Lao 1210:BOM 997:RFC 979:doi 896:RFC 886:doi 565:as 523:12 520:10 352:may 305:RFC 266:or 249:BMP 186:BMP 145:bit 143:(7- 39:RFC 4717:: 4506:HZ 2171:Yi 995:. 987:. 977:. 973:. 950:. 938:, 934:, 930:, 926:, 922:, 918:, 914:, 894:. 864:. 852:. 830:. 812:. 794:. 769:. 758:^ 695:. 680:. 664:10 563:@@ 559:+- 537:M 534:K 531:A 517:0 509:0 447:3 444:A 441:0 438:0 414:£1 358:. 356:+- 350:) 322:U+ 208:. 177:. 63:, 59:, 55:, 4185:/ 3978:) 3854:) 3850:( 2791:/ 2697:e 2690:t 2683:v 1103:e 1096:t 1089:v 1072:. 1050:. 1026:. 1002:. 981:: 901:. 888:: 780:. 631:+ 555:+ 506:0 503:1 500:1 497:0 494:0 491:0 488:1 485:0 482:1 479:0 476:0 473:0 470:0 467:0 464:0 461:0 458:0 412:" 409:" 401:" 398:" 390:" 375:- 371:+ 348:+ 336:~ 332:\ 129:e 122:t 115:v 81:) 77:(

Index

RFC
2152
Unicode Transformation Format
ASCII armor
variable-width encoding
stateful encoding
ISO/IEC 10646
Unicode
HZ-GB-2312
UTF-8
8BITMIME
v
t
e
bit
Unicode Transformation Format
Unicode
ASCII
Unicode
Internet
E-mail
UTF-8
quoted-printable
Unicode Transformation Format
BMP
emojis
UTF-16
Unicode Consortium
HTML 5
MIME

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.