117:
MargaretAreYouGrievingOverGoldengroveUnleavingLeavesLikeTheThingsOfManYouWithYourFreshThoughtsCareForCanYouAhAsTheHeartGrowsOlderItWillComeToSuchSightsColderByAndByNorSpareASighThoughWorldsOfWanwoodLeafmealLieAndYetYouWillWeepAndKnowWhyNowNoMatterChildTheNameSorrowsSpringsAreTheSameNorMouthHadNoNorMindExpressedWhatHeartHeardOfGhostGuessedItIsTheBlightManWasBornForItIsMargaretYouMournFor
20:
2227:
2216:
164:
Here, SHY is a visible hyphen that is usually visually indistinguishable from a regular hyphen, but has been inserted solely for the purpose of line breaking. The purpose of the soft hyphen here is to distinguish it from any regular hyphen that might have been part of the original spelling of the
629:
341:
They are also used in emails to try to defeat spam prevention systems. For example, the phrase "I need your assistance discreetly" has a soft hyphen in the word assistance which may mean a mail system would not detect the phrase in the email body.
165:
word. This distinction helps re-use of already formatted text, when line breaks and soft hyphens inserted during word wrapping have to be removed to convert the text back into its unformatted form. For example, the copy or paste function of a
222:
defining 0x8D as an "Optional
Syllabification Control (OSC)", a "print control character" for use marking syllable boundaries in long words. This C1 control set was registered in 1979. (Note: this is not the same as the
60:
Two alternative ways of using the soft hyphen character for this purpose have emerged, depending on whether the encoded text will be broken into lines by its recipient, or has already been preformatted by its originator.
237::1986 (Latin 1) inherited SHY from EBCDIC, but called it "soft hyphen", placed it at position 0xAD (hexadecimal), and stated its purpose as "for use when a line break has been established within a word". Other
101:
of the characters on either side when not visible. The zero-width space, on the other hand, will not, as it is considered a visible character even if not rendered, thus having its own kerning metrics.
285:
Unicode 4.0 (2002) changed the category of its SHY character from previously "Pd" (punctuation, dash) to "Cf" (other, format), thereby aligning its interpretation of the character with that of HTML 4.
282:
HTML 4 (1999) redefined the purpose of the character as marking a hyphenation opportunity, which only becomes visible as a hyphen at the end of a line after formatting.
1119:
1061:
122:
On HTML browsers supporting soft hyphens, resizing the window will re-break the above text only at word boundaries, and insert a hyphen at the end of each line.
2184:
674:
2169:
2276:
2189:
979:
560:
289:
Other commands for marking hyphenation opportunities in text formatting languages (similar to the HTML 4 and
Unicode 4.0 interpretation of SHY):
964:
1208:
887:
273:
Unicode 1.0 (1991) and ISO 10646 (1993) took the first 256 code positions from ISO 8859-1, resulting in SHY at
Unicode code point of U+00AD.
1203:
69:
The use of SHY characters in text that will be broken into lines by the recipient is the application context considered by the post-1999
774:
737:
603:
1036:
667:
406:
211:). IBM defined its purpose as a "hyphen used to divide a word at the end of a line may be removed when a program adjusts lines."
1041:
956:
857:
1338:
1152:
1136:
1099:
946:
882:
702:
85:. It serves as an invisible marker used to specify a place in text where a hyphenated break is allowed without forcing a
2077:
522:
93:
at the end of a line. The soft hyphen's
Unicode semantics and HTML implementation are in many ways similar to Unicode's
1313:
2266:
2196:
1124:
924:
660:
460:
441:
77:
specifications, as well as some word-processing file formats. In this context, the soft hyphen may also be called a
1363:
1198:
1193:
842:
747:
1801:
1078:
787:
143:
130:
The SHY character is also used in text where paragraphs have already been broken into lines, such as certain
2231:
2132:
1488:
1393:
742:
730:
582:
228:
224:
1821:
1568:
1423:
1358:
1073:
941:
847:
806:
2017:
1771:
1766:
1643:
1056:
821:
109:
2256:
2002:
1947:
1916:
1836:
1558:
1523:
1403:
892:
279:
2 (1995) incorporated the "­" character entity from SGML, but explicitly discouraged its use.
174:
2271:
2261:
2047:
2042:
1957:
1901:
1267:
1262:
1247:
1046:
1031:
936:
872:
862:
852:
50:
2067:
2148:
2037:
2007:
1987:
1623:
1603:
1353:
919:
796:
792:
697:
356:
434:
2117:
2027:
2012:
1881:
1851:
1816:
1628:
1468:
1293:
1104:
816:
757:
166:
158:
139:
2220:
2179:
2072:
2022:
1911:
1871:
1796:
1786:
1776:
1678:
1648:
1633:
1538:
1513:
1388:
1368:
1237:
1228:
1114:
867:
826:
371:
297:
263:
181:
104:
To show the effect of a soft hyphen in HTML, the words of the following text (from the poem
94:
578:
2174:
2127:
2112:
2052:
1972:
1937:
1932:
1876:
1866:
1856:
1806:
1673:
1663:
1658:
1608:
1578:
1448:
1438:
1398:
1298:
1213:
1188:
877:
782:
752:
170:
1383:
553:
Additional
Control Functions for Bibliographic Use according to German Standard DIN 31626
19:
2082:
2062:
1982:
1962:
1952:
1861:
1708:
1638:
1613:
1593:
1548:
1533:
1508:
1458:
1433:
1378:
1333:
633:
430:
2250:
2102:
2087:
1967:
1906:
1791:
1733:
1728:
1723:
1693:
1668:
1618:
1518:
1503:
1493:
1343:
1288:
1131:
929:
725:
249:
90:
456:
2097:
1831:
1826:
1781:
1683:
1598:
1588:
1543:
1498:
1473:
1413:
1328:
1308:
1303:
1283:
1162:
1109:
361:
326:
Soft hyphens, like other invisible characters, have been used to obscure malicious
256:
character set covering all ISO 8859-1 characters) placed it at position 240 = 0xF0.
219:
207:
placed a SHY character (known there as a "syllable hyphen") at position 202 (0xCA
89:
in an inconvenient place if the text is re-flowed. It becomes visible only after
2153:
1992:
1977:
1891:
1761:
1738:
1703:
1573:
1553:
1528:
1348:
1257:
811:
474:
366:
351:
335:
327:
242:
208:
2032:
1483:
1373:
1009:
717:
234:
151:
131:
1896:
1811:
1743:
1478:
1252:
1172:
1167:
1068:
1051:
500:
376:
185:
86:
2122:
2092:
1942:
1927:
1922:
1713:
1563:
1463:
1443:
1408:
1318:
1157:
974:
607:
238:
180:
An example application that outputs soft hyphens for this reason is the
1846:
1841:
1718:
1653:
1583:
1323:
707:
683:
435:"Unicode interpretation of SOFT HYPHEN breaks ISO 8859-1 compatibility"
200:) characters in coded characters sets, roughly in chronological order:
98:
74:
551:
2107:
1886:
1698:
1688:
1418:
1004:
999:
969:
564:
253:
204:
147:
54:
53:
for the purpose of breaking words across lines by inserting visible
57:
if they fall on the line end but remain invisible within the line.
2201:
2057:
1997:
1453:
1428:
1083:
994:
989:
984:
311:
293:
173:, and remove any soft hyphens including any immediately following
155:
135:
18:
276:
259:
70:
1226:
656:
402:
146:. This is the application context originally considered by the
547:
526:
331:
307:
215:
184:
text formatter as used on many Unix/Linux systems to display
241:
parts placed it at the same position, with the exception of
97:, with the exception that the soft hyphen will preserve the
652:
501:"Extended Binary-Coded Decimal Interchange Code - S/390"
2162:
2141:
1752:
1276:
1236:
1181:
1145:
1092:
1022:
955:
912:
905:
835:
773:
766:
716:
690:
604:"Spammers Using Soft Hyphen To Hide Malicious URLs"
630:"Soft Hyphen ā A New URL Obfuscation Technique"
668:
8:
2170:Cultural, political, and religious symbols
1223:
909:
770:
675:
661:
653:
262:'s "Numeric and Special Graphic" (isonum)
112:) have been separated with soft hyphens:
169:can offer to replace line breaks with a
703:ISO/IEC 10646 (Universal Character Set)
475:"CSS Text Module Level 3 Specification"
388:
425:
423:
403:"Soft hyphen (SHY) ā a hard problem?"
396:
394:
392:
142:or printers, or pages represented in
65:Text to be formatted by the recipient
7:
1204:International Components for Unicode
1153:Common Locale Data Repository (CLDR)
457:"Yes, SOFT HYPHEN is a hard problem"
126:Text preformatted by the originator
49:, is a code point reserved in some
2185:Mathematical operators and symbols
479:W3C Candidate Recommendation Draft
154:standards and implemented in many
14:
481:. World Wide Web Consortium (W3C)
2226:
2225:
2215:
2214:
2197:Phonetic symbols (including IPA)
407:Tampere University of Technology
27:In computing and typesetting, a
270:for the ISO 8859-1 soft hyphen.
2277:Unicode formatting code points
579:"Commonly Confused Characters"
455:Eric Muller (14 August 2002).
401:Jukka Korpela (January 2011).
229:Operating System Command (OSC)
1:
1137:International Ideographs Core
947:International Ideographs Core
888:Alias names and abbreviations
266:set (ISO 8879:1986) includes
245:(Latin/Thai), which lacks it.
1359:CJK Unified Ideographs (Han)
1209:People involved with Unicode
461:Unicode Technical Committee
442:Unicode Technical Committee
2293:
1199:Ideographic Research Group
1194:ConScript Unicode Registry
144:page description languages
23:ISO symbol for soft hyphen
2210:
1222:
1079:Regional indicator symbol
788:Combining grapheme joiner
192:Encodings and definitions
2232:Category: Unicode blocks
1037:Compatibility characters
957:Comparison of encodings
883:Halfwidth and fullwidth
738:Universal Character Set
583:Simon Fraser University
1882:Inscriptional Parthian
1569:Nyiakeng Puachue Hmong
1231:and symbols in Unicode
848:CJK Unified Ideographs
120:
24:
2018:Old Persian cuneiform
1877:Inscriptional Pahlavi
1772:Ancient North Arabian
1767:Anatolian hieroglyphs
1057:Precomposed character
893:Whitespace characters
822:Zero-width non-joiner
175:whitespace characters
114:
110:Gerard Manley Hopkins
22:
1837:Egyptian hieroglyphs
1042:Duplicate characters
858:Duplicate characters
134:files, text sent to
79:discretionary hyphen
51:coded character sets
1902:Khitan small script
1339:Canadian Aboriginal
1074:Variation sequences
1032:Combining character
942:Variation sequences
853:Combining character
220:C1 control code set
2267:Control characters
2142:Notational scripts
2093:Tagalog (Baybayin)
1802:Caucasian Albanian
1125:numeric references
1100:Domain names (IDN)
920:Bidirectional text
797:Right-to-left mark
793:Left-to-right mark
748:Character property
698:Unicode Consortium
357:Non-breaking space
159:terminal emulators
140:terminal emulators
25:
2244:
2243:
2240:
2239:
2221:Category: Unicode
1258:Punctuation marks
1240:inherited scripts
1146:Related standards
1120:entity references
1018:
1017:
901:
900:
817:Zero-width joiner
167:terminal emulator
16:Unicode character
2284:
2229:
2228:
2218:
2217:
2180:Control Pictures
2133:Zanabazar Square
1872:Imperial Aramaic
1755:historic scripts
1224:
1084:Emoji skin color
910:
827:Zero-width space
771:
758:Private Use Area
743:Character charts
677:
670:
663:
654:
645:
644:
642:
640:
626:
620:
619:
617:
615:
610:. 7 October 2010
600:
594:
593:
591:
589:
575:
569:
568:
558:
550:(15 July 1979).
544:
538:
537:
535:
533:
519:
513:
512:
510:
508:
497:
491:
490:
488:
486:
471:
465:
464:
452:
446:
445:
439:
427:
418:
417:
415:
413:
398:
372:Zero-width space
317:
303:
269:
264:character entity
227:C1 control code
218:31626 defined a
214:German standard
95:zero-width space
44:
40:
37:
35:
2292:
2291:
2287:
2286:
2285:
2283:
2282:
2281:
2247:
2246:
2245:
2236:
2206:
2190:List by subject
2163:Symbols, emojis
2158:
2137:
2053:Psalter Pahlavi
1754:
1748:
1609:Pracalit (Newa)
1424:Hanifi Rohingya
1272:
1248:Combining marks
1239:
1232:
1218:
1214:Han unification
1177:
1141:
1088:
1024:
1014:
951:
897:
831:
775:Special purpose
762:
712:
686:
681:
649:
648:
638:
636:
628:
627:
623:
613:
611:
602:
601:
597:
587:
585:
577:
576:
572:
556:
546:
545:
541:
531:
529:
521:
520:
516:
506:
504:
499:
498:
494:
484:
482:
473:
472:
468:
454:
453:
449:
437:
433:(4 June 2003).
429:
428:
421:
411:
409:
400:
399:
390:
385:
348:
324:
322:Security issues
315:
301:
267:
194:
171:space character
128:
119:
106:Spring and Fall
83:optional hyphen
67:
47:syllable hyphen
42:
38:
33:
32:
17:
12:
11:
5:
2290:
2288:
2280:
2279:
2274:
2269:
2264:
2259:
2249:
2248:
2242:
2241:
2238:
2237:
2235:
2234:
2223:
2211:
2208:
2207:
2205:
2204:
2199:
2194:
2193:
2192:
2182:
2177:
2172:
2166:
2164:
2160:
2159:
2157:
2156:
2151:
2145:
2143:
2139:
2138:
2136:
2135:
2130:
2125:
2120:
2115:
2110:
2105:
2100:
2095:
2090:
2085:
2080:
2075:
2070:
2065:
2060:
2055:
2050:
2045:
2040:
2035:
2030:
2025:
2020:
2015:
2010:
2005:
2000:
1995:
1990:
1985:
1980:
1975:
1970:
1965:
1960:
1955:
1950:
1945:
1940:
1935:
1930:
1925:
1920:
1914:
1909:
1904:
1899:
1894:
1889:
1884:
1879:
1874:
1869:
1864:
1859:
1854:
1849:
1844:
1839:
1834:
1829:
1824:
1819:
1814:
1809:
1804:
1799:
1794:
1789:
1784:
1779:
1774:
1769:
1764:
1758:
1756:
1750:
1749:
1747:
1746:
1741:
1736:
1731:
1726:
1721:
1716:
1711:
1706:
1701:
1696:
1691:
1686:
1681:
1676:
1671:
1666:
1661:
1656:
1651:
1646:
1644:Sorang Sompeng
1641:
1636:
1631:
1626:
1621:
1616:
1611:
1606:
1601:
1596:
1591:
1586:
1581:
1576:
1571:
1566:
1561:
1556:
1551:
1546:
1541:
1536:
1534:Miao (Pollard)
1531:
1526:
1521:
1516:
1511:
1506:
1501:
1496:
1491:
1486:
1481:
1476:
1471:
1466:
1461:
1456:
1451:
1446:
1441:
1436:
1431:
1426:
1421:
1416:
1411:
1406:
1401:
1396:
1391:
1386:
1381:
1376:
1371:
1366:
1361:
1356:
1351:
1346:
1341:
1336:
1331:
1326:
1321:
1316:
1311:
1306:
1301:
1296:
1291:
1286:
1280:
1278:
1277:Modern scripts
1274:
1273:
1271:
1270:
1265:
1260:
1255:
1250:
1244:
1242:
1234:
1233:
1227:
1220:
1219:
1217:
1216:
1211:
1206:
1201:
1196:
1191:
1185:
1183:
1182:Related topics
1179:
1178:
1176:
1175:
1170:
1165:
1160:
1155:
1149:
1147:
1143:
1142:
1140:
1139:
1134:
1129:
1128:
1127:
1122:
1112:
1107:
1102:
1096:
1094:
1090:
1089:
1087:
1086:
1081:
1076:
1071:
1066:
1065:
1064:
1054:
1049:
1044:
1039:
1034:
1028:
1026:
1020:
1019:
1016:
1015:
1013:
1012:
1007:
1002:
997:
992:
987:
982:
977:
972:
967:
961:
959:
953:
952:
950:
949:
944:
939:
934:
933:
932:
922:
916:
914:
907:
903:
902:
899:
898:
896:
895:
890:
885:
880:
875:
870:
865:
860:
855:
850:
845:
839:
837:
833:
832:
830:
829:
824:
819:
814:
809:
804:
799:
790:
785:
779:
777:
768:
764:
763:
761:
760:
755:
750:
745:
740:
735:
734:
733:
722:
720:
714:
713:
711:
710:
705:
700:
694:
692:
688:
687:
682:
680:
679:
672:
665:
657:
647:
646:
621:
595:
581:. Greg Baker,
570:
539:
514:
492:
466:
447:
431:Markus G. Kuhn
419:
387:
386:
384:
381:
380:
379:
374:
369:
364:
359:
354:
347:
344:
323:
320:
319:
318:
305:
287:
286:
283:
280:
274:
271:
257:
246:
232:
212:
193:
190:
127:
124:
115:
66:
63:
15:
13:
10:
9:
6:
4:
3:
2:
2289:
2278:
2275:
2273:
2270:
2268:
2265:
2263:
2260:
2258:
2255:
2254:
2252:
2233:
2224:
2222:
2213:
2212:
2209:
2203:
2200:
2198:
2195:
2191:
2188:
2187:
2186:
2183:
2181:
2178:
2176:
2173:
2171:
2168:
2167:
2165:
2161:
2155:
2152:
2150:
2147:
2146:
2144:
2140:
2134:
2131:
2129:
2126:
2124:
2121:
2119:
2116:
2114:
2113:Tulu Tigalari
2111:
2109:
2106:
2104:
2101:
2099:
2096:
2094:
2091:
2089:
2088:Sylheti Nagri
2086:
2084:
2081:
2079:
2078:South Arabian
2076:
2074:
2071:
2069:
2066:
2064:
2061:
2059:
2056:
2054:
2051:
2049:
2046:
2044:
2041:
2039:
2036:
2034:
2031:
2029:
2026:
2024:
2021:
2019:
2016:
2014:
2011:
2009:
2006:
2004:
2003:Old Hungarian
2001:
1999:
1996:
1994:
1991:
1989:
1986:
1984:
1981:
1979:
1976:
1974:
1971:
1969:
1966:
1964:
1961:
1959:
1956:
1954:
1951:
1949:
1946:
1944:
1941:
1939:
1936:
1934:
1931:
1929:
1926:
1924:
1921:
1918:
1915:
1913:
1910:
1908:
1905:
1903:
1900:
1898:
1895:
1893:
1890:
1888:
1885:
1883:
1880:
1878:
1875:
1873:
1870:
1868:
1865:
1863:
1860:
1858:
1855:
1853:
1850:
1848:
1845:
1843:
1840:
1838:
1835:
1833:
1830:
1828:
1825:
1823:
1820:
1818:
1815:
1813:
1810:
1808:
1805:
1803:
1800:
1798:
1795:
1793:
1790:
1788:
1785:
1783:
1780:
1778:
1775:
1773:
1770:
1768:
1765:
1763:
1760:
1759:
1757:
1751:
1745:
1742:
1740:
1737:
1735:
1732:
1730:
1727:
1725:
1722:
1720:
1717:
1715:
1712:
1710:
1707:
1705:
1702:
1700:
1697:
1695:
1692:
1690:
1687:
1685:
1682:
1680:
1677:
1675:
1672:
1670:
1667:
1665:
1662:
1660:
1657:
1655:
1652:
1650:
1647:
1645:
1642:
1640:
1637:
1635:
1632:
1630:
1627:
1625:
1622:
1620:
1617:
1615:
1612:
1610:
1607:
1605:
1602:
1600:
1597:
1595:
1592:
1590:
1587:
1585:
1582:
1580:
1577:
1575:
1572:
1570:
1567:
1565:
1562:
1560:
1557:
1555:
1552:
1550:
1547:
1545:
1542:
1540:
1537:
1535:
1532:
1530:
1527:
1525:
1524:Mende Kikakui
1522:
1520:
1519:Masaram Gondi
1517:
1515:
1512:
1510:
1507:
1505:
1504:Lisu (Fraser)
1502:
1500:
1497:
1495:
1492:
1490:
1487:
1485:
1482:
1480:
1477:
1475:
1472:
1470:
1467:
1465:
1462:
1460:
1457:
1455:
1452:
1450:
1447:
1445:
1442:
1440:
1437:
1435:
1432:
1430:
1427:
1425:
1422:
1420:
1417:
1415:
1412:
1410:
1407:
1405:
1404:Gunjala Gondi
1402:
1400:
1397:
1395:
1392:
1390:
1387:
1385:
1382:
1380:
1377:
1375:
1372:
1370:
1367:
1365:
1362:
1360:
1357:
1355:
1352:
1350:
1347:
1345:
1342:
1340:
1337:
1335:
1332:
1330:
1327:
1325:
1322:
1320:
1317:
1315:
1312:
1310:
1307:
1305:
1302:
1300:
1297:
1295:
1292:
1290:
1287:
1285:
1282:
1281:
1279:
1275:
1269:
1266:
1264:
1261:
1259:
1256:
1254:
1251:
1249:
1246:
1245:
1243:
1241:
1235:
1230:
1225:
1221:
1215:
1212:
1210:
1207:
1205:
1202:
1200:
1197:
1195:
1192:
1190:
1187:
1186:
1184:
1180:
1174:
1171:
1169:
1166:
1164:
1161:
1159:
1156:
1154:
1151:
1150:
1148:
1144:
1138:
1135:
1133:
1130:
1126:
1123:
1121:
1118:
1117:
1116:
1113:
1111:
1108:
1106:
1103:
1101:
1098:
1097:
1095:
1091:
1085:
1082:
1080:
1077:
1075:
1072:
1070:
1067:
1063:
1060:
1059:
1058:
1055:
1053:
1050:
1048:
1045:
1043:
1040:
1038:
1035:
1033:
1030:
1029:
1027:
1021:
1011:
1008:
1006:
1003:
1001:
998:
996:
993:
991:
988:
986:
983:
981:
978:
976:
973:
971:
968:
966:
963:
962:
960:
958:
954:
948:
945:
943:
940:
938:
935:
931:
930:ISO/IEC 14651
928:
927:
926:
923:
921:
918:
917:
915:
911:
908:
904:
894:
891:
889:
886:
884:
881:
879:
876:
874:
871:
869:
866:
864:
861:
859:
856:
854:
851:
849:
846:
844:
841:
840:
838:
834:
828:
825:
823:
820:
818:
815:
813:
810:
808:
805:
803:
800:
798:
794:
791:
789:
786:
784:
781:
780:
778:
776:
772:
769:
765:
759:
756:
754:
751:
749:
746:
744:
741:
739:
736:
732:
729:
728:
727:
724:
723:
721:
719:
715:
709:
706:
704:
701:
699:
696:
695:
693:
689:
685:
678:
673:
671:
666:
664:
659:
658:
655:
651:
635:
631:
625:
622:
609:
605:
599:
596:
584:
580:
574:
571:
566:
562:
555:
554:
549:
543:
540:
528:
524:
518:
515:
502:
496:
493:
480:
476:
470:
467:
462:
458:
451:
448:
444:. L2/03-155R.
443:
436:
432:
426:
424:
420:
408:
404:
397:
395:
393:
389:
382:
378:
375:
373:
370:
368:
365:
363:
360:
358:
355:
353:
350:
349:
345:
343:
339:
337:
333:
329:
321:
313:
309:
306:
299:
295:
292:
291:
290:
284:
281:
278:
275:
272:
265:
261:
258:
255:
251:
250:code page 850
247:
244:
240:
236:
233:
230:
226:
221:
217:
213:
210:
206:
203:
202:
201:
199:
196:Soft hyphen (
191:
189:
187:
183:
178:
176:
172:
168:
162:
160:
157:
153:
149:
145:
141:
137:
133:
125:
123:
118:
113:
111:
107:
102:
100:
96:
92:
91:word wrapping
88:
84:
80:
76:
72:
64:
62:
58:
56:
52:
48:
30:
21:
1968:Meetei Mayek
1919:(Chorasmian)
1822:Cypro-Minoan
1599:Pahawh Hmong
1414:Gurung Khema
1163:ISO/IEC 8859
1005:UTF-32/UCS-4
1000:UTF-16/UCS-2
807:Variant form
801:
650:
637:. Retrieved
624:
612:. Retrieved
598:
586:. Retrieved
573:
552:
542:
530:. Retrieved
517:
505:. Retrieved
495:
483:. Retrieved
478:
469:
463:. L2/02-279.
450:
410:. Retrieved
362:Word divider
340:
325:
288:
225:ISO/IEC 6429
197:
195:
179:
163:
129:
121:
116:
105:
103:
82:
78:
68:
59:
46:
28:
26:
2257:Punctuation
2154:SignWriting
2023:Old Sogdian
1993:Nandinagari
1917:Khwarezmian
1827:Dives Akuru
1753:Ancient and
1739:Warang Citi
1604:Pau Cin Hau
1559:New Tai Lue
1554:Nag Mundari
1529:Medefaidrin
1238:Common and
1047:Equivalence
1025:code points
1023:On pairs of
937:Equivalence
812:Word joiner
802:Soft hyphen
718:Code points
503:. comsci.us
367:Word joiner
352:Hard hyphen
336:e-mail spam
243:ISO 8859-11
209:hexadecimal
39:SOFT HYPHEN
29:soft hyphen
2272:Whitespace
2262:Typography
2251:Categories
2048:Phoenician
2033:Old Uyghur
2028:Old Turkic
2013:Old Permic
2008:Old Italic
1958:Manichaean
1852:Glagolitic
1629:Saurashtra
1374:Devanagari
1253:Diacritics
1010:UTF-EBCDIC
913:Algorithms
906:Processing
843:Characters
767:Characters
523:"Glossary"
383:References
235:ISO 8859-1
152:ISO 8859-1
132:plain text
87:line break
2043:Ź¼Phags-pa
2038:Palmyrene
1988:Nabataean
1912:Khudawadi
1897:Kharosthi
1812:Cuneiform
1787:Bhaiksuki
1782:Bassa Vah
1649:Sundanese
1624:Samaritan
1539:Mongolian
1514:Malayalam
1479:Kirat Rai
1189:Anomalies
1173:ISO 15924
1168:DIN 91379
1069:Z-variant
1052:Homoglyph
925:Collation
377:Word wrap
268:­
186:man pages
43:­
31:(Unicode
2175:Currency
2149:Duployan
2123:Vithkuqi
2118:Ugaritic
1973:Meroitic
1943:Mahajani
1928:Linear B
1923:Linear A
1714:Tifinagh
1679:Tai Viet
1674:Tai Tham
1664:Tagbanwa
1579:Ol Chiki
1469:Kayah Li
1464:Katakana
1449:Javanese
1444:Hiragana
1434:Hanunuoo
1409:Gurmukhi
1399:Gujarati
1389:Georgian
1364:Cyrillic
1354:Cherokee
1319:Bopomofo
1299:Balinese
1294:Armenian
1158:GB 18030
975:Punycode
863:Numerals
795: /
708:Versions
634:Symantec
608:Slashdot
559:. ITSCJ/
485:7 August
346:See also
239:ISO 8859
2083:Soyombo
2073:Sogdian
2068:Siddham
2063:Sharada
1983:Multani
1963:Marchen
1953:Mandaic
1948:Makasar
1862:Grantha
1847:Elymaic
1842:Elbasan
1817:Cypriot
1777:Avestan
1719:Tirhuta
1709:Tibetan
1654:Sunuwar
1639:Sinhala
1634:Shavian
1614:Ranjana
1594:Osmanya
1584:Ol Onal
1509:Lontara
1459:Kannada
1369:Deseret
1334:Burmese
1324:Braille
1314:Bengali
1268:Numbers
1229:Scripts
878:Symbols
868:Scripts
691:Unicode
684:Unicode
639:8 April
614:8 April
588:12 July
532:8 April
507:8 April
412:8 April
328:domains
138:-style
99:kerning
75:Unicode
55:hyphens
2230:
2219:
2128:Yezidi
2108:Todhri
2103:Tangut
1938:Lydian
1933:Lycian
1907:Khojki
1887:Kaithi
1867:Hatran
1857:Gothic
1807:Coptic
1797:Carian
1792:BrÄhmÄ«
1734:Wancho
1699:Thaana
1694:Telugu
1689:Tangsa
1669:Tai Le
1659:Syriac
1619:Rejang
1494:Lepcha
1439:Hebrew
1419:Hangul
1344:Chakma
1289:Arabic
1263:Spaces
970:CESU-8
965:BOCU-1
873:Spaces
565:ISO-IR
254:MS-DOS
205:EBCDIC
148:EBCDIC
45:)) or
36:
34:U+00AD
2202:Emoji
2098:Takri
2058:Runic
1998:Ogham
1832:Dogra
1684:Tamil
1589:Osage
1564:NĆ¼shu
1499:Limbu
1489:Latin
1474:Khmer
1454:Kanji
1429:Hanja
1394:Greek
1384:GeŹ½ez
1379:Garay
1329:Buhid
1309:Batak
1304:Bamum
1284:Adlam
1132:Input
1110:Fonts
1105:Email
1093:Usage
995:UTF-8
990:UTF-7
985:UTF-1
836:Lists
753:Plane
726:Block
557:(PDF)
438:(PDF)
312:LaTeX
298:groff
294:troff
182:groff
156:VT100
136:VT100
1978:Modi
1892:Kawi
1762:Ahom
1724:Toto
1704:Thai
1574:Odia
1549:N'Ko
1349:Cham
1115:HTML
1062:list
980:SCSU
731:List
641:2011
616:2011
590:2011
567:-40.
561:IPSJ
534:2011
509:2011
487:2022
414:2011
332:URLs
310:and
296:and
277:HTML
260:SGML
252:(an
248:IBM
150:and
73:and
71:HTML
1729:Vai
1544:Mru
1484:Lao
783:BOM
548:DIN
527:IBM
334:in
330:or
308:TeX
216:DIN
198:SHY
108:by
81:or
2253::
1744:Yi
632:.
606:.
563:.
525:.
477:.
459:.
440:.
422:^
405:.
391:^
338:.
316:\-
314::
302:\%
300::
231:.)
188:.
177:.
161:.
676:e
669:t
662:v
643:.
618:.
592:.
536:.
511:.
489:.
416:.
304:.
41:(
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.