Knowledge (XXG)

Soft hyphen

Source šŸ“

117:
Margaret­Are­You­Grieving­Over­Goldengrove­Unleaving­Leaves­Like­The­Things­Of­Man­You­With­Your­Fresh­Thoughts­Care­For­Can­You­Ah­As­The­Heart­Grows­Older­It­Will­Come­To­Such­Sights­Colder­By­And­By­Nor­Spare­A­Sigh­Though­Worlds­Of­Wanwood­Leafmeal­Lie­And­Yet­You­Will­Weep­And­Know­Why­Now­No­Matter­Child­The­Name­Sorrows­Springs­Are­The­Same­Nor­Mouth­Had­No­Nor­Mind­Expressed­What­Heart­Heard­Of­Ghost­Guessed­It­Is­The­Blight­Man­Was­Born­For­It­Is­Margaret­You­Mourn­For
20: 2227: 2216: 164:
Here, SHY is a visible hyphen that is usually visually indistinguishable from a regular hyphen, but has been inserted solely for the purpose of line breaking. The purpose of the soft hyphen here is to distinguish it from any regular hyphen that might have been part of the original spelling of the
629: 341:
They are also used in emails to try to defeat spam prevention systems. For example, the phrase "I need your assista­nce discreetly" has a soft hyphen in the word assistance which may mean a mail system would not detect the phrase in the email body.
165:
word. This distinction helps re-use of already formatted text, when line breaks and soft hyphens inserted during word wrapping have to be removed to convert the text back into its unformatted form. For example, the copy or paste function of a
222:
defining 0x8D as an "Optional Syllabification Control (OSC)", a "print control character" for use marking syllable boundaries in long words. This C1 control set was registered in 1979. (Note: this is not the same as the
60:
Two alternative ways of using the soft hyphen character for this purpose have emerged, depending on whether the encoded text will be broken into lines by its recipient, or has already been preformatted by its originator.
237::1986 (Latin 1) inherited SHY from EBCDIC, but called it "soft hyphen", placed it at position 0xAD (hexadecimal), and stated its purpose as "for use when a line break has been established within a word". Other 101:
of the characters on either side when not visible. The zero-width space, on the other hand, will not, as it is considered a visible character even if not rendered, thus having its own kerning metrics.
285:
Unicode 4.0 (2002) changed the category of its SHY character from previously "Pd" (punctuation, dash) to "Cf" (other, format), thereby aligning its interpretation of the character with that of HTML 4.
282:
HTML 4 (1999) redefined the purpose of the character as marking a hyphenation opportunity, which only becomes visible as a hyphen at the end of a line after formatting.
1119: 1061: 122:
On HTML browsers supporting soft hyphens, resizing the window will re-break the above text only at word boundaries, and insert a hyphen at the end of each line.
2184: 674: 2169: 2276: 2189: 979: 560: 289:
Other commands for marking hyphenation opportunities in text formatting languages (similar to the HTML 4 and Unicode 4.0 interpretation of SHY):
964: 1208: 887: 273:
Unicode 1.0 (1991) and ISO 10646 (1993) took the first 256 code positions from ISO 8859-1, resulting in SHY at Unicode code point of U+00AD.
1203: 69:
The use of SHY characters in text that will be broken into lines by the recipient is the application context considered by the post-1999
774: 737: 603: 1036: 667: 406: 211:). IBM defined its purpose as a "hyphen used to divide a word at the end of a line may be removed when a program adjusts lines." 1041: 956: 857: 1338: 1152: 1136: 1099: 946: 882: 702: 85:. It serves as an invisible marker used to specify a place in text where a hyphenated break is allowed without forcing a 2077: 522: 93:
at the end of a line. The soft hyphen's Unicode semantics and HTML implementation are in many ways similar to Unicode's
1313: 2266: 2196: 1124: 924: 660: 460: 441: 77:
specifications, as well as some word-processing file formats. In this context, the soft hyphen may also be called a
1363: 1198: 1193: 842: 747: 1801: 1078: 787: 143: 130:
The SHY character is also used in text where paragraphs have already been broken into lines, such as certain
2231: 2132: 1488: 1393: 742: 730: 582: 228: 224: 1821: 1568: 1423: 1358: 1073: 941: 847: 806: 2017: 1771: 1766: 1643: 1056: 821: 109: 2256: 2002: 1947: 1916: 1836: 1558: 1523: 1403: 892: 279:
2 (1995) incorporated the "­" character entity from SGML, but explicitly discouraged its use.
174: 2271: 2261: 2047: 2042: 1957: 1901: 1267: 1262: 1247: 1046: 1031: 936: 872: 862: 852: 50: 2067: 2148: 2037: 2007: 1987: 1623: 1603: 1353: 919: 796: 792: 697: 356: 434: 2117: 2027: 2012: 1881: 1851: 1816: 1628: 1468: 1293: 1104: 816: 757: 166: 158: 139: 2220: 2179: 2072: 2022: 1911: 1871: 1796: 1786: 1776: 1678: 1648: 1633: 1538: 1513: 1388: 1368: 1237: 1228: 1114: 867: 826: 371: 297: 263: 181: 104:
To show the effect of a soft hyphen in HTML, the words of the following text (from the poem
94: 578: 2174: 2127: 2112: 2052: 1972: 1937: 1932: 1876: 1866: 1856: 1806: 1673: 1663: 1658: 1608: 1578: 1448: 1438: 1398: 1298: 1213: 1188: 877: 782: 752: 170: 1383: 553:
Additional Control Functions for Bibliographic Use according to German Standard DIN 31626
19: 2082: 2062: 1982: 1962: 1952: 1861: 1708: 1638: 1613: 1593: 1548: 1533: 1508: 1458: 1433: 1378: 1333: 633: 430: 2250: 2102: 2087: 1967: 1906: 1791: 1733: 1728: 1723: 1693: 1668: 1618: 1518: 1503: 1493: 1343: 1288: 1131: 929: 725: 249: 90: 456: 2097: 1831: 1826: 1781: 1683: 1598: 1588: 1543: 1498: 1473: 1413: 1328: 1308: 1303: 1283: 1162: 1109: 361: 326:
Soft hyphens, like other invisible characters, have been used to obscure malicious
256:
character set covering all ISO 8859-1 characters) placed it at position 240 = 0xF0.
219: 207:
placed a SHY character (known there as a "syllable hyphen") at position 202 (0xCA
89:
in an inconvenient place if the text is re-flowed. It becomes visible only after
2153: 1992: 1977: 1891: 1761: 1738: 1703: 1573: 1553: 1528: 1348: 1257: 811: 474: 366: 351: 335: 327: 242: 208: 2032: 1483: 1373: 1009: 717: 234: 151: 131: 1896: 1811: 1743: 1478: 1252: 1172: 1167: 1068: 1051: 500: 376: 185: 86: 2122: 2092: 1942: 1927: 1922: 1713: 1563: 1463: 1443: 1408: 1318: 1157: 974: 607: 238: 180:
An example application that outputs soft hyphens for this reason is the
1846: 1841: 1718: 1653: 1583: 1323: 707: 683: 435:"Unicode interpretation of SOFT HYPHEN breaks ISO 8859-1 compatibility" 200:) characters in coded characters sets, roughly in chronological order: 98: 74: 551: 2107: 1886: 1698: 1688: 1418: 1004: 999: 969: 564: 253: 204: 147: 54: 53:
for the purpose of breaking words across lines by inserting visible
57:
if they fall on the line end but remain invisible within the line.
2201: 2057: 1997: 1453: 1428: 1083: 994: 989: 984: 311: 293: 173:, and remove any soft hyphens including any immediately following 155: 135: 18: 276: 259: 70: 1226: 656: 402: 146:. This is the application context originally considered by the 547: 526: 331: 307: 215: 184:
text formatter as used on many Unix/Linux systems to display
241:
parts placed it at the same position, with the exception of
97:, with the exception that the soft hyphen will preserve the 652: 501:"Extended Binary-Coded Decimal Interchange Code - S/390" 2162: 2141: 1752: 1276: 1236: 1181: 1145: 1092: 1022: 955: 912: 905: 835: 773: 766: 716: 690: 604:"Spammers Using Soft Hyphen To Hide Malicious URLs" 630:"Soft Hyphen ā€“ A New URL Obfuscation Technique" 668: 8: 2170:Cultural, political, and religious symbols 1223: 909: 770: 675: 661: 653: 262:'s "Numeric and Special Graphic" (isonum) 112:) have been separated with soft hyphens: 169:can offer to replace line breaks with a 703:ISO/IEC 10646 (Universal Character Set) 475:"CSS Text Module Level 3 Specification" 388: 425: 423: 403:"Soft hyphen (SHY) ā€“ a hard problem?" 396: 394: 392: 142:or printers, or pages represented in 65:Text to be formatted by the recipient 7: 1204:International Components for Unicode 1153:Common Locale Data Repository (CLDR) 457:"Yes, SOFT HYPHEN is a hard problem" 126:Text preformatted by the originator 49:, is a code point reserved in some 2185:Mathematical operators and symbols 479:W3C Candidate Recommendation Draft 154:standards and implemented in many 14: 481:. World Wide Web Consortium (W3C) 2226: 2225: 2215: 2214: 2197:Phonetic symbols (including IPA) 407:Tampere University of Technology 27:In computing and typesetting, a 270:for the ISO 8859-1 soft hyphen. 2277:Unicode formatting code points 579:"Commonly Confused Characters" 455:Eric Muller (14 August 2002). 401:Jukka Korpela (January 2011). 229:Operating System Command (OSC) 1: 1137:International Ideographs Core 947:International Ideographs Core 888:Alias names and abbreviations 266:set (ISO 8879:1986) includes 245:(Latin/Thai), which lacks it. 1359:CJK Unified Ideographs (Han) 1209:People involved with Unicode 461:Unicode Technical Committee 442:Unicode Technical Committee 2293: 1199:Ideographic Research Group 1194:ConScript Unicode Registry 144:page description languages 23:ISO symbol for soft hyphen 2210: 1222: 1079:Regional indicator symbol 788:Combining grapheme joiner 192:Encodings and definitions 2232:Category: Unicode blocks 1037:Compatibility characters 957:Comparison of encodings 883:Halfwidth and fullwidth 738:Universal Character Set 583:Simon Fraser University 1882:Inscriptional Parthian 1569:Nyiakeng Puachue Hmong 1231:and symbols in Unicode 848:CJK Unified Ideographs 120: 24: 2018:Old Persian cuneiform 1877:Inscriptional Pahlavi 1772:Ancient North Arabian 1767:Anatolian hieroglyphs 1057:Precomposed character 893:Whitespace characters 822:Zero-width non-joiner 175:whitespace characters 114: 110:Gerard Manley Hopkins 22: 1837:Egyptian hieroglyphs 1042:Duplicate characters 858:Duplicate characters 134:files, text sent to 79:discretionary hyphen 51:coded character sets 1902:Khitan small script 1339:Canadian Aboriginal 1074:Variation sequences 1032:Combining character 942:Variation sequences 853:Combining character 220:C1 control code set 2267:Control characters 2142:Notational scripts 2093:Tagalog (Baybayin) 1802:Caucasian Albanian 1125:numeric references 1100:Domain names (IDN) 920:Bidirectional text 797:Right-to-left mark 793:Left-to-right mark 748:Character property 698:Unicode Consortium 357:Non-breaking space 159:terminal emulators 140:terminal emulators 25: 2244: 2243: 2240: 2239: 2221:Category: Unicode 1258:Punctuation marks 1240:inherited scripts 1146:Related standards 1120:entity references 1018: 1017: 901: 900: 817:Zero-width joiner 167:terminal emulator 16:Unicode character 2284: 2229: 2228: 2218: 2217: 2180:Control Pictures 2133:Zanabazar Square 1872:Imperial Aramaic 1755:historic scripts 1224: 1084:Emoji skin color 910: 827:Zero-width space 771: 758:Private Use Area 743:Character charts 677: 670: 663: 654: 645: 644: 642: 640: 626: 620: 619: 617: 615: 610:. 7 October 2010 600: 594: 593: 591: 589: 575: 569: 568: 558: 550:(15 July 1979). 544: 538: 537: 535: 533: 519: 513: 512: 510: 508: 497: 491: 490: 488: 486: 471: 465: 464: 452: 446: 445: 439: 427: 418: 417: 415: 413: 398: 372:Zero-width space 317: 303: 269: 264:character entity 227:C1 control code 218:31626 defined a 214:German standard 95:zero-width space 44: 40: 37: 35: 2292: 2291: 2287: 2286: 2285: 2283: 2282: 2281: 2247: 2246: 2245: 2236: 2206: 2190:List by subject 2163:Symbols, emojis 2158: 2137: 2053:Psalter Pahlavi 1754: 1748: 1609:Pracalit (Newa) 1424:Hanifi Rohingya 1272: 1248:Combining marks 1239: 1232: 1218: 1214:Han unification 1177: 1141: 1088: 1024: 1014: 951: 897: 831: 775:Special purpose 762: 712: 686: 681: 649: 648: 638: 636: 628: 627: 623: 613: 611: 602: 601: 597: 587: 585: 577: 576: 572: 556: 546: 545: 541: 531: 529: 521: 520: 516: 506: 504: 499: 498: 494: 484: 482: 473: 472: 468: 454: 453: 449: 437: 433:(4 June 2003). 429: 428: 421: 411: 409: 400: 399: 390: 385: 348: 324: 322:Security issues 315: 301: 267: 194: 171:space character 128: 119: 106:Spring and Fall 83:optional hyphen 67: 47:syllable hyphen 42: 38: 33: 32: 17: 12: 11: 5: 2290: 2288: 2280: 2279: 2274: 2269: 2264: 2259: 2249: 2248: 2242: 2241: 2238: 2237: 2235: 2234: 2223: 2211: 2208: 2207: 2205: 2204: 2199: 2194: 2193: 2192: 2182: 2177: 2172: 2166: 2164: 2160: 2159: 2157: 2156: 2151: 2145: 2143: 2139: 2138: 2136: 2135: 2130: 2125: 2120: 2115: 2110: 2105: 2100: 2095: 2090: 2085: 2080: 2075: 2070: 2065: 2060: 2055: 2050: 2045: 2040: 2035: 2030: 2025: 2020: 2015: 2010: 2005: 2000: 1995: 1990: 1985: 1980: 1975: 1970: 1965: 1960: 1955: 1950: 1945: 1940: 1935: 1930: 1925: 1920: 1914: 1909: 1904: 1899: 1894: 1889: 1884: 1879: 1874: 1869: 1864: 1859: 1854: 1849: 1844: 1839: 1834: 1829: 1824: 1819: 1814: 1809: 1804: 1799: 1794: 1789: 1784: 1779: 1774: 1769: 1764: 1758: 1756: 1750: 1749: 1747: 1746: 1741: 1736: 1731: 1726: 1721: 1716: 1711: 1706: 1701: 1696: 1691: 1686: 1681: 1676: 1671: 1666: 1661: 1656: 1651: 1646: 1644:Sorang Sompeng 1641: 1636: 1631: 1626: 1621: 1616: 1611: 1606: 1601: 1596: 1591: 1586: 1581: 1576: 1571: 1566: 1561: 1556: 1551: 1546: 1541: 1536: 1534:Miao (Pollard) 1531: 1526: 1521: 1516: 1511: 1506: 1501: 1496: 1491: 1486: 1481: 1476: 1471: 1466: 1461: 1456: 1451: 1446: 1441: 1436: 1431: 1426: 1421: 1416: 1411: 1406: 1401: 1396: 1391: 1386: 1381: 1376: 1371: 1366: 1361: 1356: 1351: 1346: 1341: 1336: 1331: 1326: 1321: 1316: 1311: 1306: 1301: 1296: 1291: 1286: 1280: 1278: 1277:Modern scripts 1274: 1273: 1271: 1270: 1265: 1260: 1255: 1250: 1244: 1242: 1234: 1233: 1227: 1220: 1219: 1217: 1216: 1211: 1206: 1201: 1196: 1191: 1185: 1183: 1182:Related topics 1179: 1178: 1176: 1175: 1170: 1165: 1160: 1155: 1149: 1147: 1143: 1142: 1140: 1139: 1134: 1129: 1128: 1127: 1122: 1112: 1107: 1102: 1096: 1094: 1090: 1089: 1087: 1086: 1081: 1076: 1071: 1066: 1065: 1064: 1054: 1049: 1044: 1039: 1034: 1028: 1026: 1020: 1019: 1016: 1015: 1013: 1012: 1007: 1002: 997: 992: 987: 982: 977: 972: 967: 961: 959: 953: 952: 950: 949: 944: 939: 934: 933: 932: 922: 916: 914: 907: 903: 902: 899: 898: 896: 895: 890: 885: 880: 875: 870: 865: 860: 855: 850: 845: 839: 837: 833: 832: 830: 829: 824: 819: 814: 809: 804: 799: 790: 785: 779: 777: 768: 764: 763: 761: 760: 755: 750: 745: 740: 735: 734: 733: 722: 720: 714: 713: 711: 710: 705: 700: 694: 692: 688: 687: 682: 680: 679: 672: 665: 657: 647: 646: 621: 595: 581:. Greg Baker, 570: 539: 514: 492: 466: 447: 431:Markus G. Kuhn 419: 387: 386: 384: 381: 380: 379: 374: 369: 364: 359: 354: 347: 344: 323: 320: 319: 318: 305: 287: 286: 283: 280: 274: 271: 257: 246: 232: 212: 193: 190: 127: 124: 115: 66: 63: 15: 13: 10: 9: 6: 4: 3: 2: 2289: 2278: 2275: 2273: 2270: 2268: 2265: 2263: 2260: 2258: 2255: 2254: 2252: 2233: 2224: 2222: 2213: 2212: 2209: 2203: 2200: 2198: 2195: 2191: 2188: 2187: 2186: 2183: 2181: 2178: 2176: 2173: 2171: 2168: 2167: 2165: 2161: 2155: 2152: 2150: 2147: 2146: 2144: 2140: 2134: 2131: 2129: 2126: 2124: 2121: 2119: 2116: 2114: 2113:Tulu Tigalari 2111: 2109: 2106: 2104: 2101: 2099: 2096: 2094: 2091: 2089: 2088:Sylheti Nagri 2086: 2084: 2081: 2079: 2078:South Arabian 2076: 2074: 2071: 2069: 2066: 2064: 2061: 2059: 2056: 2054: 2051: 2049: 2046: 2044: 2041: 2039: 2036: 2034: 2031: 2029: 2026: 2024: 2021: 2019: 2016: 2014: 2011: 2009: 2006: 2004: 2003:Old Hungarian 2001: 1999: 1996: 1994: 1991: 1989: 1986: 1984: 1981: 1979: 1976: 1974: 1971: 1969: 1966: 1964: 1961: 1959: 1956: 1954: 1951: 1949: 1946: 1944: 1941: 1939: 1936: 1934: 1931: 1929: 1926: 1924: 1921: 1918: 1915: 1913: 1910: 1908: 1905: 1903: 1900: 1898: 1895: 1893: 1890: 1888: 1885: 1883: 1880: 1878: 1875: 1873: 1870: 1868: 1865: 1863: 1860: 1858: 1855: 1853: 1850: 1848: 1845: 1843: 1840: 1838: 1835: 1833: 1830: 1828: 1825: 1823: 1820: 1818: 1815: 1813: 1810: 1808: 1805: 1803: 1800: 1798: 1795: 1793: 1790: 1788: 1785: 1783: 1780: 1778: 1775: 1773: 1770: 1768: 1765: 1763: 1760: 1759: 1757: 1751: 1745: 1742: 1740: 1737: 1735: 1732: 1730: 1727: 1725: 1722: 1720: 1717: 1715: 1712: 1710: 1707: 1705: 1702: 1700: 1697: 1695: 1692: 1690: 1687: 1685: 1682: 1680: 1677: 1675: 1672: 1670: 1667: 1665: 1662: 1660: 1657: 1655: 1652: 1650: 1647: 1645: 1642: 1640: 1637: 1635: 1632: 1630: 1627: 1625: 1622: 1620: 1617: 1615: 1612: 1610: 1607: 1605: 1602: 1600: 1597: 1595: 1592: 1590: 1587: 1585: 1582: 1580: 1577: 1575: 1572: 1570: 1567: 1565: 1562: 1560: 1557: 1555: 1552: 1550: 1547: 1545: 1542: 1540: 1537: 1535: 1532: 1530: 1527: 1525: 1524:Mende Kikakui 1522: 1520: 1519:Masaram Gondi 1517: 1515: 1512: 1510: 1507: 1505: 1504:Lisu (Fraser) 1502: 1500: 1497: 1495: 1492: 1490: 1487: 1485: 1482: 1480: 1477: 1475: 1472: 1470: 1467: 1465: 1462: 1460: 1457: 1455: 1452: 1450: 1447: 1445: 1442: 1440: 1437: 1435: 1432: 1430: 1427: 1425: 1422: 1420: 1417: 1415: 1412: 1410: 1407: 1405: 1404:Gunjala Gondi 1402: 1400: 1397: 1395: 1392: 1390: 1387: 1385: 1382: 1380: 1377: 1375: 1372: 1370: 1367: 1365: 1362: 1360: 1357: 1355: 1352: 1350: 1347: 1345: 1342: 1340: 1337: 1335: 1332: 1330: 1327: 1325: 1322: 1320: 1317: 1315: 1312: 1310: 1307: 1305: 1302: 1300: 1297: 1295: 1292: 1290: 1287: 1285: 1282: 1281: 1279: 1275: 1269: 1266: 1264: 1261: 1259: 1256: 1254: 1251: 1249: 1246: 1245: 1243: 1241: 1235: 1230: 1225: 1221: 1215: 1212: 1210: 1207: 1205: 1202: 1200: 1197: 1195: 1192: 1190: 1187: 1186: 1184: 1180: 1174: 1171: 1169: 1166: 1164: 1161: 1159: 1156: 1154: 1151: 1150: 1148: 1144: 1138: 1135: 1133: 1130: 1126: 1123: 1121: 1118: 1117: 1116: 1113: 1111: 1108: 1106: 1103: 1101: 1098: 1097: 1095: 1091: 1085: 1082: 1080: 1077: 1075: 1072: 1070: 1067: 1063: 1060: 1059: 1058: 1055: 1053: 1050: 1048: 1045: 1043: 1040: 1038: 1035: 1033: 1030: 1029: 1027: 1021: 1011: 1008: 1006: 1003: 1001: 998: 996: 993: 991: 988: 986: 983: 981: 978: 976: 973: 971: 968: 966: 963: 962: 960: 958: 954: 948: 945: 943: 940: 938: 935: 931: 930:ISO/IEC 14651 928: 927: 926: 923: 921: 918: 917: 915: 911: 908: 904: 894: 891: 889: 886: 884: 881: 879: 876: 874: 871: 869: 866: 864: 861: 859: 856: 854: 851: 849: 846: 844: 841: 840: 838: 834: 828: 825: 823: 820: 818: 815: 813: 810: 808: 805: 803: 800: 798: 794: 791: 789: 786: 784: 781: 780: 778: 776: 772: 769: 765: 759: 756: 754: 751: 749: 746: 744: 741: 739: 736: 732: 729: 728: 727: 724: 723: 721: 719: 715: 709: 706: 704: 701: 699: 696: 695: 693: 689: 685: 678: 673: 671: 666: 664: 659: 658: 655: 651: 635: 631: 625: 622: 609: 605: 599: 596: 584: 580: 574: 571: 566: 562: 555: 554: 549: 543: 540: 528: 524: 518: 515: 502: 496: 493: 480: 476: 470: 467: 462: 458: 451: 448: 444:. L2/03-155R. 443: 436: 432: 426: 424: 420: 408: 404: 397: 395: 393: 389: 382: 378: 375: 373: 370: 368: 365: 363: 360: 358: 355: 353: 350: 349: 345: 343: 339: 337: 333: 329: 321: 313: 309: 306: 299: 295: 292: 291: 290: 284: 281: 278: 275: 272: 265: 261: 258: 255: 251: 250:code page 850 247: 244: 240: 236: 233: 230: 226: 221: 217: 213: 210: 206: 203: 202: 201: 199: 196:Soft hyphen ( 191: 189: 187: 183: 178: 176: 172: 168: 162: 160: 157: 153: 149: 145: 141: 137: 133: 125: 123: 118: 113: 111: 107: 102: 100: 96: 92: 91:word wrapping 88: 84: 80: 76: 72: 64: 62: 58: 56: 52: 48: 30: 21: 1968:Meetei Mayek 1919:(Chorasmian) 1822:Cypro-Minoan 1599:Pahawh Hmong 1414:Gurung Khema 1163:ISO/IEC 8859 1005:UTF-32/UCS-4 1000:UTF-16/UCS-2 807:Variant form 801: 650: 637:. Retrieved 624: 612:. Retrieved 598: 586:. Retrieved 573: 552: 542: 530:. Retrieved 517: 505:. Retrieved 495: 483:. Retrieved 478: 469: 463:. L2/02-279. 450: 410:. Retrieved 362:Word divider 340: 325: 288: 225:ISO/IEC 6429 197: 195: 179: 163: 129: 121: 116: 105: 103: 82: 78: 68: 59: 46: 28: 26: 2257:Punctuation 2154:SignWriting 2023:Old Sogdian 1993:Nandinagari 1917:Khwarezmian 1827:Dives Akuru 1753:Ancient and 1739:Warang Citi 1604:Pau Cin Hau 1559:New Tai Lue 1554:Nag Mundari 1529:Medefaidrin 1238:Common and 1047:Equivalence 1025:code points 1023:On pairs of 937:Equivalence 812:Word joiner 802:Soft hyphen 718:Code points 503:. comsci.us 367:Word joiner 352:Hard hyphen 336:e-mail spam 243:ISO 8859-11 209:hexadecimal 39:SOFT HYPHEN 29:soft hyphen 2272:Whitespace 2262:Typography 2251:Categories 2048:Phoenician 2033:Old Uyghur 2028:Old Turkic 2013:Old Permic 2008:Old Italic 1958:Manichaean 1852:Glagolitic 1629:Saurashtra 1374:Devanagari 1253:Diacritics 1010:UTF-EBCDIC 913:Algorithms 906:Processing 843:Characters 767:Characters 523:"Glossary" 383:References 235:ISO 8859-1 152:ISO 8859-1 132:plain text 87:line break 2043:Ź¼Phags-pa 2038:Palmyrene 1988:Nabataean 1912:Khudawadi 1897:Kharosthi 1812:Cuneiform 1787:Bhaiksuki 1782:Bassa Vah 1649:Sundanese 1624:Samaritan 1539:Mongolian 1514:Malayalam 1479:Kirat Rai 1189:Anomalies 1173:ISO 15924 1168:DIN 91379 1069:Z-variant 1052:Homoglyph 925:Collation 377:Word wrap 268:­ 186:man pages 43:­ 31:(Unicode 2175:Currency 2149:Duployan 2123:Vithkuqi 2118:Ugaritic 1973:Meroitic 1943:Mahajani 1928:Linear B 1923:Linear A 1714:Tifinagh 1679:Tai Viet 1674:Tai Tham 1664:Tagbanwa 1579:Ol Chiki 1469:Kayah Li 1464:Katakana 1449:Javanese 1444:Hiragana 1434:Hanunuoo 1409:Gurmukhi 1399:Gujarati 1389:Georgian 1364:Cyrillic 1354:Cherokee 1319:Bopomofo 1299:Balinese 1294:Armenian 1158:GB 18030 975:Punycode 863:Numerals 795: / 708:Versions 634:Symantec 608:Slashdot 559:. ITSCJ/ 485:7 August 346:See also 239:ISO 8859 2083:Soyombo 2073:Sogdian 2068:Siddham 2063:Sharada 1983:Multani 1963:Marchen 1953:Mandaic 1948:Makasar 1862:Grantha 1847:Elymaic 1842:Elbasan 1817:Cypriot 1777:Avestan 1719:Tirhuta 1709:Tibetan 1654:Sunuwar 1639:Sinhala 1634:Shavian 1614:Ranjana 1594:Osmanya 1584:Ol Onal 1509:Lontara 1459:Kannada 1369:Deseret 1334:Burmese 1324:Braille 1314:Bengali 1268:Numbers 1229:Scripts 878:Symbols 868:Scripts 691:Unicode 684:Unicode 639:8 April 614:8 April 588:12 July 532:8 April 507:8 April 412:8 April 328:domains 138:-style 99:kerning 75:Unicode 55:hyphens 2230:  2219:  2128:Yezidi 2108:Todhri 2103:Tangut 1938:Lydian 1933:Lycian 1907:Khojki 1887:Kaithi 1867:Hatran 1857:Gothic 1807:Coptic 1797:Carian 1792:BrāhmÄ« 1734:Wancho 1699:Thaana 1694:Telugu 1689:Tangsa 1669:Tai Le 1659:Syriac 1619:Rejang 1494:Lepcha 1439:Hebrew 1419:Hangul 1344:Chakma 1289:Arabic 1263:Spaces 970:CESU-8 965:BOCU-1 873:Spaces 565:ISO-IR 254:MS-DOS 205:EBCDIC 148:EBCDIC 45:)) or 36: 34:U+00AD 2202:Emoji 2098:Takri 2058:Runic 1998:Ogham 1832:Dogra 1684:Tamil 1589:Osage 1564:NĆ¼shu 1499:Limbu 1489:Latin 1474:Khmer 1454:Kanji 1429:Hanja 1394:Greek 1384:GeŹ½ez 1379:Garay 1329:Buhid 1309:Batak 1304:Bamum 1284:Adlam 1132:Input 1110:Fonts 1105:Email 1093:Usage 995:UTF-8 990:UTF-7 985:UTF-1 836:Lists 753:Plane 726:Block 557:(PDF) 438:(PDF) 312:LaTeX 298:groff 294:troff 182:groff 156:VT100 136:VT100 1978:Modi 1892:Kawi 1762:Ahom 1724:Toto 1704:Thai 1574:Odia 1549:N'Ko 1349:Cham 1115:HTML 1062:list 980:SCSU 731:List 641:2011 616:2011 590:2011 567:-40. 561:IPSJ 534:2011 509:2011 487:2022 414:2011 332:URLs 310:and 296:and 277:HTML 260:SGML 252:(an 248:IBM 150:and 73:and 71:HTML 1729:Vai 1544:Mru 1484:Lao 783:BOM 548:DIN 527:IBM 334:in 330:or 308:TeX 216:DIN 198:SHY 108:by 81:or 2253:: 1744:Yi 632:. 606:. 563:. 525:. 477:. 459:. 440:. 422:^ 405:. 391:^ 338:. 316:\- 314:: 302:\% 300:: 231:.) 188:. 177:. 161:. 676:e 669:t 662:v 643:. 618:. 592:. 536:. 511:. 489:. 416:. 304:. 41:(

Index


coded character sets
hyphens
HTML
Unicode
line break
word wrapping
zero-width space
kerning
Gerard Manley Hopkins
plain text
VT100
terminal emulators
page description languages
EBCDIC
ISO 8859-1
VT100
terminal emulators
terminal emulator
space character
whitespace characters
groff
man pages
EBCDIC
hexadecimal
DIN
C1 control code set
ISO/IEC 6429
Operating System Command (OSC)
ISO 8859-1

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.

ā†‘