Knowledge (XXG)

E-text

Source 📝

36: 245: 312:
file, but that it has no information beyond "the text itself"—no representation of bold or italics, paragraph, page, chapter, or footnote boundaries, etc. Michael S. Hart, for example, argued that this "is the only text mode that is easy on both the eyes and the computer". Hart made the correct point
338:
In consequence of this, such texts cannot be reliably re-formatted. A program cannot reliably tell where footnotes, headers or footers are, or perhaps even paragraphs, so it cannot re-arrange the text, for example to fit a narrower screen, or read it aloud for the visually impaired. Programs might
334:
might be omitted, or might simply appear as additional lines of text, perhaps with blank lines before and after (or not). An ornate separator line might be represented instead by a line of asterisks (or not). Chapter and sections titles, likewise, are just additional lines of text: they might be
313:
that proprietary word-processor formats made texts grossly inaccessible; but that is irrelevant to standard, open data formats. The narrow sense of "e-text" is now uncommon, because the notion of "just vanilla ASCII" (attractive at first glance), has turned out to have serious difficulties:
374:
output usually produces more information than this, such as the use of bold and italic. If this information is not kept, it is expensive and time-consuming to reconstruct it; more sophisticated information such as what edition you have, may not be recoverable at all.
382:, spaces, tabs, and the like: Spaces between words; two returns and 5 spaces for paragraph. The main difference from more formal markup is that "plain texts" use implicit, usually undocumented conventions, which are therefore inconsistent and difficult to recognize. 385:
The narrow sense of e-text as "plain vanilla ASCII" has fallen out of favor. Nevertheless, many such texts are freely available on the Web, perhaps as much because they are easily produced as because of any purported portability advantage. For many years
355:
relating to the text is sometimes included with an e-text, but there is by this definition no way to say whether or where it is preset. At best, the text of the title page might be included (or not), perhaps with centering imitated by indentation.
350:
the work. For example, is it the first or the tenth edition? Who prepared it, and what rights do they reserve or grant to others? Is this the raw version straight off a scanner, or has it been proofread and corrected?
186:
information, or not. An e-text may be an electronic edition of a work originally composed or published in other media, or may be created in electronic form originally. The term is usually synonymous with
335:
detectable by capitalization if they were all caps in the original (or not). Even to discover what conventions (if any) were used, makes each book a new research or reverse-engineering project.
367:, or even the simplest tables. This leads to endless practical problems: for example, if the computer cannot reliably distinguish footnotes, it cannot find a phrase that a footnote interrupts. 320:
or the accented vowels used in many European languages cannot be represented (unless awkwardly and ambiguously as "~n" "a'"). Asian, Slavic, Greek, and other writing systems are impossible.
464: 326:
Third, "e-texts" in this narrow sense have no reliable way to distinguish "the text" from other things that occur in a work. For example, page numbers,
323:
Second, diagrams and pictures cannot be accommodated, and many books have at least some such material; often it is essential to the book.
491: 452:
Reading and Writing the Electronic Book. Nicole Yankelovich, Norman Meyrowitz, and Andries van Dam. IEEE Computer 18(10), October 1985.
433: 288: 119: 255: 548: 231:, and other information in their texts, as well as in some cases (such as FRESS) supporting not just text but also graphics. 57: 371: 154:, and especially a document that is mainly text. For example, a computer-based book of art with minimal text, or a set of 301:
In some communities, "e-text" is used much more narrowly, to refer to electronic documents that are, so to speak, "plain
270: 100: 390:
strongly favored this model of text, but with time, has begun to develop and distribute more capable forms such as
72: 346:
Fourth, and a perhaps surprisingly important issue, a "plain-text" e-text affords no way to represent information
266: 46: 482: 428: 79: 53: 543: 359:
Fifth, texts with more complicated information cannot really be handled at all. A bilingual edition, or a
216: 86: 203:, have been around since long before the Internet, the Web, and specialized E-book reading hardware. 175: 302: 200: 147: 68: 553: 505: 423: 387: 379: 495: 413: 364: 360: 418: 224: 179: 316:
First, this narrow type of "e-text" is limited to the English letters. Not even Spanish
183: 537: 159: 509: 93: 204: 151: 327: 171: 163: 137: 35: 17: 340: 309: 228: 167: 155: 223:
appeared in the 1960s. These early systems made extensive use of formatting,
403: 212: 453: 352: 331: 526: 500: 477: 476:
Coombs, James H.; Renear, Allen H.; DeRose, Steven J. (November 1987).
208: 408: 188: 273:. Statements consisting only of original research should be removed. 378:
If actuality, even "plain text" uses some kind of "markup"—usually
305: 220: 391: 317: 162:, would not usually be called an "e-text". An e-text may be a 238: 29: 478:"Markup systems and the future of scholarly text processing" 211:
in the 1940s, while large-scale electronic text editing,
262: 343:
to guess at the structure, but this can easily fail.
308:". By this is meant not only that the document is a 60:. Unsourced material may be challenged and removed. 363:with footnotes, commentary, critical apparatus, 8: 528:Scholarly Electronic Publishing Bibliography 207:began developing an electronic edition of 499: 434:L'Association des Bibliophiles Universels 289:Learn how and when to remove this message 120:Learn how and when to remove this message 27:Any document that is read in digital form 445: 215:, and online reading platforms such as 454:http://dl.acm.org/citation.cfm?id=4407 7: 58:adding citations to reliable sources 25: 243: 227:, automatic tables of contents, 34: 45:needs additional citations for 1: 146:) is a general term for any 269:the claims made and adding 570: 483:Communications of the ACM 429:Distributed Proofreaders 142:"; sometimes written as 170:file, viewed with any 549:Electronic publishing 178:. An e-text may have 201:electronic documents 176:proprietary software 54:improve this article 501:10.1145/32206.32209 380:control characters 254:possibly contains 424:Online Books Page 388:Project Gutenberg 370:Even raw scanner 299: 298: 291: 256:original research 235:"Just plain text" 130: 129: 122: 104: 16:(Redirected from 561: 514: 513: 503: 473: 467: 462: 456: 450: 414:Electronic paper 365:cross-references 361:critical edition 294: 287: 283: 280: 274: 271:inline citations 247: 246: 239: 150:that is read in 125: 118: 114: 111: 105: 103: 62: 38: 30: 21: 569: 568: 564: 563: 562: 560: 559: 558: 534: 533: 523: 518: 517: 475: 474: 470: 465:Michael S. Hart 463: 459: 451: 447: 442: 419:Digital library 400: 295: 284: 278: 275: 260: 248: 244: 237: 197: 126: 115: 109: 106: 63: 61: 51: 39: 28: 23: 22: 18:Electronic text 15: 12: 11: 5: 567: 565: 557: 556: 551: 546: 536: 535: 532: 531: 522: 521:External links 519: 516: 515: 468: 457: 444: 443: 441: 438: 437: 436: 431: 426: 421: 416: 411: 406: 399: 396: 297: 296: 251: 249: 242: 236: 233: 196: 195:E-text origins 193: 160:scans of pages 128: 127: 42: 40: 33: 26: 24: 14: 13: 10: 9: 6: 4: 3: 2: 566: 555: 552: 550: 547: 545: 544:Books by type 542: 541: 539: 530: 529: 525: 524: 520: 511: 507: 502: 497: 493: 489: 485: 484: 479: 472: 469: 466: 461: 458: 455: 449: 446: 439: 435: 432: 430: 427: 425: 422: 420: 417: 415: 412: 410: 407: 405: 402: 401: 397: 395: 393: 389: 383: 381: 376: 373: 368: 366: 362: 357: 354: 349: 344: 342: 336: 333: 329: 324: 321: 319: 314: 311: 307: 304: 293: 290: 282: 272: 268: 264: 258: 257: 252:This section 250: 241: 240: 234: 232: 230: 226: 222: 218: 214: 210: 206: 202: 194: 192: 190: 185: 181: 177: 173: 169: 165: 161: 157: 153: 149: 145: 141: 139: 134: 124: 121: 113: 102: 99: 95: 92: 88: 85: 81: 78: 74: 71: –  70: 66: 65:Find sources: 59: 55: 49: 48: 43:This article 41: 37: 32: 31: 19: 527: 487: 481: 471: 460: 448: 384: 377: 369: 358: 347: 345: 337: 328:page headers 325: 322: 315: 300: 285: 276: 253: 205:Roberto Busa 199:E-texts, or 198: 152:digital form 143: 136: 132: 131: 116: 110:January 2013 107: 97: 90: 83: 76: 64: 52:Please help 47:verification 44: 494:: 933–947. 172:open source 156:photographs 538:Categories 440:References 341:heuristics 310:plain text 279:April 2015 263:improve it 229:hyperlinks 184:formatting 168:plain text 138:electronic 80:newspapers 554:Documents 404:Text file 332:footnotes 267:verifying 213:hypertext 182:or other 510:59941802 398:See also 353:Metadata 148:document 69:"E-text" 303:vanilla 261:Please 217:Augment 209:Aquinas 135:(from " 94:scholar 508:  490:(11). 409:e-book 339:apply 330:, and 225:markup 189:e-book 180:markup 164:binary 133:e-text 96:  89:  82:  75:  67:  506:S2CID 348:about 306:ASCII 221:FRESS 166:or a 144:etext 101:JSTOR 87:books 392:HTML 219:and 140:text 73:news 496:doi 492:ACM 372:OCR 265:by 174:or 158:or 56:by 540:: 504:. 488:30 486:. 480:. 394:. 191:. 512:. 498:: 318:ñ 292:) 286:( 281:) 277:( 259:. 123:) 117:( 112:) 108:( 98:· 91:· 84:· 77:· 50:. 20:)

Index

Electronic text

verification
improve this article
adding citations to reliable sources
"E-text"
news
newspapers
books
scholar
JSTOR
Learn how and when to remove this message
electronic
document
digital form
photographs
scans of pages
binary
plain text
open source
proprietary software
markup
formatting
e-book
electronic documents
Roberto Busa
Aquinas
hypertext
Augment
FRESS

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.