Knowledge (XXG)

:Google Books and Knowledge (XXG) - Knowledge (XXG)

Source đź“ť

252:
are less likely to be carried by retail bookstores or public libraries. Outside of GB, the vast majority of academic books remain inaccessible to anyone who does not live near a major research university with a library that allows public access. Only the wealthiest scholars are location independent in that they can afford the extravagant convenience of buying copies of all potentially relevant books and articles, sight unseen, either as hard copy books shipped to their location or digital works downloaded through paywalls, and then sorting out later which works are actually relevant to a specific research topic. Most researchers still need to visit libraries to download digital works under institutional licenses, or they have to first read through hard copy books in libraries to determine whether they are relevant and then make scans of only the pages they actually need.
103:
to the hard copy versions of their works, but in 2023 withdrew those previews and provided alternative page previews generated from e-book versions. Wolters Kluwer does provide in-line notations in its e-book versions to indicate hard copy pagination, but the e-book versions are not visually identical to the hard copy versions (that is, not set in the exact same typefaces) and accompanying illustrations are often either omitted or distorted.
212:
preview" annotation when specifically searched for. But their contents never appear in GB search results, even when searching for specific text already known to be present in those books. This requires researchers exploring a particular topic to ascertain whether the leading publishers specializing in that topic have already locked up their content away from GB in commercial databases, and then conduct separate searches of those databases.
110:
device for a very long period of time (several months). Some books have working links and bookmarks allowing for jumps to later chapters, but some do not. It is usually possible to jump to a specific page to begin exhaustion of preview pages from that location by editing the URL (e.g., "&pg=PA100" jumps to page 100), but GB does not educate users about this or provide a user interface to facilitate direct page jumps.
31: 255:
GB is often the only way for the general public in one country to become aware of relevant content from small commercial publishers in other countries which is unlikely to be exported or purchased by libraries or bookstores outside of those publishers' home countries. Only the largest publishers are
117:
Searches within particular books are buggy. For some books, GB provides links from snippets displayed in search results to the corresponding pages. For other books, snippets are displayed as search results but no links are made available, even when the book's pages can be otherwise previewed on GB.
235:
When it works, GB is marvelously convenient. Traditionally, it takes at least 15 to 30 minutes for an able-bodied researcher who lives next to a research library to visit the library, find and retrieve a cited hard copy book in the library's collection, and verify whether a citation to that book is
102:
Publishers can withdraw permissions to view book previews at any time. For example, book previews of most publications by Cengage disappeared in 2023, meaning that links to pages in those books now redirect to "About this book" pages. Wolters Kluwer formerly provided page previews visually identical
251:
GB is often the easiest way for the general public to directly access book content from many university presses, which are often more generous than commercial publishers in allowing GB to search and display lengthy previews of their books. Books from university presses on obscure academic subjects
194:
GB shifts content while you are not looking. For example, a link to a 1985 edition might in the future become the 2019 edition because the publisher released a new edition. This is good for book sellers and publishers, because the newest edition of the book is always at the same Google Book ID. For
113:
Searches across GB's entire book database are buggy. When publishers submit multiple editions of a book, GB will often link first to the e-book edition which almost always lacks original pagination. Users must then explore the various editions linked from the e-book edition page, to determine if a
109:
Book previews of copyrighted works are restricted to a certain number of pages. It is easy to exhaust that number from a given IP address and device while scrolling through or searching the contents of the book, and then GB will not display any additional pages to that particular IP address and/or
211:
GB search is far less comprehensive than it may initially appear. Many commercial publishers with their own online databases behind paywalls will not allow GB to index and search the contents of their books. GB usually knows of the existence of those books, in that book titles appear with a "no
98:
GB links are unstable and prone to disappear. An estimated 15% of GB links on Knowledge (XXG) are dead (404). An even larger percentage of the page previews no longer work and redirect to the "About this book" page. Google is not a library nor archive for long term preservation. Books can and do
247:
When publishers allow GB to index and search their books but not display pages in full, the snippets returned by GB in search results can still be useful. Those snippets can reveal obscure sources which would have been much harder to find through traditional research methods—usually because the
243:
GB has greatly expanded access to page images of public domain books. Early e-book projects focused on optical character recognition or manual keyboarding of text, but did not capture full page images. When it works, GB is a wonderful resource for historical researchers who need to see full
190:
offering. It's impossible to know whether in the future an URL which currently works for everyone will become subject to registration or payment: for instance, certain buttons to download a (public domain) book in PDF or EPUB format have changed their positions and requirements multiple
329:
is creating five thousand or more new book scans a day (as of 2023). It is their stated goal to scan every book cited on every Knowledge (XXG). Most of the new books being scanned are modern, but they already have more Public Domain books than Google. More info at
236:
accurate (i.e., that the book actually states the specific assertion at issue at the cited page). The time burden is much higher for those persons who do not live next to a library, or are physically disabled and have difficulty navigating
156:
The ostensible reason for user monitoring is to allow Google to respect the contracts it has with publishers, which require Google to make life miserable for readers; however, some such requirements are Google's own creation, see next
431:, or to get their links converted and corrected. This is largely a matter of developing the user interface and tools, but the existing wikitext can be improved as well: adding unique identifiers to citations always helps. 356:
books for preservation purposes, and may still provide access to our digital collections. We may continue to display “short portions” of books as is consistent with fair use — for example, Knowledge (XXG) references (as
337:
Internet Archive also offers a full text search which is superior to that of Google Books, because it indexes content which is restricted by Google, and because the context of the matches is easier to understand.
361:). The injunction does not affect lending of out-of-print books. And of course, the Internet Archive will still make millions of public domain texts available to the public without restriction." 106:
Book previews are not equally accessible to all internet users. People living in other countries may not be able to view a page in "preview" that you were able to when you used it as a citation.
208:
When searching for a term inside a book from a commercial publisher, it only displays results for those pages that are available for preview. It is not a comprehensive book search.
186:
Google Books tries to make users register a Google account and access books only while logged in, both to make user monitoring easier and to direct users to its paid
496:"Web accessibility and technology protection measures: Harmonizing the rights of persons with cognitive disabilities and copyright protections on the web" 456: 248:
particular assertion at issue is a digression from the main topic which would not have been captured in titles, subject classifications, or synopses.
422:
homepage, enter a search term into the search box (not wayback machine search, the other search box). Choose the radio button "search inside text".
240:. Now one can link directly to the relevant page on GB, and if the link is still working, anyone else can verify the citation in a few seconds. 334:
and other sources. Internet Archive is a non-profit library and archive; its URLs and links are significantly more stable and understandable.
95:
GB is a commercial book seller. Its parent company is in the business of making money. The following problems all arise from this core truth.
349: 170: 46:
It contains the advice or opinions of one or more Knowledge (XXG) contributors. This page is not an encyclopedia article, nor is it one of
47: 150: 375:: As of 20 May 2020, this project has over 62,000 items in its collection of free eBooks, created from texts in the public domain. 228:
A 48-hour EventStream poll showed about 400 new Google Books links being added per day to the English Knowledge (XXG) (Feb 2020).
198:
GB books have free preview for some books but as a commercial book seller they have no interest in freely lending books with CDL (
140:
This is also a problem for accessibility. Libraries like the Internet Archive have specific services for the visually impaired.
122: 205:
GB in 2020 started offering "new" GB which ignores significant portions of existing URLs resulting in different final results.
114:
hardcover or paperback version was also made available with original pagination suitable for direct citation and direct links.
369:
is a non-profit archive with stable links. Most of the PD books at Google are also available there, and at Internet Archive.
130: 232:
The core strength of Google is search and this is true with Books. It is easy to find a citation for a given search term.
341: 546: 358: 437:
Internet Archive received millions of uploads by users. Volunteer MediaWiki developers have helped in the past, with
178: 445: 345: 199: 412: 304: 480: 215:
Google Books ingests low-quality AI-generated books, some of which are trained on Knowledge (XXG) itself. See
137:
and other web archives often fail to archive even the Google page previews specifically linked from articles.
285:
Google Books search inside a book is fast. However, see above for why this can result in incomplete searches.
316:
Lacks Controlled Digital Lending. Free previews are great, but viewing the complete book for free is better.
256:
able to regularly promote books to an international audience and arrange for distribution and localization.
149:
and requires the user to run proprietary JavaScript. All its users are monitored for various purposes and
298: 51: 331: 310: 282:
For public domain books, Google Books sometimes provides cleaner and smaller PDFs than other providers.
61: 216: 399:: it has over 20 million, comparable in size to Google Books, and it is larger in some collections. 39: 17: 494:
Giannoumi, G. Anthony; Land, Molly; Beyene, Wondwossen Mulualem; Blanck, Peter (May 31, 2017).
372: 507: 326: 118:
This forces users to manually edit URLs as explained above in order to preview those pages.
348:
service which lends books in full. In August 2023, a negotiated judgement was reached. See
382: 134: 444:
Be on the lookout for materials at risk, which you may upload to the Internet Archive.
270: 174: 162:
Google Books makes governments and public entities sign contracts which go against the
540: 263: 237: 163: 146: 54:. Some essays represent widespread norms; others only represent minority viewpoints. 259: 83: 166:
by stating that Google has an exclusive right on the scans for a number of years.
527: 366: 187: 126: 452: 415:). Note that Google indexes only a small part of the Internet Archive content. 403: 378: 266:
links Google Books very prominently. More than 90% of users use Google Search.
381:
has over four million articles over 72 languages. All are public domain or
173:
and long-standing Wikimedia policy statements and goals: see WMF policy on
353: 512: 495: 350:"What the Hachette v. Internet Archive Decision Means for Our Library" 195:
Knowledge (XXG), this causes havoc with page references and citations.
396: 16:
For information on linking to Google Books in Knowledge (XXG), see
457:
It's No Secret - Millions of Books Are Openly in the Public Domain
438: 500:
Cyberpsychology: Journal of Psychosocial Research on Cyberspace
25: 276:
Problems are hidden. Most users are unaware of these issues.
273:—the more Google Books links we have, the more we will have. 133:, archiving Google Books is hard and sometimes impossible: 402:
A simple way to search the Internet Archive via Google or
301:. Commercial book seller vs. non-profit archival library. 69: 419: 307:. Links that break create problems with verification. 478:
Especially what copyright laws euphemistically call
455:
before they make their copies of books open access:
427:
It needs to be easy for the user to link the better
82:
The document helps explain why we prefer not to use
352:, which states: "The Internet Archive may still 279:In-line search-term highlighting is very nice. 8: 446:Local newspapers are vanishing very quickly 262:: especially when searching rare keywords, 511: 288:There is often nothing better available. 244:illustrations and original page layouts. 48:Knowledge (XXG)'s policies or guidelines 528:2010 contract with the Italian ministry 471: 313:. Links should be reliable and stable. 344:. The lawsuit primarily concerns the 7: 171:Bridgeman Art Library v. Corel Corp. 52:thoroughly vetted by the community 14: 428: 151:privacy concerns regarding Google 86:(GB) where better options exist. 418:More in-depth searching: At the 395:Search the Internet Archive for 293:Why we should stop when possible 145:In general, Google Books is not 29: 23:Essay on editing Knowledge (XXG) 217:"Google Books Indexes AI Trash" 123:digital restrictions management 451:Hathi Trust conducts thorough 1: 90:Why Google Books isn't good 563: 202:) like other providers do. 200:Controlled Digital Lending 59: 15: 177:and Wikimedia chapters' 269:Force of habit and the 547:Knowledge (XXG) essays 434:Expand the libraries! 99:disappear at any time. 397:books and periodicals 340:Internet Archive was 50:, as it has not been 408:<search term: --> 224:Why we use it anyway 89: 420:https://archive.org 179:statement of intent 513:10.5817/CP2017-1-5 481:technical measures 359:shown in the image 342:sued by publishers 175:commons:COM:PD-Art 169:This goes against 453:copyright reviews 373:Project Gutenberg 80: 79: 554: 531: 526:For example the 524: 518: 517: 515: 491: 485: 476: 410: 409:site:archive.org 327:Internet Archive 305:WP:Verifiability 72: 33: 32: 26: 562: 561: 557: 556: 555: 553: 552: 551: 537: 536: 535: 534: 525: 521: 493: 492: 488: 477: 473: 468: 407: 392: 383:freely licensed 323: 295: 226: 219:(April 4, 2024) 135:Wayback Machine 92: 76: 75: 68: 64: 56: 55: 30: 24: 21: 12: 11: 5: 560: 558: 550: 549: 539: 538: 533: 532: 519: 486: 470: 469: 467: 464: 463: 462: 461: 460: 449: 442: 432: 425: 424: 423: 416: 391: 388: 387: 386: 376: 370: 364: 363: 362: 338: 332:Wired Magazine 322: 319: 318: 317: 314: 308: 302: 294: 291: 290: 289: 286: 283: 280: 277: 274: 271:network effect 267: 257: 253: 249: 245: 241: 238:library stacks 233: 225: 222: 221: 220: 213: 209: 206: 203: 196: 192: 184: 183: 182: 160: 159: 158: 143: 142: 141: 119: 115: 111: 107: 104: 100: 96: 91: 88: 78: 77: 74: 73: 65: 60: 57: 45: 44: 36: 34: 22: 13: 10: 9: 6: 4: 3: 2: 559: 548: 545: 544: 542: 529: 523: 520: 514: 509: 505: 501: 497: 490: 487: 483: 482: 475: 472: 465: 458: 454: 450: 447: 443: 440: 436: 435: 433: 430: 426: 421: 417: 414: 405: 401: 400: 398: 394: 393: 389: 384: 380: 377: 374: 371: 368: 365: 360: 355: 351: 347: 343: 339: 336: 335: 333: 328: 325: 324: 320: 315: 312: 309: 306: 303: 300: 297: 296: 292: 287: 284: 281: 278: 275: 272: 268: 265: 264:Google Search 261: 258: 254: 250: 246: 242: 239: 234: 231: 230: 229: 223: 218: 214: 210: 207: 204: 201: 197: 193: 189: 185: 180: 176: 172: 168: 167: 165: 164:public domain 161: 155: 154: 152: 148: 147:free software 144: 139: 138: 136: 132: 128: 124: 120: 116: 112: 108: 105: 101: 97: 94: 93: 87: 85: 71: 67: 66: 63: 58: 53: 49: 43: 41: 35: 28: 27: 19: 522: 503: 499: 489: 479: 474: 429:alternatives 321:Alternatives 299:WP:AFFILIATE 260:Market power 227: 84:Google Books 81: 37: 390:How to help 367:Hathi Trust 311:WP:Link rot 188:Google Play 127:geoblocking 38:This is an 404:DuckDuckGo 379:Wikisource 129:and other 18:WP:GBOOKS 541:Category 354:digitize 131:barriers 62:Shortcut 413:example 153:apply. 121:Due to 70:WP:GBWP 191:times. 157:point. 506:(1). 466:Notes 40:essay 508:doi 439:BUB 346:CDL 543:: 504:11 502:. 498:. 406:: 125:, 530:. 516:. 510:: 484:. 459:. 448:. 441:. 411:( 385:. 181:. 42:. 20:.

Index

WP:GBOOKS
essay
Knowledge (XXG)'s policies or guidelines
thoroughly vetted by the community
Shortcut
WP:GBWP
Google Books
digital restrictions management
geoblocking
barriers
Wayback Machine
free software
privacy concerns regarding Google
public domain
Bridgeman Art Library v. Corel Corp.
commons:COM:PD-Art
statement of intent
Google Play
Controlled Digital Lending
"Google Books Indexes AI Trash"
library stacks
Market power
Google Search
network effect
WP:AFFILIATE
WP:Verifiability
WP:Link rot
Internet Archive
Wired Magazine
sued by publishers

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.

↑