Knowledge (XXG)

:Search engine test - Knowledge (XXG)

Source đź“ť

1981:
search on "Taco Bell" will give only a couple of pages from tacobell.com even though many in that domain will certainly match. Further, Google's list of distinct results is constructed by first selecting the top 1000 results and then eliminating duplicates without replacements. Hence the list of distinct results will always contain fewer than 1000 results regardless of how many webpages actually matched the search terms. For example, as of 14 December 2010, from the about 742 million pages related to "Microsoft", Google was returning 572 "distinct" results.. Caution must be used in judging the relative importance of websites yielding well over 1000 search results.
2719:—Meert observes that "The temptation to find a quick retort means that, many times, people don't bother to check the source carefully." and that "people will look for a specific phrase that may be taken out-of-context to support their argument". He states that it is "dangerous and irresponsible to think that we can Google away a complex discussion" and that he has "learned long ago that there is no substitute for detailed research on a topic". 66: 190:. This facilitates research by offering an immediate variety of applicable options. Possibly useful items on the results list include the source material or the electronic tools that a web site can provide, such as a dictionary, but the list itself, as a whole, can also indicate important information. However, discerning that information may require insight. 1740:– Some sources are accessible to all, but many are payment only, or not reported online. This may, for example, affect the search results you get for a historical topic that achieved its peak media prominence 50 or 100 years ago; valid sources may very well exist, but would be found on microfilms or subscription news archiving sites like 150: 1791:– Sometimes other sites clone Knowledge (XXG) content, which is then passed around the Internet, and more pages built up based upon it (and often not cited), meaning that in reality the source of much of the search engine's findings are actually just copies of Knowledge (XXG)'s own previous text, not genuine sources. 2067:, so there may also be many results returned that lead to a page that only serves as an advertisement. Sometimes pages contain hundreds of keywords designed specifically to attract search engine users to that page, but in fact serve an advertisement instead of a page with content related to the keyword. 1662:
hits (less than 1000) the actual count of hits needed to reach the bottom of the last page of results may be more accurate, but even this is not a sure thing. Google returns different search results depending on factors such as your previous search history and on which Google server you happen to hit.
1969:
Note also, that the number of search string matches reported by search engines is only an estimate. For example, Google will only calculate the actual number of matches once the user navigates through all result pages, to the last one, and even then it places restrictions on the figure. At times, the
1185:
Google has options to specify web sites to search or not search, and where in the page to search. These are able to be added to the end of any search and will restrict the locations Google will report matches from. Examples of useful searches, using "(Atom OR Bomb)" as the example text being searched
1040:
by someone who can't remember the spelling. Again, they could equally search using connected terms (Google: bitch womb spay open closed antibiotic – all terms associated with the veterinary condition pyometra). The odds are good someone else has already misspelt it like you did and it's been indexed,
669:
The single most useful search engine tool may be the use of quotation marks to find an exact match for a phrase. However, a search engine such as Google has both an easy, and an advanced search with further search options. The advanced search makes it easier to enter advanced options, that may help
2049:, or Macromedia Flash, or where a website is displayed as part of an image. Search engines also can not listen to podcasts or other audio streams, or even video mentioning a search term. Similarly, search engines cannot read PDF files consisting of photoscans or look inside compressed (.zip) files. 1915:
name, for instance, needs to be searched for in the original script, which is easily done with Google (provided one knows what to search for), but problems may arise if – for example – English, French and German webpages transcribe the name using different conventions. Even for English-only webpages
1879:
in Arabic will likely find pages which reflect a different bias than an English speaker searching in English on the same subject, since popular and media views and beliefs about homosexuality can differ widely between English-speaking countries (US, UK, Australia, etc.) that tend to include a higher
2263:
works well for fields that are paper-oriented and have an online presence in all (or nearly all) respected venues. This search engine is a good complement for the commercially available Thompson ISI Web of Knowledge, especially in the areas which are not well covered in the latter, including books,
2052:
Forums, membership-only and subscription-only sites (since Googlebot does not sign up for site access) and sites that cycle their content are not cached or indexed by any search engine. With more sites moving to AJAX/Web 2.0 designs, this limitation will become more prevalent as search engines only
1579:
used to be less susceptible to manipulation by self-promoters, but with the advent of pseudo-news sites designed to collect ad revenues or to promote specific agendas, this test is often no more reliable than others in areas of popular interest, and indexes many "news" sources that reflect specific
665:
pages for further information as search engines' capabilities and operation often differ. Note that if you are signed in to a Google account when searching on Google then this may affect the results that you get, based on your search history. Also be sure to check "Languages for Displaying (Search)
2041:
by sites that do not wish their content to be indexed or cached by Google. Sites that contain large amounts of copyrighted content (Image galleries, subscription newspapers, webcomics, movies, video, help desks), usually involving membership, will block Google and other search engines. Other sites
1550:
has a pattern of coverage that is in closer accord with traditional encyclopedia content than is the Web, taken as a whole; if it has systemic bias, it is a very different systemic bias from Google Web searches. Multiple hits on an exact phrase in Google Book search provide convincing evidence for
2335:
Several generalized search engines exist. These adapt your query to many search engines. Web browsers offer a choice of search engines to choose to employ for the search box, and these can be used one at a time to experiment with search results. Meta-search engines use several search engines at
1661:
In the case of Google (and other search engines such as Bing and Yahoo!), the hit count at the top of the page is unreliable and should usually not be reported. The hit count reported on the penultimate (second-to-last) page of results may be slightly more accurate. For searches with few reported
717:
Since this isn't in quotes, Google looks for pages containing all of these terms. It finds all pages that contain "john" and "smith". This will return pages that contain "john smith", "john michael smith" but also pages that contain both terms separately, such as "The secretary, john arnold, and
298:
a variety of common search engines. The distinct advantages of each are their user interface and, less obviously, their algorithms for compiling and searching their own indexes. Because a web crawler can be blocked—specific ones or just in general—different search engines can list different web
1980:
For search terms that return many results, Google uses a process that eliminates results which are "very similar" to other results listed, both by disregarding pages with substantially similar content and by limiting the number of pages that can be returned from any given domain. For example, a
1657:
demonstrate notability or non-notability, case by case. Hit counts have always been, and very likely always will remain, an extremely erroneous tool for measuring notability, and should not be considered either definitive or conclusive. A manageable sample of results found should be opened
2268:
algorithm utilised by Google Scholar demonstrated that this search engine, as well as its commercial analogs, provides an adequate information about popularity of some concrete source, although that does not automatically reflect the real scientific contribution of concrete publication.
1761:– Search engines exclude a vast number of pages, and this may include systematic bias so that some matters are excluded disproportionately (for example, because they are commonly visible on sites that do not allow Google indexing, or the content for technical reasons cannot be indexed ( 1720:– Biased towards information from Internet-using developed countries and affluent parts of society (internet access). Countries where computer use is not so common will often have lower rates of reference to equally notable material, which may therefore appear (mistakenly) non-notable. 635:
matching as a partial match, as well as other Madonna references not related to the painting, the results of a Google or Bing search result count will be disproportionate as compared to any equally notable Renaissance painting. To exclude partial matches when Googling for the phrase,
2279:, is the original broadly based search engine, originating over four decades ago and indexing even earlier papers. Thus, especially in biology and medicine, PubMed "associated articles" is a Google Scholar proxy for older papers with no on-line presence. E.g., The journal 2070:
Hit counts reported by Google are only estimates, which in some cases have been shown to necessarily be off by nearly an order of magnitude, especially for hit counts above a few thousands. For such common words as to yield several thousand Google hits, freely available
1956:
A search like this requires a certain linguistic competence which not every individual Wikipedian possesses, but the Knowledge (XXG) community as a whole includes many bilingual and multilingual people and it is important for nominators and voters on AfD at least to
731:
The name is in double quotes. Google will look for pages containing the exact expression "John smith", or the two words next to each other ("The author was John. Smith was the composer..."). But it won't pick up name variants such as "John M. Smith".
2753: 1936:-endings or other grammatical variations not obvious for someone who does not know the language. Names from many cultures are traditionally given together with titles that are considered part of the name, but may also be omitted (as in 490:
Guarantee that the results reflect the uses you mean, rather than other uses. (E.g., a search for a specific John Smith may pick up many "John Smiths" who aren't the one meant, many pages containing "John" and "Smith" separately,
1596:
Topics alleged to be notable by popular reference can have the type of reference, and popularity, checked. An alleged notable issue that only has a few hundred references on the Internet may not be very notable; truly popular
1588:
provides evidence of how many times a publication, document, or author has been cited or quoted by others. Best for scientific or academic topics. Can include Masters and Doctorate thesis papers, patents, and legal documents.
450:
For example, a Google search for "the green goldfish", with quotes, in 2021 initially reports around 209,000 results, yet on paging through to the last search results page shows the returned number of hits to be 303. See also
1615:
Alternative spellings and usages can have their relative frequencies checked (e.g., for a debate which is the more common of two equally neutral and acceptable terms). Google Trends can compare usage in the "News" category
1555:. Google Book search can locate print-published testimony to the importance of a person, event, or concept. It can also be used to replace an unsourced "common knowledge" fact with a print-sourced version of the same fact. 851:
references will be about the US president, it makes sense to rule out all pages with that word, or even tighter, even though some pages may contain both references to non-presidential george bushes and the word president.
205:. Discerning the reliability of the source material is an especially core skill for using the web, while the wiki itself only facilitates the creation of multiple drafts. As presentations and deletions progress, this 1861:, but it may only be with careful research that it is revealed there are medical peer-reviewed assessments of the former, and that people are usually not allergic to fur, but to the sticky skin and saliva particles ( 411:
Depending on the subject matter, and how carefully it is used, a search engine test can be very effective and helpful, or produce misleading or non-useful results. In most cases, a search engine test is a first-pass
555:
before using or citing it. Less reliable sources may be unhelpful, or need their status and basis clarified, so that other readers gain a neutral and informed understanding to judge how reliable the sources are.
2754:
http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&Sect2=HITOFF&u=%2Fnetahtml%2FPTO%2Fsearch-adv.htm&r=1&p=1&f=G&l=50&d=PTXT&S1=6615209.PN.&OS=pn/6615209&RS=PN/6615209
579:
a source of neutral titles – only of popular ones. Neutrality is mandatory on Knowledge (XXG) (including deciding what things are called) even if not elsewhere, and specifically, neutrality trumps popularity.
2770:
Baroni, Marco and Ueyama, Motoko (2006) Building general- and special-purpose corpora by Web crawling, Proceedings of the 13th NIJL International Symposium Language Corpora Their Compilation and Application.
704:). An expression is given in "double quote" marks, and expressions can be grouped with parentheses. Expressions are not usually case-sensitive. So the following are all valid texts to search for, on Google: 1601:
can have millions or even tens of millions of references. However note that in some areas, a notable subject may have very few references; for example, one might only expect a handful of references to some
1953:, the spelling and rendering of older names may allow dozens of variations for the same person. A simplistic search for one particular variant may underrepresent the web presence by an order of magnitude. 1652:
A raw hit count should never be relied upon to prove notability. Attention should instead be paid to what (the books, news articles, scholarly articles, and web pages) is found, and whether they actually
2007:
Many, probably most, of the publicly available web pages in existence are not indexed. Each search engine captures a different percentage of the total. Nobody can tell exactly what portion is captured.
2264:
conference papers, non-American journals, the general journals in the field of strategy, management, international business, English language education and educational technology. The analysis of the
452: 1371:
from Argentina requires a much longer search string in order to eliminate a flood of results from his tennis namesake (see above): Simply click the link then add the positive and negative match terms
2030:
Google, like all Internet search engines can only find information that has actually been made available on the Internet. There is still a sizable amount of information that is not on the Internet.
1063:
may pull up many unhelpful answers, such as companies with these initials. So it is likely that a person who wants to look up this item and doesn't know much already, will have to search like this:
2737:—Turner points out that "that something gets hits on Google does not make it correct" and gives several examples of things that are incorrect that garner thousands of hits on Google search results. 605:
Raw "hit" (search result) count is a very crude measure of importance. Some unimportant subjects have many "hits", some notable ones have few or none, for reasons discussed further down this page.
2746:
Thelwall, M. (2008). Extracting accurate and complete results from search engines: Case study Windows Live. Journal of the American Society for Information Science and Technology, 59(1), 38–50.
1903:
Often for items of non-English origin, or in non-Latin scripts, a considerably larger number of hits result from searching in the correct script or for various transcriptions—be sure to check "
1714:– Tendency to be more receptive to beliefs that one is familiar with, agrees with, or are common in one's daily culture, and to discount beliefs and views that contradict one's preferred views. 1580:
points of view. The news archive goes back many years but may not be free beyond a limited period. News results often include press releases, which are not neutral, independent sources.
2764:
Nakov, Preslav and Hearst, Marti (2005). A Study of Using Search Engine Page Hits as a Proxy for n-gram Frequencies, Proceedings of Recent Advances in Natural Language Processing 2005
1830: 2384: 403:
If an unsourced addition to an article appears plausible, consider taking a moment to use a suitable search engine to find a reliable source before deciding whether to revert.
612:, without further discussion of the type of hits, what's been searched for, how it was searched, and what interpretation to give the results. On the other hand, examining the 1512:
Specialized searches work on the same principles and same basic search expressions as the above, but might be used to check in specialized archives, or with unusual options.
1163:
will specify that the search terms must appear in the page's URL itself, not just as a term on the page. This is mostly helpful for blogs and news sites that use blog-based
2291: 458:
Search more specifically within certain websites, or for combined and alternative phrases (or excluding certain words and phrases that would otherwise confuse the results).
2110:
Via Internet Archive you have proof that some information regarding "Impact of Advances in Computer Technology in Evidence Processing" existed on the Internet. Yet today
2099:
site. It is very graphics heavy, providing Google with little to nothing to look for and many missing pages in the Internet Archive version. So while you can bring up the
2027:
website is an example; although a search engine can find its main page, one can only search its database of individual patents by entering queries into the site itself.
1775:
seeking to influence site position, popularity, and ratings in such searches, or sell advertising space related to searches and search positions. Some subjects, such as
2146:
The most common search engines are Google, Bing, Yahoo, and DuckDuckGo but the most useful search engine, which depend on a context, may not be the most common ones.
1298:
from (Spanish-speaking) Argentina, research how his name is spelled in reliable English sources. The search results should include articles with the word "tennis" but
2114:
A program known to be part of the 2002 Economic Crime Summit Conference and at one time was listed on a website on the Internet currently cannot be found by Google.
1977:
A site-specific search may help determine if most of the matches are coming from the same web site; a single web site can account for hundreds of thousands of hits.
291:
This page describes both these web search tests and the web search tools that can help develop Knowledge (XXG), and it describes their biases and their limitations.
2403:
For example, if there are 16 hits at Google Books under one name, and 24 under another, there is only a 70% confidence that the second name is actually more common.
794:) means: exclude pages that contain this term. The danger is that pages will be excluded because of a term that actually has nothing to do with the search in hand. 2758:
Thelwall, M. (2008). Quantitative comparisons of search engine results, Journal of the American Society for Information Science and Technology, 59(11), 1702–1710.
2740:
Thelwall, M. (2008). Quantitative comparisons of search engine results, Journal of the American Society for Information Science and Technology, 59(11), 1702–1710.
964:("zytox is the worlds leading producer of widgets" OR "merger with IBM in 1929" OR "exports radar components to over fifty countries") NOT Knowledge (XXG) NOT wiki 35: 1552: 568:. Knowledge (XXG) does. Google indexes self-created pages and media pages which do not have a neutrality policy. Knowledge (XXG) has a neutrality policy that is 1669:
Article scope: If narrow, fewer references are required. Try to categorize the point of view, whether it is NPoV, or other; e.g., notice the difference between
551:
Search engine tests may return results that are fictitious, biased, hoaxes or similar. It is important to consider whether the information used derives from
2696:
Maslov, S.; Redner, S. (2008). Promise and pitfalls of extending Google's PageRank algorithm to citation networks. Journal of Neuroscience, 28, 11103–11105
2287: 623:
Additionally, search engines do not disambiguate, and tend to match partial searches. (However, as described below, you can eliminate partial matches by
1609:
Topics alleged to be genuine can be checked to test if they are referenced by reliable independent sources; this is a good test for hoaxes and the like.
1347: 2053:
simulate following the links on a web page. AJAX page setups (like Google Maps) dynamically return data based on real-time manipulation of JavaScript.
2024: 1531: 2724: 1535: 1240:(This is a good way to avoid a deluge of results which are all either from Knowledge (XXG), or from copies and mirrors of Knowledge (XXG) articles.) 956:, references to start-up under three common terms that might be used, and other words that hopefully will be commonly related to start-up in Linux. 2379: 1023:) and knows some terms it might be associated with but can't remember the term itself. Use associated terms to try and find pages that mention it. 2080: 2038: 907:. Last, pages containing references related to food and cooking are explicitly excluded, since most references to "flavor" will be of this kind. 448:
Note, however, that Google searches may report vastly more hits than will ever be returned to the user, especially for exact quoted expressions.
2535: 2349: 1697:
In most cases, search results should be reviewed with an awareness and careful skepticism before relying upon them. Common biases include:
82: 1314: 495:
miss out all the useful references indexed under "J. Smith" or, if the term is put in quotes, "John Michael Smith" and "Smith, John")
1688:, it may be on 700 pages and might still not be considered 'existing' enough to show any notability, for Knowledge (XXG)'s purposes. 1680:
Article subject: If it's about some historical person, one or two mentions in reliable texts might be enough; if it's some Internet
2797: 1880:
proportion of homosexuality-accepting groups, and Arabic-speaking countries (Middle East) that tend to include a lower proportion.
765:
be given in upper case) to find possible alternate spellings when it isn't clear whether or not words are joined by page authors.
2802: 670:
your searching. The following collapsible sections cover basic examples and help for using search engines with Knowledge (XXG).
428:
A search engine can index pages and text which others have placed on the internet, just like a big index at the back of a book.
1894:, have a different systemic bias from Google Web searches and give an interesting cross-check and a somewhat independent view. 46: 31: 1617: 468:
Guarantee the results are reliable or "true" (search engines index whatever text people choose to put online, true or false).
74: 2656: 2023:
are formatted by a Web server when a user requests them and as such cannot be indexed by conventional search engines. The
1932:
languages should take into account that arriving at the total number of hits may require searching for forms with varying
2107:
is even worse as that was in three places and none of the archived links tells you anything about the papers presented.
1539: 1368: 1295: 673:
Specialized search engines such as medical paper archives have their own specialized search structure not covered here.
2792: 2063:
Google and other popular search engines are also a target for search engine "search result enhancement", also known as
1812: 1674: 2019:, estimated at over 3 trillion pages, exists within databases whose contents the search engines do not index. These 2457: 52:"Knowledge (XXG):GOOGLETEST" redirects here. For the argument about "many google hits" in deletion discussions, see 2104: 2016: 1772: 952:
This search looks for pages that contain references to Linux, references to the two most common boot loaders with
38:. For templates that create clickable Google search links to search multiple reliable sources simultaneously, see 1164: 1080:
Using those pages to find the correct term is "deoxyribonucleic acid", sometimes written "deoxyribo-nucleic acid"
39: 2759: 2741: 168: 2747: 1563:
or other date-stamped media can help establish the timing and context of early references to a word or phrase.
519:
Provide the latest research in depth to the same extent as journals and books, for rapidly developing subjects.
334:
amongst others. Several generalized search engines exist. These adapt your query to many search engines. See
661:. Similar approaches will work in many other search engines, and other Google searches, but always read their 2710: 2412: 1785:
varies; some sites accept any information, while others have some form of review or checking system in place.
2423: 2076: 2511: 816:
There are many references and you want to narrow down the search by excluding less likely page suggestions.
284: 53: 2324: 2123: 213:. Depending on the type of query and kind of search engine, this variety can open up to a single author. 2732: 2100: 2045:
Search engines also might not be able to read links or metadata that normally requires a browser plugin,
86: 2588: 2556: 2356: 1102: 632: 508:
instance of a piece of text and not a reprint, excerpt, quotation, misquotation, or copyright violation.
2129: 2064: 1990: 1730:(some matters may be given far more space and others far less, than fairly represents their standing): 1635: 903:
the other. Also the page must contain some other words likely to be related to subatomic physics, thus
379: 96: 2481: 2083:(for American English) can provide a more accurate estimate of the relative frequencies of two words. 2680:
van Aalst, Jan. (2010) Using Google Scholar to Estimate the Impact of Journal Articles in Education.
2060:
that may cause it to return more results for a specific search term than exist actual content pages.
896: 628: 345: 2317: 2217: 2034: 1937: 1776: 1564: 538:
and deciding what they really show. Appearance in an index alone is not usually proof of anything.
2437: 641: 2667:
Harzing, A. W. K.; van der Wal, R. (2008). Google Scholar as a new source for citation analysis?
2042:
may also block Google due to the stress or bandwidth concerns on the server hosting the content.
1971: 1332: 988:
If this text is copied from a website, a search like this will often help to locate the source.
983: 2468: 1077:– using words commonly associated with that meaning of DNA, to get pages covering that meaning. 2337: 2228: 484: 2284: 1489:
To find sites from a given country (more likely to end with that country's initials, such as
631:
is certainly an encyclopedic and notable entry, it's not a pop culture icon. However, due to
593: 585: 565: 523: 210: 2634: 2020: 1917: 1835: 950:
start-up (or boot) process, but doesn't know where on the net to look for reliable sources.
888: 476: 1209:
Only report pages from websites ending in "wikipedia.org", Knowledge (XXG) in any language
1199:
Only report pages from websites ending in "en.wikipedia.org", the English Knowledge (XXG).
847:
You want references to George Bush, but not the one who's the president. Given that 90% of
440:
Confirm "who's reported to have said what" according to sources (useful for neutral citing)
226: 17: 2601: 2569: 2448:
Avoid inauthor:"Books, LLC", as LLC 'publishes' raw printouts of Knowledge (XXG) articles.
2366: 1912: 1808: 1779:, are so dominated by these that searches cannot be reliably used to establish popularity. 1727: 1434:, to find pages on a website (or not on the website) with the given expression in a title 892: 884: 81:
It explains concepts or processes used by the Knowledge (XXG) community. It is not one of
2765: 2376:, a way to filter sites from Google search to remove sites which mirror Wikimedia content 552: 202: 2657:
http://web.archive.org/web/20011212161658/http://www.summit.nw3c.org/Programs_Agenda.htm
1350:
Simply click the template-generated link then add the positive and negative match terms
970:
Looks for any of three memorable phrases from a suspected copyright violation, which do
809:
There is a clear expression or term and a page that contains that meaning probably will
2373: 2260: 2190: 2167: 2012: 1961:
and not make untoward assumptions when language or transcription bias may be a factor.
1745: 1584: 299:
sites, and there are more web sites available by URL than are indexed in any database.
2057: 1180:
Specialized options, including searches to include or exclude Knowledge (XXG) itself.
873:(flavor OR flavour) (quark OR quantum OR physics) -eat -food -drink -cooking -culinary 609: 589: 198: 194: 2786: 2489:
Proceedings of the 10th international conference on Current trends in web engineering
2171: 2163: 1876: 1598: 1559: 1526: 1019:
A search for someone who wants to find what the molecule which reproduces is called (
658: 480: 417: 353: 349: 179: 2725:"Argumentum ad Googlum; Why Getting a Million Hits on Google Doesn't Prove Anything" 498:
Guarantee you aren't missing crucial references through choice of search expression.
2772: 2639: 2620: 2232: 2096: 1798: 1546: 1051: 501:
Guarantee that little-mentioned or unmentioned items are automatically unimportant.
277:– Identify the names used for things (including alternative names and terminology). 246: 2361:, a template designed to help with Google Books, News archive and Scholar searches 2297: 2213: 2072: 1950: 1921: 1850: 1762: 1612:
Copyright violations from websites can often be identified (as described above).
1603: 1571: 861: 848: 331: 2615: 1551:
the real use of the phrase or concept. You can compare usage of terms, such as
1223:
end with "wikipedia.org", i.e. pages that are NOT on a Knowledge (XXG) website
1109:
they might know others, including useful words that might help narrow it down.
45:"Knowledge (XXG):Set" redirects here. For the set index article guideline, see 2548: 2236: 2175: 2103:, the overview link that would tell you who presented what does not work. The 1933: 1929: 1925: 1819: 1530:
can allow you to find which rendering of a word or name is most searched for,
201:) source material, depending on their reliability. There is a high demand for 2760:
http://www.scit.wlv.ac.uk/~cm1993/papers/SearchEngineComparisons_preprint.doc
2742:
http://www.scit.wlv.ac.uk/~cm1993/papers/SearchEngineComparisons_preprint.doc
1849:
are likely to be more reported. For example, there may be many references to
2748:
http://www.scit.wlv.ac.uk/~cm1993/papers/2007_Accurate_Complete_preprint.doc
2220:(how web pages looked and their contents, at different times or if deleted) 1823: 1685: 1681: 1468: 1393: 1303: 413: 2046: 1348:
Sources for Facundo ArgĂĽello on Google, excluding language(es)/country(ar)
986:
but also other wikis, which are not the sorts of sites we're looking for.
2583:
More, Alvin; Murray, Brian H. (2000). "Sizing the Internet". Cyveillance.
2340:
can add a search engine or a meta-search engine to your list of choices.
2301: 2265: 2208: 1741: 1670: 1037: 572:
and applies to all articles, and all article-related editorial activity.
319: 303: 187: 183: 30:"Knowledge (XXG):Google" redirects here. For the Google WikiProject, see 1328:. It's possible to greatly simplify such a search by using the template 940:
linux (grub OR lilo) (boot OR startup OR "start-up") kernel init process
879:
An example of a more complex search. The author is looking for the term
2309: 2272: 2186: 1854: 323: 1970:"match" count estimate can be significantly different (by one or more 1726:– May disproportionally represent some matters, especially related to 1392:
Find pages which link to a particular page, such as Knowledge (XXG)'s
933:, and parentheses, which can be used to make quite detailed searches. 437:
Provide information and lead to pages that assist with the above goals
2276: 1862: 1494: 341: 2752:
Gomes, et al. (2000). Detecting query-specific duplicate documents.
1478:
To find pages that are official US or UK government sources (end in
1471:
terminology that are not self-published by Microsoft (not ending in
1272:
the Google search that you performed, so that others can repeat it.
294:
The advantages of a specific search engine can be distinguished by
1455:
Site inclusion/exclusion is often very useful to get views either
1444:
Specify that the page's URL must contain a particular expression.
947: 352:, but aims for generality where it can. For example, it describes 858:, and one has a second exclusion to rule out pages with the term 1974:) to the total count of results shown on the last results page. 327: 1924:) may have to be searched for both including and excluding the 925:
Google allows all sorts of combinations of words, expressions,
608:
Hit-count numbers alone can only rarely "prove" anything about
2491:. Computer Science and Engineering Division, Waseda University 2313: 1875:– For example, an Arabic speaker searching for information on 1858: 1095:("she's got" OR "she has") "do right by me" ticket ride lyrics 1020: 144: 60: 2092: 1606:
matter, and some matters will not be reflected online at all.
1590: 1313:
etc., like Spanish Knowledge (XXG)), omit web sites with the
1002:
Finding vaguely remembered information and unfamiliar terms.
357: 2385:
Knowledge (XXG):You can't fix Google through Knowledge (XXG)
2283:
puts papers on-line back through 1970s. For this 1978 paper
1122:
Searches restricted to news, newsgroups, and other sources.
236: 34:. For how to influence the indexing of pages by Google, see 1891: 1658:
individually and read, to actually verify their relevance.
1576: 1132: 446:
Confirm roughly how popularly referenced an expression is.
365: 311: 209:
of choices for input tend to produce the desired objective—
2200: 1753:
General web search engines (Google, Bing web search etc.):
361: 2469:
Google Answers question on word frequency in news sources
2305: 1665:
Other useful considerations in interpreting results are:
255:– Decide whether a page should be nominated for deletion. 2482:"Reliability Verification of Search Engines' Hit Counts" 2033:
Google, like all major Web search services, follows the
1383:(and so on) to the search string and repeat the search. 1367:
To research the preferred spelling of the soccer player
2246: 2137: 1998: 1643: 1302:
the word "tenis" (the Spanish-language spelling), omit
854:
Two variations are shown; one looks for the expression
798:
always means "and also not" in Google. The best use of
475:
something is mentioned a lot, and that it isn't due to
394: 387: 307: 193:
Search engine results can help editors retain (what is
132: 125: 118: 111: 104: 2766:
http://biotext.berkeley.edu/papers/nakov_ranlp2005.pdf
739:"John Smith" OR "John M Smith" OR "John Michael Smith" 261:– Discover what sources (including websites) actually 2547:
Gulli, Antonio; Signorini, Alessio (28 August 2005).
1928:, and searches for names and other words in strongly 1829:
Urban legends are often reported widely, for example
1059:
An example of a problematic search. The obvious term
162:
it is you're measuring and what your measurement can
2458:
Google search for: AYB OR AYBABTU OR "All your base"
2148: 1839:
set sail in 1779, although the correct date is 1797.
1405:
Specify that the expression must appear in the HTML
1187: 1127: 1007: 934: 819: 705: 2549:"The Indexable Web is more than 11.5 billion pages" 1899:
Foreign languages, non-Latin scripts, and old names
1085:"Deoxyribonucleic acid" OR "Deoxyribo nucleic acid" 564:Google (and other search systems) do not aim for a 443:
Often provide full cited copies of source documents
2387:- for addressing errors in Google Knowledge Panels 1920:name. Personal names in other languages (Russian, 1822:will often report it spelt "El Nino", without the 1520:Specific uses of search engines in Knowledge (XXG) 1167:that use a lot of plain language in article URLs. 1013:biology reproduction cell nucleus chromosome helix 620:provide useful information related to notability. 36:Knowledge (XXG):Controlling search engine indexing 1916:there may be many variants of the same Arabic or 1575:can help assess whether something is newsworthy. 1438:allintitle: (atom NOT bomb) site:en.wikipedia.org 653:Search engine expressions (examples and tutorial) 235:– Identify a term's notability. (See for example 2773:http://tokuteicorpus.jp./result/pdf/2006_004.pdf 2243:Universities and higher education organisations 1041:so you can look up more information from there. 899:way, so the first expression is to look for one 542:Search engine tests and Knowledge (XXG) policies 271:– Review the reliability of facts and citations. 1620:), but this may not be reliable for older news. 315: 1890:Note that other Google searches, particularly 946:A person who wants to write an article on the 2105:2004 Economic Crime Summit Conference archive 1811:gives 10 times more results than the correct 588:for information on balancing the policies on 166:. Web searches test the understanding of the 8: 2669:Ethics in Science and Environmental Politics 2413:Google Search Operators and more search help 1706:General (the Internet or people as a whole): 1358:to the search string and repeat the search. 1105:"), for a person who knows some phrases and 1030:piometra OR pieometra OR pyametra OR pymetra 659:search expressions used in Google web search 487:, or self-promotion, rather than importance. 424:What a search test can do, and what it can't 2380:Knowledge (XXG):Google searches and numbers 2015:is at least 11.5 billion pages, but a much 1985:Search engine limitations – technical notes 1246:(atom OR bomb) NOT Knowledge (XXG) NOT wiki 841:George Bush NOT president NOT "White House" 27:Knowledge (XXG) how-to guide about sourcing 2480:Takuya, Funahashi; Hayato, Yamana (2010). 1542:, see also the Google Books example below. 1344:{{subst:google LC|Facundo Argüello|es|ar}} 1338:(though it does not auto-exclude the term 1176: 1118: 998: 974:appear on the same page as a reference to 916: 774: 675: 504:Guarantee that a particular result is the 336: 158:Measuring is easy. What's hard is knowing 2692: 2690: 2512:"Why Google Can't Count Results Properly" 2350:Knowledge (XXG):Advanced source searching 2025:United States Patent and Trademark Office 1905:Languages for Displaying (Search) Results 1818:A search for the most common spelling of 1304:Spanish-language web sites prefixed with 1139:Search for a term within a certain site: 1028:Search for a term with unknown spelling: 839:Search for a term with a 2nd meaning v3: 831:Search for a term with a 2nd meaning v2: 823:Search for a term with a 2nd meaning v1: 813:be relevant to the meaning you are after. 684:Most searches allow searching for words ( 535: 2433: 2431: 1505: 1501: 1490: 1483: 1479: 1472: 1447: 1437: 1427: 1423: 1419: 1412: 1398: 1380: 1376: 1372: 1355: 1351: 1339: 1325: 1321: 1316: 1310: 1305: 1282: 1265: 1261: 1257: 1253: 1245: 1234: 1226: 1219:Only report pages from websites that do 1212: 1202: 1160: 1156: 1148: 1140: 1094: 1084: 1074: 1067: 1060: 1047: 1029: 1012: 979: 975: 963: 953: 939: 930: 926: 904: 900: 880: 872: 859: 855: 840: 832: 824: 803: 799: 795: 791: 787: 779: 758: 746: 738: 724: 710: 701: 697: 693: 689: 685: 83:Knowledge (XXG)'s policies or guidelines 2396: 2318:Kent University Law Library and sources 2081:Corpus of Contemporary American English 1748:rather than in a general Google search. 1346:displays as a clickable external link: 700:), as well as excluding certain items ( 2597: 2586: 2565: 2554: 2300:online, in many countries, including: 1500:Or particular media publishers (e.g., 1463:websites. For example, it can be used 1399:link:http://en.wikipedia.org/Main_Page 920:Advanced searches and copyvio checks. 757:of these expressions. Note the use of 616:of hit arising (or their lack) often 586:WP:NPOV § Neutrality and Verifiability 455:to calculate statistical significance. 2101:2002 Economic Crime Summit Conference 1853:and confirming that people are often 747:"Ahmed Abu-Sayed" OR "Ahmed Abusayed" 694:"war on terror" OR "war on terrorism" 640:the phrase to be matched as follows: 372:Good-faith searching: a rule of thumb 7: 2249:(University websites search engine) 2112:Google cannot find that information! 1203:(atom OR bomb) site:en.wikipedia.org 1070:– finding that it has many meanings. 806:in Google) is in two circumstances: 85:, and may reflect varying levels of 2510:Sullivan, Danny (21 October 2010). 2056:Google has also been the victim of 1801:is often reported over correctness 1406: 1147:Search for a term in a site's URL: 302:The most common search engines are 1540:"Tidal wave" vs. "Tsunami" example 1227:(atom OR bomb) -site:wikipedia.org 47:Knowledge (XXG):Set index articles 32:Knowledge (XXG):WikiProject Google 25: 1965:Google distinct page count issues 1959:be aware of their own limitations 1618:"Tidal wave" vs "Tsunami" example 1213:(atom OR bomb) site:wikipedia.org 1011:Search for a vaguely known term: 627:the phrase to be matched): While 287:, and if so, check the licensing. 245:– Identify a spurious hoax or an 2723:Rich Turner (29 February 2004). 2715:Science, AntiScience and Geology 2225:Books and historical literature 1769:Search engines as promotion tool 1718:Cultural and computer-usage bias 1315:Argentine top-level domain name 1194:Enter a search string like this 883:, in the sense of a property in 666:Results" in "Search Settings".) 596:on how articles should be named) 575:As such, Google is specifically 148: 64: 1765:- or image-based websites etc.) 1320:, and omit pages that meantion 871:Narrow down widely used terms: 283:– Identify whether material is 2616:Quotes with and without quotes 2424:Search history personalization 2183:Professional research indexes 2079:(for British English) and the 1738:Sources not readily accessible 1141:"George Bush" site:www.bbc.com 203:reliability on Knowledge (XXG) 1: 2536:Google search for "Microsoft" 2095:site is a rather Google- and 1847:Popular views and perceptions 1285:on a specific list of sites. 905:(quark OR quantum OR physics) 340:below. This page mostly uses 2306:Library of Congress (THOMAS) 2296:There are a large number of 2292:lists 89 associated articles 1732:popularity is not notability 1101:A search for a song title (" 1046:Search for ambiguous terms: 982:, to weed out both a lot of 2798:Knowledge (XXG) editor help 2709:Joe Meert (30 April 2006). 2097:Internet Archive-unfriendly 1813:Charles Mountbatten-Windsor 1807:A search for the incorrect 1675:Ontology (computer science) 1534:(note: sports category) or 1256:, avoid pages that mention 887:. Sources may spell it the 833:"George Bush" NOT president 657:This section explains some 583: 18:Knowledge (XXG):Google test 2819: 2803:Knowledge (XXG) notability 2671:, vol. 8, no. 1, pp. 62–71 2331:Generalized search engines 2256:Specialized search engines 2127: 2121: 2087:Example of the limitations 2011:The estimated size of the 1988: 1633: 1553:"Tidal wave" vs. "Tsunami" 1413:allintitle: (atom OR bomb) 790:(in Google represented by 718:treasurer, mike smith..." 377: 316:Specialized search engines 265:for possible presentation. 94: 51: 44: 29: 2288:lists 100 citing articles 2189:(medical), science, law, 1459:a named website, or from 1233:Avoid pages that mention 1083:Doing a final search for 978:. Also excludes the term 825:George Bush NOT president 536:interpreting your results 197:) or delete (what is not 40:Template:Google templates 2647:posts linked from there. 2633:Liberman, Mark (2005), " 2065:search engine optimizers 1149:allinurl:bbc George Bush 594:WP:NPOV § Article naming 217:Some search engine tests 156:This page in a nutshell: 2711:"Argumentum ad Googlum" 2614:Mark Liberman (2009), " 2207:Historical archives of 2160:General search engines 2077:British National Corpus 2017:deeper (and larger) Web 1873:Language selection bias 1131:To search all news use 984:Knowledge (XXG) mirrors 337:§ Common search engines 2793:Knowledge (XXG) how-to 2682:Educational Researcher 2596:Cite journal requires 2564:Cite journal requires 2438:Google Search Settings 2325:list of search engines 2124:List of search engines 1294:For the tennis player 1075:DNA cell biology helix 642:"Madonna of the Rocks" 463:Search engines cannot: 227:Google's trending tool 2310:Indiana Supreme Court 2118:Common search engines 2093:Economic Crime Summit 1693:Biases to be aware of 1591:Google Scholar search 753:Looks for pages with 692:), and combinations ( 566:neutral point of view 532:cannot help you avoid 530:A search engine test 275:Names and terminology 2516:SearchEngineLand.com 2336:once. A web browser 2218:Search engine caches 2058:redirection exploits 1797:– Popular usage and 1625:Interpreting results 1565:Google Groups search 1448:inurl:(atom OR bomb) 1191:To search like this 1159:isn't enough, using 648:Using search engines 629:Madonna of the Rocks 592:and neutrality, and 73:This help page is a 2635:Questioning reality 2302:Library of Congress 2035:robots.txt protocol 1972:orders of magnitude 1777:pornographic actors 1326:Knowledge (XXG).org 1266:Knowledge (XXG).org 1155:If searching using 512:and search engines 432:Search engines can: 407:Search engine tests 237:Google's ngram tool 211:a neutral viewpoint 172:of Knowledge (XXG). 2374:Meta:Mirror filter 1892:Google Book Search 1795:Popular usage bias 479:, reposting as an 2286:, Google Scholar 2253: 2252: 2229:Project Gutenberg 2021:dynamic web pages 1831:hundreds of sites 1517: 1516: 1467:To find pages on 1453: 1452: 1241: 1175: 1174: 1171: 1170: 1117: 1116: 1113: 1112: 997: 996: 993: 992: 915: 914: 911: 910: 773: 772: 769: 768: 356:(usenet groups), 176: 175: 143: 142: 16:(Redirected from 2810: 2736: 2735:on 3 March 2016. 2731:. Archived from 2718: 2697: 2694: 2685: 2678: 2672: 2665: 2659: 2654: 2648: 2631: 2625: 2612: 2606: 2605: 2599: 2594: 2592: 2584: 2580: 2574: 2573: 2567: 2562: 2560: 2552: 2544: 2538: 2533: 2527: 2526: 2524: 2522: 2507: 2501: 2500: 2498: 2496: 2486: 2477: 2471: 2466: 2460: 2455: 2449: 2446: 2440: 2435: 2426: 2421: 2415: 2410: 2404: 2401: 2370: 2360: 2149: 2140: 2001: 1836:USS Constitution 1833:report that the 1646: 1408: 1387: 1369:Facundo Argüello 1362: 1345: 1337: 1331: 1296:Facundo Argüello 1289: 1276: 1252:Find the phrase 1239: 1188: 1177: 1128: 1119: 1008: 999: 962:Copyvio search: 935: 917: 820: 775: 706: 690:war on terrorism 688:), expressions ( 679:Basic searches. 676: 597: 553:reliable sources 397: 390: 152: 151: 145: 135: 128: 121: 114: 107: 68: 67: 61: 21: 2818: 2817: 2813: 2812: 2811: 2809: 2808: 2807: 2783: 2782: 2722: 2708: 2705: 2703:Further reading 2700: 2695: 2688: 2679: 2675: 2666: 2662: 2655: 2651: 2632: 2628: 2613: 2609: 2595: 2585: 2582: 2581: 2577: 2563: 2553: 2546: 2545: 2541: 2534: 2530: 2520: 2518: 2509: 2508: 2504: 2494: 2492: 2484: 2479: 2478: 2474: 2467: 2463: 2456: 2452: 2447: 2443: 2436: 2429: 2422: 2418: 2411: 2407: 2402: 2398: 2394: 2364: 2354: 2346: 2333: 2290:, while PubMed 2258: 2197:News and media 2144: 2143: 2136: 2132: 2126: 2120: 2089: 2005: 2004: 1999:WP:GOOGLELIMITS 1997: 1993: 1987: 1967: 1909:Search Settings 1901: 1809:Charles Windsor 1773:industry exists 1728:popular culture 1703: 1695: 1650: 1649: 1642: 1638: 1632: 1627: 1522: 1507: 1503: 1492: 1486:, accordingly), 1485: 1481: 1474: 1449: 1439: 1429: 1425: 1421: 1414: 1400: 1386: 1382: 1378: 1374: 1361: 1357: 1353: 1343: 1341: 1335: 1329: 1327: 1323: 1322:Knowledge (XXG) 1318: 1312: 1307: 1288: 1284: 1275: 1267: 1263: 1259: 1258:Knowledge (XXG) 1255: 1247: 1236: 1235:Knowledge (XXG) 1228: 1214: 1204: 1162: 1158: 1150: 1142: 1096: 1086: 1076: 1069: 1062: 1049: 1031: 1014: 981: 977: 976:Knowledge (XXG) 965: 955: 941: 932: 928: 906: 902: 885:quantum physics 882: 874: 865: 857: 842: 834: 826: 805: 801: 797: 793: 789: 781: 760: 748: 740: 726: 712: 703: 702:Bush NOT George 699: 695: 691: 687: 655: 650: 603: 562: 549: 544: 514:often will not: 426: 423: 409: 401: 400: 393: 386: 382: 374: 219: 169:WP:Five pillars 149: 139: 138: 131: 124: 117: 110: 103: 99: 91: 90: 65: 57: 50: 43: 28: 23: 22: 15: 12: 11: 5: 2816: 2814: 2806: 2805: 2800: 2795: 2785: 2784: 2780: 2776: 2775: 2768: 2762: 2756: 2750: 2744: 2738: 2720: 2704: 2701: 2699: 2698: 2686: 2673: 2660: 2649: 2626: 2607: 2575: 2539: 2528: 2502: 2472: 2461: 2450: 2441: 2427: 2416: 2405: 2395: 2393: 2390: 2389: 2388: 2382: 2377: 2371: 2362: 2352: 2345: 2342: 2332: 2329: 2323:See also this 2275:, now part of 2261:Google Scholar 2257: 2254: 2251: 2250: 2244: 2240: 2239: 2226: 2222: 2221: 2211: 2204: 2203: 2198: 2194: 2193: 2191:Google Scholar 2184: 2180: 2179: 2161: 2157: 2156: 2153: 2142: 2141: 2133: 2128: 2119: 2116: 2113: 2088: 2085: 2013:World Wide Web 2003: 2002: 1994: 1989: 1986: 1983: 1966: 1963: 1960: 1941:Mustafa Kemal 1900: 1897: 1896: 1895: 1882: 1881: 1870: 1868: 1844: 1843: 1842: 1841: 1840: 1827: 1816: 1792: 1789:Self-mirroring 1786: 1783:Review process 1780: 1766: 1750: 1749: 1746:Newspapers.com 1735: 1721: 1715: 1702: 1701:General biases 1699: 1694: 1691: 1690: 1689: 1678: 1656: 1648: 1647: 1639: 1634: 1631: 1628: 1626: 1623: 1622: 1621: 1613: 1610: 1607: 1604:archaeological 1599:Internet memes 1594: 1585:Google Scholar 1581: 1568: 1556: 1543: 1521: 1518: 1515: 1514: 1510: 1509: 1498: 1487: 1476: 1462: 1458: 1451: 1450: 1445: 1441: 1440: 1435: 1433: 1416: 1415: 1410: 1407:<title: --> 1402: 1401: 1396: 1389: 1388: 1384: 1364: 1363: 1359: 1301: 1291: 1290: 1286: 1278: 1277: 1273: 1271: 1249: 1248: 1243: 1230: 1229: 1224: 1222: 1216: 1215: 1210: 1206: 1205: 1200: 1196: 1195: 1192: 1182: 1181: 1173: 1172: 1169: 1168: 1152: 1151: 1144: 1143: 1136: 1135: 1124: 1123: 1115: 1114: 1111: 1110: 1108: 1103:Ticket to Ride 1098: 1097: 1090: 1089: 1088: 1087: 1081: 1078: 1071: 1056: 1055: 1043: 1042: 1033: 1032: 1025: 1024: 1016: 1015: 1004: 1003: 995: 994: 991: 990: 973: 967: 966: 959: 958: 954:(grub OR lilo) 943: 942: 922: 921: 913: 912: 909: 908: 876: 875: 868: 867: 844: 843: 836: 835: 828: 827: 818: 817: 814: 812: 783: 782: 771: 770: 767: 766: 764: 756: 750: 749: 742: 741: 734: 733: 728: 727: 720: 719: 714: 713: 698:John AND Smith 681: 680: 654: 651: 649: 646: 602: 599: 578: 561: 558: 548: 545: 543: 540: 533: 528: 527: 520: 515: 510: 509: 502: 499: 496: 488: 469: 460: 459: 456: 449: 444: 441: 438: 425: 422: 408: 405: 399: 398: 395:WP:GOOGLECHECK 391: 383: 378: 373: 370: 358:Google Scholar 289: 288: 278: 272: 266: 256: 250: 240: 230: 218: 215: 174: 173: 153: 141: 140: 137: 136: 129: 122: 115: 108: 100: 95: 92: 80: 79: 71: 69: 26: 24: 14: 13: 10: 9: 6: 4: 3: 2: 2815: 2804: 2801: 2799: 2796: 2794: 2791: 2790: 2788: 2781: 2778: 2774: 2769: 2767: 2763: 2761: 2757: 2755: 2751: 2749: 2745: 2743: 2739: 2734: 2730: 2726: 2721: 2716: 2712: 2707: 2706: 2702: 2693: 2691: 2687: 2683: 2677: 2674: 2670: 2664: 2661: 2658: 2653: 2650: 2646: 2642: 2641: 2636: 2630: 2627: 2623: 2622: 2617: 2611: 2608: 2603: 2590: 2579: 2576: 2571: 2558: 2550: 2543: 2540: 2537: 2532: 2529: 2517: 2513: 2506: 2503: 2490: 2483: 2476: 2473: 2470: 2465: 2462: 2459: 2454: 2451: 2445: 2442: 2439: 2434: 2432: 2428: 2425: 2420: 2417: 2414: 2409: 2406: 2400: 2397: 2391: 2386: 2383: 2381: 2378: 2375: 2372: 2368: 2363: 2358: 2353: 2351: 2348: 2347: 2343: 2341: 2339: 2330: 2328: 2326: 2321: 2319: 2315: 2311: 2307: 2303: 2299: 2298:law libraries 2294: 2293: 2289: 2285: 2282: 2278: 2274: 2270: 2267: 2262: 2255: 2248: 2245: 2242: 2241: 2238: 2234: 2230: 2227: 2224: 2223: 2219: 2215: 2212: 2210: 2206: 2205: 2202: 2199: 2196: 2195: 2192: 2188: 2185: 2182: 2181: 2177: 2173: 2169: 2165: 2162: 2159: 2158: 2154: 2151: 2150: 2147: 2139: 2135: 2134: 2131: 2125: 2117: 2115: 2111: 2108: 2106: 2102: 2098: 2094: 2086: 2084: 2082: 2078: 2074: 2068: 2066: 2061: 2059: 2054: 2050: 2048: 2043: 2040: 2036: 2031: 2028: 2026: 2022: 2018: 2014: 2009: 2000: 1996: 1995: 1992: 1984: 1982: 1978: 1975: 1973: 1964: 1962: 1958: 1954: 1952: 1947: 1945: 1944: 1940: 1935: 1931: 1927: 1923: 1919: 1914: 1910: 1906: 1898: 1893: 1889: 1888: 1887: 1886: 1878: 1877:homosexuality 1874: 1871: 1866: 1864: 1860: 1856: 1852: 1848: 1845: 1838: 1837: 1832: 1828: 1825: 1821: 1817: 1814: 1810: 1806: 1805: 1803: 1802: 1800: 1796: 1793: 1790: 1787: 1784: 1781: 1778: 1774: 1770: 1767: 1764: 1760: 1757: 1756: 1755: 1754: 1747: 1743: 1739: 1736: 1733: 1729: 1725: 1722: 1719: 1716: 1713: 1712:Personal bias 1710: 1709: 1708: 1707: 1700: 1698: 1692: 1687: 1683: 1679: 1676: 1672: 1668: 1667: 1666: 1663: 1659: 1654: 1645: 1641: 1640: 1637: 1629: 1624: 1619: 1614: 1611: 1608: 1605: 1600: 1595: 1592: 1587: 1586: 1582: 1578: 1574: 1573: 1569: 1566: 1562: 1561: 1560:Google Groups 1557: 1554: 1549: 1548: 1544: 1541: 1537: 1533: 1529: 1528: 1527:Google Trends 1524: 1523: 1519: 1513: 1499: 1496: 1488: 1477: 1473:microsoft.com 1470: 1466: 1465: 1464: 1460: 1456: 1446: 1443: 1442: 1436: 1431: 1418: 1417: 1411: 1409:of the page. 1404: 1403: 1397: 1395: 1391: 1390: 1385: 1370: 1366: 1365: 1360: 1349: 1334: 1319: 1308: 1299: 1297: 1293: 1292: 1287: 1280: 1279: 1274: 1269: 1251: 1250: 1244: 1242: 1232: 1231: 1225: 1220: 1218: 1217: 1211: 1208: 1207: 1201: 1198: 1197: 1193: 1190: 1189: 1184: 1183: 1179: 1178: 1166: 1154: 1153: 1146: 1145: 1138: 1137: 1134: 1130: 1129: 1126: 1125: 1121: 1120: 1106: 1104: 1100: 1099: 1092: 1091: 1082: 1079: 1072: 1065: 1064: 1058: 1057: 1053: 1045: 1044: 1039: 1036:A search for 1035: 1034: 1027: 1026: 1022: 1018: 1017: 1010: 1009: 1006: 1005: 1001: 1000: 989: 985: 971: 969: 968: 961: 960: 957: 949: 945: 944: 937: 936: 924: 923: 919: 918: 898: 894: 890: 886: 878: 877: 870: 869: 866: 863: 856:"George Bush" 850: 846: 845: 838: 837: 830: 829: 822: 821: 815: 810: 808: 807: 785: 784: 777: 776: 762: 754: 752: 751: 744: 743: 736: 735: 730: 729: 722: 721: 716: 715: 708: 707: 683: 682: 678: 677: 674: 671: 667: 664: 660: 652: 647: 645: 643: 639: 634: 630: 626: 621: 619: 615: 611: 606: 600: 598: 595: 591: 590:verifiability 587: 581: 576: 573: 571: 567: 559: 557: 554: 547:Verifiability 546: 541: 539: 537: 531: 525: 521: 518: 517: 516: 513: 507: 503: 500: 497: 494: 489: 486: 482: 481:internet meme 478: 474: 470: 467: 466: 465: 464: 457: 454: 447: 445: 442: 439: 436: 435: 434: 433: 429: 421: 419: 418:rule of thumb 415: 406: 404: 396: 392: 389: 385: 384: 381: 376: 371: 369: 367: 363: 359: 355: 354:Google Groups 351: 347: 343: 339: 338: 333: 329: 325: 321: 317: 313: 309: 305: 300: 297: 292: 286: 282: 279: 276: 273: 270: 267: 264: 260: 257: 254: 251: 248: 244: 241: 238: 234: 231: 228: 224: 221: 220: 216: 214: 212: 208: 204: 200: 196: 191: 189: 185: 181: 180:search engine 171: 170: 165: 161: 157: 154: 147: 146: 134: 130: 127: 126:WP:GOOGLETEST 123: 120: 116: 113: 109: 106: 102: 101: 98: 93: 88: 84: 78: 76: 70: 63: 62: 59: 55: 54:WP:GOOGLEHITS 48: 41: 37: 33: 19: 2779: 2777: 2733:the original 2728: 2714: 2681: 2676: 2668: 2663: 2652: 2645:Language Log 2644: 2643:; and other 2640:Language Log 2638: 2629: 2621:Language Log 2619: 2610: 2589:cite journal 2578: 2557:cite journal 2542: 2531: 2519:. Retrieved 2515: 2505: 2493:. Retrieved 2488: 2475: 2464: 2453: 2444: 2419: 2408: 2399: 2357:Find sources 2334: 2322: 2295: 2280: 2271: 2259: 2233:Google Books 2145: 2109: 2090: 2075:such as the 2073:text corpora 2069: 2062: 2055: 2051: 2044: 2032: 2029: 2010: 2006: 1979: 1976: 1968: 1955: 1948: 1942: 1938: 1908: 1904: 1902: 1884: 1883: 1872: 1846: 1834: 1799:urban legend 1794: 1788: 1782: 1768: 1758: 1752: 1751: 1737: 1731: 1724:Undue weight 1723: 1717: 1711: 1705: 1704: 1696: 1664: 1660: 1651: 1583: 1570: 1558: 1547:Google Books 1545: 1525: 1511: 1454: 1356:-tenis -wiki 1238: 1052:cell biology 1050:(as in, the 987: 951: 897:Commonwealth 853: 725:"John Smith" 672: 668: 662: 656: 637: 624: 622: 617: 613: 607: 604: 582: 574: 569: 563: 550: 534:the work of 529: 511: 505: 492: 472: 462: 461: 431: 430: 427: 410: 402: 375: 366:Google Books 360:(academia), 335: 301: 295: 293: 290: 281:Copyrighting 280: 274: 268: 262: 258: 252: 247:urban legend 242: 232: 222: 206: 192: 177: 167: 163: 159: 155: 75:how-to guide 72: 58: 2214:Archive.org 2201:Google News 2037:and can be 1951:Old English 1922:Anglo-Saxon 1851:acupuncture 1577:Google News 1572:Google News 1381:-futbolista 1281:Search for 1133:Google News 862:White House 849:George Bush 362:Google News 344:instead of 269:Information 243:Genuineness 2787:Categories 2392:References 2237:Amazon.com 2176:DuckDuckGo 2122:See also: 1926:patronymic 1857:to animal 1804:Examples: 1420:allintitle 1324:or are on 1311:http://es. 1264:or are on 711:John Smith 610:notability 601:Notability 560:Neutrality 471:Guarantee 318:exist for 253:Notability 223:Popularity 199:verifiable 2598:|journal= 2566:|journal= 2209:web pages 2155:Examples 2047:Adobe PDF 1930:inflected 1824:diacritic 1682:neologism 1536:like this 1532:like this 1506:bbc.co.uk 1469:Microsoft 1461:any other 1430:) can be 1394:Main Page 1333:Google LC 1283:atom bomb 1254:atom bomb 1161:allinurl: 1054:meaning) 786:The term 570:mandatory 477:marketing 414:heuristic 380:Shortcuts 259:Existence 184:web pages 119:WP:GOOGLE 97:Shortcuts 87:consensus 2729:Grumbles 2684:39: 387. 2344:See also 2266:PageRank 2247:4icu.org 2130:Shortcut 1991:Shortcut 1949:Even in 1869:the fur. 1855:allergic 1759:Dark net 1742:ProQuest 1686:pop song 1671:Ontology 1636:Shortcut 1432:combined 1377:football 1093:Search: 1038:pyometra 938:Search: 889:American 745:Search: 737:Search: 723:Search: 709:Search: 506:original 493:and also 485:spamming 320:medicine 188:Internet 133:WP:GTEST 2314:FindLaw 2273:MedLine 2187:Medline 2039:blocked 1918:Russian 1820:El Niño 1644:WP:HITS 1630:General 1502:cnn.com 1484:.gov.uk 1270:link to 1073:Search 1066:Search 893:British 891:way or 778:Use of 761:(which 633:Madonna 625:quoting 524:neutral 324:science 207:variety 195:notable 186:on the 112:WP:GOOG 2367:Google 2338:plugin 2320:(UK). 2316:(US); 2281:Stroke 2277:PubMed 2172:Yahoo! 2164:Google 1913:Arabic 1911:". An 1907:" in " 1885:Other: 1867:within 1863:dander 1495:France 1428:-site: 1373:soccer 1352:tennis 1268:, and 1107:thinks 881:flavor 388:WP:GFG 364:, and 342:Google 310:, and 304:Google 285:copied 229:below. 225:– See 182:lists 105:WP:SET 2521:5 May 2495:5 May 2485:(PDF) 2178:etc. 2152:Type 2138:H:CSE 1943:Pasha 1771:– An 1763:Flash 1684:or a 1424:site: 1186:for: 1165:CMSes 1157:site: 948:Linux 638:quote 614:types 584:(See 350:Yahoo 312:Yahoo 296:using 263:exist 233:Usage 2602:help 2570:help 2523:2015 2497:2015 2235:and 2168:Bing 2091:The 1939:Gazi 1934:case 1673:and 1493:for 1482:and 1480:.gov 1457:from 1426:(or 1422:and 1354:and 1340:wiki 1262:wiki 980:wiki 802:(or 763:must 686:acid 663:help 618:does 453:here 416:or " 346:Bing 330:and 328:news 308:Bing 164:mean 160:what 2637:", 2618:", 1946:). 1859:fur 1744:or 1504:or 1491:.fr 1342:): 1300:not 1260:or 1221:not 1068:DNA 1061:DNA 1048:DNA 1021:DNA 972:not 931:NOT 811:not 800:NOT 796:NOT 788:NOT 780:NOT 755:any 577:not 522:Be 473:why 420:". 348:or 332:law 2789:: 2727:. 2713:. 2689:^ 2593:: 2591:}} 2587:{{ 2561:: 2559:}} 2555:{{ 2514:. 2487:. 2430:^ 2369:}} 2365:{{ 2359:}} 2355:{{ 2327:. 2312:, 2308:, 2304:, 2231:, 2216:, 2174:, 2170:, 2166:, 1865:) 1655:do 1538:. 1497:), 1475:), 1379:, 1375:, 1336:}} 1330:{{ 1317:ar 1306:es 1237:. 929:, 927:OR 901:OR 759:OR 696:; 644:. 483:, 368:. 326:, 322:, 314:. 306:, 239:.) 178:A 2717:. 2624:. 2604:) 2600:( 2572:) 2568:( 2551:. 2525:. 2499:. 1826:. 1815:. 1734:. 1677:. 1616:( 1593:. 1567:. 1508:) 1309:( 895:/ 864:" 860:" 804:- 792:- 526:. 249:. 89:. 77:. 56:. 49:. 42:. 20:)

Index

Knowledge (XXG):Google test
Knowledge (XXG):WikiProject Google
Knowledge (XXG):Controlling search engine indexing
Template:Google templates
Knowledge (XXG):Set index articles
WP:GOOGLEHITS
how-to guide
Knowledge (XXG)'s policies or guidelines
consensus
Shortcuts
WP:SET
WP:GOOG
WP:GOOGLE
WP:GOOGLETEST
WP:GTEST
WP:Five pillars
search engine
web pages
Internet
notable
verifiable
reliability on Knowledge (XXG)
a neutral viewpoint
Google's trending tool
Google's ngram tool
urban legend
copied
Google
Bing
Yahoo

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.

↑