502:
They continue to be popular throughout the pandemic. From March 1st and onward, however, the “region” category Wiki pages get larger aggregate views. This shows how the attention shifted between these two content types, i.e., people continue to learn more information about the virus and, at the same time, track regional progress of transmissions. The “people” category content becomes popular from mid-March, even leading to larger aggregate views during days in April. Wiki pages on celebrity and public figures are generally popular within
Knowledge (XXG). As the virus progresses, more public figures become associated with COVID-19, diverging the public interest. The structured nature of Wikidata even allows us to understand how/why these people are associated with the disease. By using a
236:
450:
391:
110:
130:
507:
555:
646:
90:
120:
574:. The proportion of the “Region” category has rapidly increased since February with domestic spreading outside China. The “Virus” category had its peak in January and was gradually decreasing. The “People” has the highest variance which is also significant in the proportion. The peak related to Tom Hanks even has a significant proportion among all pages related to COVID-19.
36:
140:
100:
205:, the activity generated by these volunteers and readers also generates a considerable amount of data itself. For example, we can explore how many Knowledge (XXG) articles have been created about COVID-19 related topics. Which sources are cited in those articles? How many people had reviewed such articles? Which are the most visited pages?
251:
other that might come in the future. We analyzed the complex and diverse attention of the public during the COVID-19 pandemic from the browsing logs of
English Knowledge (XXG) pages. This post will feature findings on English content and patterns from other languages such as Korean, Italian, and Spanish will be revealed in our next post.
232:
leading the World Health
Organization (WHO) to declare it a pandemic. Countries like Iran, Italy, Spain, and the United States had seen over 50,000 confirmed cases (Fig 1). As of April 14, less than 90 days since the lockdown in China, COVID-19 has infected over 1,880,000 people and has killed more than 117,000 patients worldwide.
150:
501:
Next, we check how attention is divided across the four topical categories — virus, region, people, and others — by examining the aggregate view counts on these categories. The “virus” category Wiki pages altogether receive the most views during the initial phase until the end of
February (Figure 4).
250:
What kinds of information did people most seek on COVID-19? How did their attention change over time, as the number of patients quickly rose globally? How quickly were pages tracking regional cases updated? These are critical questions that help us better cope with the current pandemic as well as any
583:
The speed at which these regional pages are updated is unprecedented, it is as quick as any local CDC reports (which we will look at in future reports). Overall, the increasing attention from the virus to regional and people pages indicate that
Knowledge (XXG) has served multiple purposes during the
456:
Top 5 content Wiki pages. “2019–20 coronavirus pandemic” was viewed far more than the other pages. “2020 coronavirus pandemic in the United States” are getting more views with the spreading in the US. “Tom Hanks” had a sharp peak on March 12 because of his infection, but it soon lost attention. Most
397:
English
Knowledge (XXG) Views on Two Pandemic Pages. “Pandemic” as a general term had high attention only on March 11–12 when WHO declared the coronavirus outbreak a global pandemic. “2019–20 coronavirus pandemic” has been viewed steadily since January. The view counts had the steepest increase in a
361:
Note that this page is not linked to the COVID-19 Wikidata item and hence is considered “not relevant” in our analysis. Nonetheless, many people might arrived at this page either by searching for “pandemic” or by following the hyperlinks that lead to this page. There are other examples (particularly
231:
Coronavirus disease (COVID-19) is an infectious disease caused by a newly discovered coronavirus. The first case was observed in China in late 2019, quickly followed by an outbreak in nearby East Asian countries like South Korea. In a few weeks, outbreaks could be seen throughout Europe and
America,
561:
Stacked Chart and Ratio Chart by
Topical Category. (a) The total number of views on COVID-19-related pages had exponentially increased from late February to early March. In March, the daily views were varying in the range of 6–8 millions. “Virus” category was dominant at the beginning of spreading,
596:
The open data available on
Knowledge (XXG) allows researchers and the community, in general, to cope with the urgent information needs during crises. The huge demand for local and regional-specific content highlights the importance of having a distributed community of editors who can generate such
242:
Case
Statistics of COVID-19 in China, South Korea, Spain, and the US (right axis — log scale). These countries have outbreaks at different times. While the patient count increases at a smaller rate for China and South Korea by early March, Spain and the US show a sharp rise. On gray the number of
522:
English Knowledge (XXG) Views by Topical Category. We divide COVID-19 related Knowledge (XXG) pages into 4 categories; Virus, Region, People, and Others. “Virus” category was dominant at the beginning of spreading, but the “Region” category became dominant on March 1. “Region” category has its
584:
pandemic, from a reliable source to collect scientific facts about the virus as well as to serve regional breaking news, and to other less critical updates related to the people’s category. Meeting such dynamic attention would not have been possible without the dedicated participation of
523:
highest peak on March 23. “People” category has a sharp peak on March 12 because of Tom Hanks’s infection. Interestingly, the other categories all have peaked on that day. ‘People’ fluctuated when the news of the confirmed famous spread. Most categories except “People” show decreasing
580:
The shift of attention from the virus Wiki pages to regional tracker pages suggests that internet users are most interested in first gaining knowledge about the disease, but their attention shifts to more geographically constrained information (that likely have immediate impact on
259:
Central to any content we observe on Knowledge (XXG) lies Wikidata, a type of structured data that links all Wiki projects. Most of the articles in Knowledge (XXG) link to a Wikidata Item, which among themselves are linked. For example, there is a link between the
550:
This remark followed large-scale outbreaks in several countries in Europe and the Middle East, especially in Italy, Iran, and France. Since this point, the aggregate attention has stayed at a similar level, reaching over 6,000,000 views a day.
513:
The “others” category pages that describe the socio-economic impact and other related events also track more extensive attention from mid-March. However, far less attention is paid to these pages compared to the other three categories.
192:
From the very start of COVID-19, when it was known just as an outbreak of an atypical pneumonia in China, people around the world have been finding and sharing information about the virus on Knowledge (XXG), a frequent
480:
was created in January, but only became popular in March as local outbreaks began in the US. The regional tracking sites and the accumulated view counts on those pages often mimic the outbreak pattern in that country.
319:
Celebrities and public figures who are related to COVID-19 either as spokespersons, doctors, or as infected patients were grouped as the people category. This category has the largest number of Wiki pages; 516 in
174:
Changwook Jung, Sun Geng, Meeyoung Cha are from the Institute for Basic Science, South Korea & KAIST. Inho Hong is from the Center for Humans & Machines, Max Planck Institute for Human Development,
215:, which will be updated regularly, from the Wikimedia Foundation. All the data used in this article is public and can be scrutinized, accessed, and used by third parties, using the MediaWiki API and other
766:
740:
725:
690:
680:
457:
pages in the People category show a similar view pattern. “Severe acute respiratory syndrome coronavirus 2” had the most views at the beginning of the spreading. For the original version of Figure 3
359:
At first, the Pandemic Wiki page was not a frequently visited one. On March 11, however, this page showed a sharp increase in the number of page-views when the WHO declared the disease as a pandemic.
685:
275:
By looking at the Knowledge (XXG) pages that link to those items, we can obtain a list of articles related to COVID-19 in each language. Constraining results to English Knowledge (XXG) results in
705:
476:
are among the most visited pages from mid-January and throughout the pandemic. The second page describes SARS-Cov-2, the virus that causes COVID-19. On the other hand, the regional tracker page
695:
670:
633:
624:
76:
211:
from general information about the pandemic and regional responses, to the people who have been involved in the pandemic and misinformation about the virus. You can see some of this data in a
675:
663:
840:
730:
735:
657:
55:
44:
720:
700:
426:
we can compare the number of times that each of these pages were visited. Sorting content by the number of maximum daily views, we arrive at these five Wiki pages in Figure 3:
710:
540:
Internet users sought information actively through credible sources like Knowledge (XXG) during the time when not much information was available through official sources.
384:. Such dynamic nature of Wiki content is representative of the time-evolving nature of events. When analyzing Knowledge (XXG) content, such dynamics should be understood.
534:
It is noteworthy to observe that the total view counts on all content are prominent, reaching over 500,000 views in late January. This is faster than the time when the
473:
443:
181:
469:
435:
347:
One challenge in tracking people’s attention is the dynamics in data describing the event. Figure 2 shows a live example of the daily page-views of two Wiki pages:
300:
892:
545:
477:
431:
93:
310:
326:
All other pages that had linked to the COVID-19 Wikidata were categorized as 'others.' These pages often contained information about a specific event (such as
21:
868:
863:
858:
491:" Wikidata, but sometimes this linkage is removed by Knowledge (XXG) editors once the event passes. This adds to the complexity of Knowledge (XXG) data.
180:
Diego Saez-Trumper is a researcher employed by the WMF. This paper represents work beyond his regular duties. This article was originally published on
853:
209:
This post offers an overview of the COVID-19 related data generated in Knowledge (XXG), highlighting the diversity of content that people read:
276:
30:
Open data and COVID-19: Knowledge (XXG) as an informational resource during the pandemic: What COVID-19 data are available from the WMF?
532:
Stacked charts or ratio charts are excellent ways to visualize how the aggregate view counts by topical categories evolve over time.
202:
848:
645:
49:
35:
17:
465:
427:
403:
399:
381:
377:
371:
352:
284:
These Wiki pages covered a plethora of topics, which could be grouped into one of the four categories found by qualitative coding.
235:
243:
page views on English Knowledge (XXG) COVID-19 related articles (left axis — linear scale). For the original version of Figure 1.
376:
This discrepancy may appear when titles of Wiki pages change over time. In this particular case, the original title had been
197:
for medical information. While the content and quality of the information on Knowledge (XXG) is shaped by volunteer editors (
493:
Many other pages on people involved in some way with the disease show a similar spike in page-views during the pandemic.
267:, and the Pandemic one. Therefore, we can identify relevant content related to COVID-19 by looking at these connections.
815:
797:
398:
few days before the declaration. From late March, the view counts are decreasing with the slowing down growth rate. *
331:
874:
292:
216:
335:
487:, who was infected with the virus and has now recovered. Wiki pages of individuals are connected to the "
811:
793:
449:
390:
309:
Tracking pages dedicated to specific regions were quickly created as outbreaks spread globally (e.g.,
578:
The ratio chart could identify the day-to-day composition of public attention across diverse topics.
327:
536:
WHO declared the outbreak a Public Health Emergency of International Concern (PHEIC) on January 30th
825:
303:) belong to this category. Out of 878 total pages, 11 Wiki pages were classified in this category.
363:
185:
562:
but the “Region” category has been dominant since March. For the original version of Figure 5a.
220:
103:
762:
123:
807:
789:
296:
163:
554:
133:
535:
588:
editors who contributed to creating and updating these English Knowledge (XXG) articles.
374:
Wiki page appears to have been created much earlier than the WHO’s announcement in March.
153:
571:
563:
519:
458:
407:
244:
71:
Open data and COVID-19: Knowledge (XXG) as an informational resource during the pandemic
613:
You can learn more about the results and methodology used in this analysis by visiting
503:
886:
601:
how readers are especially interested in what is happening in their local regions.
143:
599:
In our next post, we will show, by analyzing non-English Knowledge (XXG) pages,
113:
614:
484:
439:
270:
585:
212:
198:
824:
I,m so sad bad news from USA, aboutCorona Virus I hope to finished soon -
348:
544:
The most significant peak occurs a few days before March 11th, when the
506:
language, we can see that most of the people were linked by having
291:
Wiki pages that directly cover topics on the virus itself (such as
273:" button that exists in every Knowledge (XXG) and Wikidata article.
194:
553:
448:
406:
before WHO’s announcement. For the original version of Figure 2
389:
234:
54:
644:
227:
Seeking information during COVID-19: English Knowledge (XXG)
34:
162:
By Changwook Jung, Sun Geng, Meeyoung Cha, Inho Hong, and
418:
So which individual pages attracted the most attention?
778:
771:
751:
546:
WHO declared COVID-19 to be characterized as a pandemic
421:
184:
The text, but not the graphs, on "Medium" are licensed
488:
261:
338:). A total of 41 Wiki pages belong to this category.
217:
online resources offered by the Wikimedia Foundation
776:If your comment has not appeared here, you can try
366:) that temporarily became popular due to COVID-19.
470:Severe acute respiratory syndrome coronavirus 2
436:Severe acute respiratory syndrome coronavirus 2
301:Severe acute respiratory syndrome coronavirus 2
199:over 34K contributing to COVID-19 related pages
478:2020 coronavirus pandemic in the United States
432:2020 coronavirus pandemic in the United States
311:2020 coronavirus pandemic in New York (state)
295:), developments on tests and vaccines (e.g.,
8:
474:Timeline of the 2019–20 coronavirus pandemic
444:Timeline of the 2019–20 coronavirus pandemic
269:This can be easily done by clicking on the "
219:. Sample source codes are made available at
893:Knowledge (XXG) Signpost archives 2020-04
483:We also turn our attention to celebrity
362:disease and outbreak-related pages like
313:). Our data contain 310 such Wiki pages.
18:Knowledge (XXG):Knowledge (XXG) Signpost
779:
755:
508:"Medical Condition" or "Cause of Death"
70:
334:), or socio-economic impact (such as
29:
7:
570:Figure 5(b). To view this figure
56:
28:
761:These comments are automatically
148:
138:
128:
118:
108:
98:
88:
772:add the page to your watchlist
609:All this analysis is based on
518:Figure 4. To view this figure
1:
466:2019–20 coronavirus pandemic
428:2019–20 coronavirus pandemic
404:2019–20 coronavirus outbreak
400:2019–20 coronavirus pandemic
382:2019–20 coronavirus pandemic
378:2019–20 coronavirus outbreak
372:2019–20 coronavirus pandemic
353:2019–20 coronavirus pandemic
203:policies about verifiability
497:Content by Topical Category
909:
798:23:27, 26 April 2020 (UTC)
816:02:02, 11 June 2020 (UTC)
528:Content Share by Category
422:Wikimedia pageviews tools
380:and was later changed to
332:NHS Nightingale Hospitals
278:878 Knowledge (XXG) pages
293:Coronavirus disease 2019
213:new interactive resource
430:(on the top red line),
336:2020 stock market crash
299:), and symptoms (e.g.,
769:. To follow comments,
649:
566:
461:
410:
247:
39:
648:
557:
452:
393:
330:), location (such as
238:
221:this Jupyter notebook
38:
765:from this article's
328:2020 Hubei lockdowns
788:well done by all!--
611:public information.
756:Discuss this story
741:WikiProject report
726:On the bright side
691:Arbitration report
650:
637:"By the numbers" →
567:
462:
411:
364:Influenza pandemic
248:
164:Diego Saez-Trumper
45:← Back to Contents
40:
780:purging the cache
681:Discussion report
442:(blue line), and
420:Using the public
414:Most Viewed Pages
50:View Latest Issue
900:
877:
783:
781:
775:
754:
686:Featured content
668:
660:
653:
636:
629:"By the numbers"
628:
343:Content Dynamics
297:COVID-19 vaccine
166:
152:
151:
142:
141:
132:
131:
122:
121:
112:
111:
102:
101:
92:
91:
62:
60:
58:
908:
907:
903:
902:
901:
899:
898:
897:
883:
882:
881:
880:
879:
878:
873:
871:
866:
861:
856:
851:
844:
832:
831:
785:
777:
770:
759:
758:
752:+ Add a comment
750:
746:
745:
744:
706:Recent research
661:
656:
654:
651:
640:
639:
634:
631:
626:
620:
619:
607:
605:Reproducibility
594:
530:
499:
446:(purple line).
434:(yellow line),
416:
345:
271:what links here
257:
229:
195:online resource
168:
167:
161:
160:
159:
158:
149:
139:
129:
119:
109:
99:
89:
83:
80:
69:
65:
63:
53:
52:
47:
41:
31:
26:
25:
24:
12:
11:
5:
906:
904:
896:
895:
885:
884:
872:
867:
862:
857:
852:
847:
846:
845:
834:
833:
830:
829:
828:
821:
820:
819:
818:
801:
800:
760:
757:
749:
748:
747:
743:
738:
733:
728:
723:
718:
716:By the numbers
713:
708:
703:
698:
696:Traffic report
693:
688:
683:
678:
673:
671:News and notes
667:
655:
643:
642:
641:
632:
623:
622:
621:
606:
603:
593:
590:
576:
575:
529:
526:
525:
524:
504:semantic query
498:
495:
438:(green line),
415:
412:
387:
344:
341:
340:
339:
321:
314:
304:
256:
253:
228:
225:
190:
189:
177:
170:
169:
157:
156:
146:
136:
126:
116:
106:
96:
85:
84:
81:
75:
74:
73:
72:
68:By the numbers
67:
66:
64:
61:
48:
43:
42:
33:
32:
27:
15:
14:
13:
10:
9:
6:
4:
3:
2:
905:
894:
891:
890:
888:
876:
870:
865:
860:
855:
850:
842:
838:
827:
823:
822:
817:
813:
809:
805:
804:
803:
802:
799:
795:
791:
787:
786:
782:
773:
768:
764:
753:
742:
739:
737:
734:
732:
729:
727:
724:
722:
719:
717:
714:
712:
709:
707:
704:
702:
699:
697:
694:
692:
689:
687:
684:
682:
679:
677:
674:
672:
669:
665:
659:
658:26 April 2020
652:In this issue
647:
638:
630:
618:
616:
612:
604:
602:
600:
592:Final remarks
591:
589:
587:
582:
581:individuals).
573:
569:
568:
565:
560:
556:
552:
549:
547:
541:
539:
537:
527:
521:
517:
516:
515:
511:
509:
505:
496:
494:
492:
490:
486:
479:
475:
471:
467:
460:
455:
451:
447:
445:
441:
437:
433:
429:
425:
423:
413:
409:
405:
401:
396:
392:
388:
385:
383:
379:
375:
373:
367:
365:
360:
356:
354:
350:
342:
337:
333:
329:
325:
322:
318:
315:
312:
308:
305:
302:
298:
294:
290:
287:
286:
285:
282:
280:
279:
274:
272:
266:
265:Wikidata item
264:
254:
252:
246:
241:
237:
233:
226:
224:
222:
218:
214:
210:
206:
204:
200:
196:
188:
187:
183:
178:
176:
172:
171:
165:
155:
147:
145:
137:
135:
127:
125:
117:
115:
107:
105:
97:
95:
87:
86:
78:
59:
57:26 April 2020
51:
46:
37:
23:
19:
836:
715:
676:In the media
664:all comments
610:
608:
598:
595:
579:
577:
558:
543:
542:
533:
531:
512:
500:
482:
463:
453:
419:
417:
394:
386:
370:Second, the
369:
368:
358:
357:
346:
323:
316:
306:
288:
283:
277:
268:
262:
258:
249:
239:
230:
208:
207:
191:
179:
173:
94:PDF download
875:Suggestions
808:Diego (WMF)
790:Ozzie10aaaa
763:transcluded
586:over 34,000
464:Pages like
255:Methodology
144:X (Twitter)
839:. You can
835:It's your
806:Thanks! --
572:click here
564:click here
520:click here
510:COVID-19.
459:click here
408:click here
245:click here
82:Share this
77:Contribute
22:2020-04-26
869:Subscribe
767:talk page
731:Interview
615:this page
597:content.
559:Figure 5.
485:Tom Hanks
454:Figure 3.
440:Tom Hanks
402:page was
395:Figure 2.
240:Figure 1.
201:) and by
182:"Medium".
887:Category
864:Newsroom
859:Archives
837:Signpost
826:A M Muse
736:In focus
627:Previous
489:COVID-19
349:Pandemic
263:COVID-19
175:Germany.
134:Facebook
124:LinkedIn
114:Mastodon
20: |
841:help us
721:Opinion
701:Gallery
324:Others:
317:People:
307:Region:
472:, and
320:total.
289:Virus:
154:Reddit
104:E-mail
854:About
711:Essay
16:<
849:Home
812:talk
794:talk
635:Next
351:and
186:CC0
79:—
889::
814:)
796:)
625:←
617:.
468:,
355:.
281:.
223:.
843:.
810:(
792:(
784:.
774:.
666:)
662:(
548:.
538:.
424:,
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.