Knowledge (XXG)

Change detection and notification

Source 📝

82:
activity by change detection and notification vendors is minimal. No one vendor has successfully leveraged exclusive rights to change detection and notification technology through patents or other legal means. This has resulted in significant functional overlap between products and services.
30:
are designed to find web pages, CDN systems are designed to monitor changes to web pages. Before change detection and notification, it was necessary for users to manually check for web page changes, either by revisiting web sites or periodically searching again. Efficient and effective change
70:
Monitoring options vary by service or product and range from monitoring a single web page at a time to entire web sites. What is actually monitored also varies by service or product with the possibilities of monitoring text, links, documents, scripts, images or screen shots.
155:
Some web pages change regularly, due to the inclusion of adverts or feeds in the presented page. This can trigger false-positives in the change-detection, since users are often only interested in changes to the main content. Some approaches to mitigate this issue exist.
59:. Historically, change polling has been done either by a server which sent email notifications or a desktop program which audibly alerted the user to a change. Change alerting is also possible directly to mobile devices and through 47:
developed the first change detection and notification tool, known as Mind-it, which ran for six years. This spawned new services such as ChangeDetection (1999), ChangeDetect (2002),
170:
Content extraction. For popular sites, or sites running popular software, content may be actively separated from chaff by selecting a sub-tree of the DOM, for example using
127:, where the software which would normally run on a separate server runs on your own hardware locally, generally means that the software provides a miniature 390: 167:) and ignore changes below some threshold. The threshold may be set by the user, or estimated automatically by comparing some early versions of the page. 111:. Typically, an associated website with a configuration is managed by the user. Some services also have a mobile device application which connects to a 160:
Create a metric of difference between two versions of a page (calculated for example from change in total size, changes in HTML file, or changes in the
56: 52: 31:
detection and notification is hampered by the fact that most servers do not accurately track content changes through Last-Modified or
322: 23:) is the automatic detection of changes made to World Wide Web pages and notification to interested users by email or other means. 196:
Mallawaarachchi, Vijini; Meegahapola, Lakmal; Alwis, Roshan; Heshan, Eranga; Meedeniya, Dulani; Jayarathna, Sampath (2020-05-14).
255: 305:
Chakravarthy, S.; Hara, S. C. H. (2006). "Automating Change Detection and Notification of Web Pages (Invited Paper)".
124: 147:
polls content, tracks changes and logs data. Client applications can be browser extensions, mobile apps or programs.
144: 132: 280: 103:
A server polls content, tracks changes and logs data, sending alerts in the form of email notifications,
161: 92: 79: 213: 175: 328: 237: 203: 356: 318: 229: 60: 341: 310: 221: 35:
web-server headers. In 2019 a comprehensive analysis regarding CDN systems was published.
217: 123:
A relatively newer approach, which lays between server-based and client-based is to use
164: 384: 259: 75: 48: 27: 241: 332: 307:
17th International Conference on Database and Expert Systems Applications (DEXA'06)
112: 374: 233: 128: 360: 32: 342:"A Survey on Web Page Change Detection System Using Different Approaches" 314: 64: 104: 44: 197: 91:
Change detection and notification services can be categorized by the
225: 208: 171: 74:
With the notable exception of Google's patent filings related to
349:
International Journal of Computer Science and Mobile Computing
284: 108: 95:
they use. Three principal approaches can be distinguished:
199:
Change Detection and Notification of Web Pages: A Survey
377:
Self-hosted website change detection and notification
281:"He created Google Alerts. Now he's an almond farmer" 51:(2003), and Versionista (2007) which was used by the 178:
to extract only the text the user is interested in.
67:and HTTP callbacks for application integration. 340:Shobhna, Bansal; Chadhaury, Manoj (June 2013). 131:with a browser interface instead of a classic 8: 115:and provides alerts to the mobile device. 207: 57:2008 United States presidential election 188: 174:. Another typical method is the use of 53:John McCain 2008 presidential campaign 7: 256:"To the Wayback Machine, Sherman!" 143:A local client application with a 14: 391:Change detection and notification 17:Change detection and notification 1: 135:provided by an application. 407: 375:changedetection.io github 145:graphical user interface 133:graphical user interface 87:Architectural approaches 355:(6). IJCSMC: 294–299. 93:software architecture 80:intellectual property 315:10.1109/DEXA.2006.34 55:in the race for the 218:2019arXiv190102660M 176:regular expressions 61:push notifications 119:Self-hosted based 398: 371: 369: 367: 346: 336: 297: 296: 294: 292: 277: 271: 270: 268: 266: 252: 246: 245: 211: 193: 406: 405: 401: 400: 399: 397: 396: 395: 381: 380: 365: 363: 344: 339: 325: 309:. p. 465. 304: 301: 300: 290: 288: 279: 278: 274: 264: 262: 254: 253: 249: 226:10.1145/3369876 195: 194: 190: 185: 153: 141: 121: 101: 89: 41: 12: 11: 5: 404: 402: 394: 393: 383: 382: 379: 378: 372: 337: 323: 299: 298: 287:. 4 April 2016 272: 247: 187: 186: 184: 181: 180: 179: 168: 152: 151:Considerations 149: 140: 137: 120: 117: 100: 97: 88: 85: 40: 37: 28:search engines 13: 10: 9: 6: 4: 3: 2: 403: 392: 389: 388: 386: 376: 373: 362: 358: 354: 350: 343: 338: 334: 330: 326: 324:0-7695-2641-1 320: 316: 312: 308: 303: 302: 286: 282: 276: 273: 261: 260:The Economist 257: 251: 248: 243: 239: 235: 231: 227: 223: 219: 215: 210: 205: 201: 200: 192: 189: 182: 177: 173: 169: 166: 163: 159: 158: 157: 150: 148: 146: 138: 136: 134: 130: 126: 118: 116: 114: 110: 106: 98: 96: 94: 86: 84: 81: 77: 76:Google Alerts 72: 68: 66: 62: 58: 54: 50: 49:Google Alerts 46: 38: 36: 34: 29: 24: 22: 18: 364:. Retrieved 352: 348: 306: 289:. Retrieved 275: 263:. Retrieved 250: 198: 191: 154: 142: 139:Client based 125:self-hosting 122: 113:cloud server 102: 99:Server based 90: 73: 69: 42: 25: 20: 16: 15: 366:8 September 291:9 September 234:1201518429 209:1901.02660 183:References 129:web server 361:2320-088X 265:9 January 43:In 1996, 385:Category 242:57759312 105:webhooks 65:webhooks 26:Whereas 333:6395251 214:Bibcode 45:NetMind 39:History 359:  331:  321:  240:  232:  345:(PDF) 329:S2CID 238:S2CID 204:arXiv 172:XPath 368:2016 357:ISSN 319:ISBN 293:2016 267:2019 230:OCLC 165:tree 33:ETag 311:doi 285:CNN 222:doi 162:DOM 109:RSS 21:CDN 387:: 351:. 347:. 327:. 317:. 283:. 258:. 236:. 228:. 220:. 212:. 202:. 107:, 78:, 63:, 370:. 353:2 335:. 313:: 295:. 269:. 244:. 224:: 216:: 206:: 19:(

Index

search engines
ETag
NetMind
Google Alerts
John McCain 2008 presidential campaign
2008 United States presidential election
push notifications
webhooks
Google Alerts
intellectual property
software architecture
webhooks
RSS
cloud server
self-hosting
web server
graphical user interface
graphical user interface
DOM
tree
XPath
regular expressions
Change Detection and Notification of Web Pages: A Survey
arXiv
1901.02660
Bibcode
2019arXiv190102660M
doi
10.1145/3369876
OCLC

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.