307:
auxiliary component called an ESP server which provides interfaces for external client access to the cluster; and additional common components which are shared with a Thor cluster in an HPCC environment. Although a Thor processing cluster can be implemented and used without a Roxie cluster, an HPCC environment which includes a Roxie cluster should also include a Thor cluster. The Thor cluster is used to build the distributed index files used by the Roxie cluster and to develop online queries which will be deployed with the index files to the Roxie cluster.
270:
220:
51:
285:. This platform is designed as an online high-performance structured query and analysis platform or data warehouse delivering the parallel data access processing requirements of online applications through Web services interfaces supporting thousands of simultaneous queries and users with sub-second response times. Roxie utilizes a
311:
192:. The HPCC platform includes system configurations to support both parallel batch data processing (Thor) and high-performance online query applications using indexed data files (Roxie). The HPCC platform also includes a data-centric declarative programming language for parallel data processing called
265:
Figure 2 shows a representation of a physical Thor processing cluster which functions as a batch job execution engine for scalable data-intensive computing applications. In addition to the Thor master and slave nodes, additional auxiliary and common components are needed to implement a complete HPCC
327:
components, an external communications layer, client interfaces which provide both end-user services and system management tools, and auxiliary components to support monitoring and to facilitate loading and storing of filesystem data from external sources. Usually a HPCC environment includes only
306:
Figure 3 shows a representation of a physical Roxie processing cluster which functions as an online query execution engine for high-performance query and data warehousing applications. A Roxie cluster includes multiple nodes with server and worker processes for processing queries; an additional
254:
is a reference to the mythical Norse god of thunder with the large hammer symbolic of crushing large amounts of raw data into useful information. A Thor cluster is similar in its function, execution environment, filesystem, and capabilities to the Google and
250:) processing of the raw data, record linking and entity resolution, large-scale ad-hoc complex analytics, and creation of keyed data and indexes to support high-performance structured queries and data warehouse applications. The data refinery name
350:
2.0. The
Enterprise Edition is available under a paid commercial license and includes training, support, indemnification and additional modules. In November 2011, HPCC Systems announced the availability of its Thor Data Refinery Cluster on
302:
capabilities added, and provides for near real time predictable query latencies. Both Thor and Roxie clusters utilize the ECL programming language for implementing applications, increasing continuity and programmer productivity.
685:
Sandia
National Laboratories Leverages the Data Analytics Supercomputer (DAS) by LexisNexis Risk & Information Analytics Group, Which Offers Breakthrough High Performance Computing to Address Data Management and Analysis
289:
to provide parallel processing of queries using an optimized execution environment and filesystem for high-performance online processing. A Roxie cluster is similar in its function and capabilities to
695:
328:
Thor clusters, or both Thor and Roxie clusters, although Roxie occasionally is used to build its own indexes. The overall HPCC software architecture is shown in Figure 4.
346:
HPCC Systems offers both a
Community Edition and an Enterprise Edition. The Community Edition is free to download, includes the source code and is released under the
246:
whose overall purpose is the general processing of massive volumes of raw data of any type for any purpose but typically used for data cleansing and hygiene, ETL (
684:
755:
710:
343:
and was formed to promote and sell the HPCC software. In June 2011, it announced the offering of the software under an open source dual license model.
725:
383:
193:
118:
624:
679:
765:
35:
494:
750:
715:
99:
745:
458:, "ECL/HPCC: A Unified Approach to Big Data," by A.M. Middleton. Handbook of Data Intensive Computing. Springer, 2011.
83:
425:, "Data-Intensive Technologies for Cloud Computing," by A.M. Middleton. Handbook of Cloud Computing. Springer, 2010.
721:
FAU Receives
National Science Foundation Rapid Response Grant to Develop Innovative Computer Model for Ebola Spread
422:
598:
546:
468:
340:
177:
64:
650:
760:
247:
173:
572:
286:
520:
705:
690:
436:
181:
441:
203:
in 2011, after ten years of in-house development (according to LexisNexis). It is an alternative to
352:
185:
378:
200:
720:
398:
356:
137:
125:
716:
High
Performance Computing Clusters (HPCC) and Big Data Analytics Certificate - Stand-Alone
235:, each of which can be optimized independently for its parallel data processing purpose.
323:
The HPCC software architecture incorporates the Thor and Roxie clusters as well as common
435:"HPCC Systems: Introduction to HPCC (High-Performance Computing Cluster)". 24 May 2011.
347:
142:
739:
393:
388:
368:
291:
256:
373:
295:
227:
The HPCC system architecture includes two distinct cluster processing environments
269:
455:
188:
to provide high-performance, data-parallel processing for applications utilizing
299:
219:
324:
58:
403:
259:
17:
547:"LexisNexis Will Open-Source Its Hadoop Alternative for Handling Big Data"
469:"LexisNexis Will Open-Source Its Hadoop Alternative for Handling Big Data"
50:
726:
CPL Online delivers added value for clients through its Big Data
Platform
208:
189:
691:
Programming models for the LexisNexis High
Performance Computing Cluster
310:
711:
LexisNexis Brings Its Data
Management Magic To Bear on Scientific Data
204:
625:"HPCC Announces Availability of ETL Cluster On Amazon Web Services"
114:
309:
268:
218:
130:
104:
251:
277:
The second of the parallel data processing platforms is called
730:
706:
Reference to the term BORPS (Billions of
Records Per Second)
700:
154:
355:. In January 2012, HPCC Systems announced distributed
168:(High-Performance Computing Cluster), also known as
172:(Data Analytics Supercomputer), is an open source,
149:
136:
124:
110:
98:
82:
70:
57:
339:(High Performance Computing Cluster) is part of
573:"HPCC A New/Old Kid In Town To Take On Hadoop"
680:Sandia sees data management challenges spiral
8:
43:
651:"HPCC Systems Intros Machine Learning Beta"
521:"LexisNexis open-sources its Hadoop killer"
49:
42:
440:
696:LexisNexis Data Analytics Supercomputer
415:
384:ECL (data-centric programming language)
238:The first of these platforms is called
495:"9 Useful Open Source Big Data Tools"
7:
456:Handbook of Data Intensive Computing
314:Figure 4. HPCC software architecture
180:. The HPCC platform incorporates a
599:"LexisNexis Joins Linux Foundation"
273:Figure 3. Roxie processing cluster
25:
756:Declarative programming languages
223:Figure 2. Thor processing cluster
37:Harry Potter and the Cursed Child
27:High-performance computer cluster
199:The public release of HPCC was
105:https://github.com/hpcc-systems
287:distributed indexed filesystem
1:
176:system platform developed by
186:commodity computing clusters
423:Handbook of Cloud Computing
782:
283:rapid data delivery engine
29:
766:Data warehousing products
341:LexisNexis Risk Solutions
178:LexisNexis Risk Solutions
94:
78:
65:LexisNexis Risk Solutions
48:
34:West End stage play, see
266:processing environment.
248:extract, transform, load
174:data-intensive computing
89:7.4.18-1 / 13-09-2019
701:LexisNexis HPCC Systems
315:
274:
224:
751:Distributed computing
629:Cloud Computing Today
319:Software architecture
313:
272:
222:
182:software architecture
603:The Linux Foundation
499:EnterpriseAppsToday
353:Amazon Web Services
281:and functions as a
215:System architecture
45:
746:Parallel computing
631:. 17 December 2012
379:Aster Data Systems
316:
275:
225:
657:. 31 January 2012
163:
162:
16:(Redirected from
773:
667:
666:
664:
662:
647:
641:
640:
638:
636:
621:
615:
614:
612:
610:
595:
589:
588:
586:
584:
569:
563:
562:
560:
558:
543:
537:
536:
534:
532:
517:
511:
510:
508:
506:
491:
485:
484:
482:
480:
465:
459:
453:
447:
446:
444:
432:
426:
420:
399:Machine learning
357:machine learning
294:and Hadoop with
159:
156:
126:Operating system
53:
46:
21:
781:
780:
776:
775:
774:
772:
771:
770:
761:Query languages
736:
735:
676:
671:
670:
660:
658:
649:
648:
644:
634:
632:
623:
622:
618:
608:
606:
597:
596:
592:
582:
580:
571:
570:
566:
556:
554:
545:
544:
540:
530:
528:
519:
518:
514:
504:
502:
493:
492:
488:
478:
476:
467:
466:
462:
454:
450:
442:10.1.1.456.3571
434:
433:
429:
421:
417:
412:
365:
334:
321:
217:
184:implemented on
153:
90:
71:Initial release
41:
28:
23:
22:
15:
12:
11:
5:
779:
777:
769:
768:
763:
758:
753:
748:
738:
737:
734:
733:
728:
723:
718:
713:
708:
703:
698:
693:
688:
682:
675:
674:External links
672:
669:
668:
642:
616:
605:. 17 June 2011
590:
579:. 16 June 2011
564:
553:. 15 June 2011
538:
527:. 15 June 2011
512:
486:
475:. 15 June 2011
460:
448:
427:
414:
413:
411:
408:
407:
406:
401:
396:
391:
386:
381:
376:
371:
364:
361:
348:Apache License
333:
330:
320:
317:
216:
213:
161:
160:
151:
147:
146:
143:Apache License
140:
134:
133:
128:
122:
121:
112:
108:
107:
102:
96:
95:
92:
91:
88:
86:
84:Stable release
80:
79:
76:
75:
72:
68:
67:
63:HPCC Systems,
61:
55:
54:
26:
24:
14:
13:
10:
9:
6:
4:
3:
2:
778:
767:
764:
762:
759:
757:
754:
752:
749:
747:
744:
743:
741:
732:
729:
727:
724:
722:
719:
717:
714:
712:
709:
707:
704:
702:
699:
697:
694:
692:
689:
687:
683:
681:
678:
677:
673:
656:
652:
646:
643:
630:
626:
620:
617:
604:
600:
594:
591:
578:
574:
568:
565:
552:
548:
542:
539:
526:
522:
516:
513:
501:. 11 Nov 2015
500:
496:
490:
487:
474:
470:
464:
461:
457:
452:
449:
443:
438:
431:
428:
424:
419:
416:
409:
405:
402:
400:
397:
395:
394:Sector/Sphere
392:
390:
389:ElasticSearch
387:
385:
382:
380:
377:
375:
372:
370:
369:Apache Hadoop
367:
366:
362:
360:
358:
354:
349:
344:
342:
338:
331:
329:
326:
318:
312:
308:
304:
301:
297:
293:
292:ElasticSearch
288:
284:
280:
271:
267:
263:
261:
258:
253:
249:
245:
244:data refinery
241:
236:
234:
230:
221:
214:
212:
210:
206:
202:
197:
195:
191:
187:
183:
179:
175:
171:
167:
158:
152:
148:
144:
141:
139:
135:
132:
129:
127:
123:
120:
116:
113:
109:
106:
103:
101:
97:
93:
87:
85:
81:
77:
73:
69:
66:
62:
60:
56:
52:
47:
39:
38:
33:
19:
731:HPCC Systems
659:. Retrieved
654:
645:
633:. Retrieved
628:
619:
607:. Retrieved
602:
593:
581:. Retrieved
577:NetworkWorld
576:
567:
555:. Retrieved
550:
541:
529:. Retrieved
524:
515:
503:. Retrieved
498:
489:
477:. Retrieved
472:
463:
451:
430:
418:
374:Apache Spark
359:algorithms.
345:
337:HPCC Systems
336:
335:
332:HPCC Systems
322:
305:
282:
278:
276:
264:
243:
239:
237:
232:
228:
226:
198:
169:
165:
164:
59:Developer(s)
36:
32:Harry Potter
31:
18:HPCC Systems
661:29 November
635:30 November
609:29 November
557:20 November
505:18 November
479:20 November
262:platforms.
211:platforms.
155:hpccsystems
740:Categories
686:Challenges
583:2 December
531:8 November
410:References
325:middleware
207:and other
111:Written in
100:Repository
74:15-06-2011
551:ReadWrite
473:ReadWrite
437:CiteSeerX
404:MapReduce
260:MapReduce
201:announced
655:Datanami
363:See also
209:Big data
190:big data
30:For the
150:Website
138:License
525:GigaOM
439:
257:Hadoop
205:Hadoop
296:HBase
279:Roxie
233:Roxie
131:Linux
663:2014
637:2014
611:2014
585:2014
559:2014
533:2014
507:2015
481:2014
300:Hive
298:and
252:Thor
242:, a
240:Thor
231:and
229:Thor
166:HPCC
157:.com
44:HPCC
194:ECL
170:DAS
145:2.0
119:ECL
115:C++
742::
653:.
627:.
601:.
575:.
549:.
523:.
497:.
471:.
196:.
117:,
665:.
639:.
613:.
587:.
561:.
535:.
509:.
483:.
445:.
40:.
20:)
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.