Knowledge (XXG)

HPCC

HPCC (High-Performance Computing Cluster), also known as DAS (Data Analytics Supercomputer), is an open source, data-intensive computing system platform developed by LexisNexis Risk Solutions. The HPCC platform incorporates a software architecture implemented on commodity computing clusters to provide high-performance, data-parallel processing for applications utilizing big data. The HPCC platform includes system configurations to support both parallel batch data processing (Thor) and high-performance online query applications using indexed data files (Roxie). The HPCC platform also includes a data-centric declarative programming language for parallel data processing called ECL.

The public release of HPCC was announced in 2011, after ten years of in-house development (according to LexisNexis). It is an alternative to Hadoop and other big data platforms.

Developer(s): HPCC Systems, LexisNexis Risk Solutions
Initial release: 15-06-2011
Stable release: 7.4.18-1 / 13-09-2019
Repository: https://github.com/hpcc-systems
Written in: C++, ECL
Operating system: Linux
License: Apache License 2.0
Website: hpccsystems.com

System architecture

The HPCC system architecture includes two distinct cluster processing environments, Thor and Roxie, each of which can be optimized independently for its parallel data processing purpose.

The first of these platforms is called Thor, a data refinery whose overall purpose is the general processing of massive volumes of raw data of any type for any purpose but typically used for data cleansing and hygiene, ETL (extract, transform, load) processing of the raw data, record linking and entity resolution, large-scale ad-hoc complex analytics, and creation of keyed data and indexes to support high-performance structured queries and data warehouse applications. The data refinery name Thor is a reference to the mythical Norse god of thunder with the large hammer symbolic of crushing large amounts of raw data into useful information. A Thor cluster is similar in its function, execution environment, filesystem, and capabilities to the Google and Hadoop MapReduce platforms.

Figure 2 shows a representation of a physical Thor processing cluster which functions as a batch job execution engine for scalable data-intensive computing applications. In addition to the Thor master and slave nodes, additional auxiliary and common components are needed to implement a complete HPCC processing environment.

Figure 2. Thor processing cluster

The second of the parallel data processing platforms is called Roxie and functions as a rapid data delivery engine. This platform is designed as an online high-performance structured query and analysis platform or data warehouse delivering the parallel data access processing requirements of online applications through Web services interfaces supporting thousands of simultaneous queries and users with sub-second response times. Roxie utilizes a distributed indexed filesystem to provide parallel processing of queries using an optimized execution environment and filesystem for high-performance online processing. A Roxie cluster is similar in its function and capabilities to ElasticSearch and Hadoop with HBase and Hive capabilities added, and provides for near real time predictable query latencies. Both Thor and Roxie clusters utilize the ECL programming language for implementing applications, increasing continuity and programmer productivity.

Figure 3 shows a representation of a physical Roxie processing cluster which functions as an online query execution engine for high-performance query and data warehousing applications. A Roxie cluster includes multiple nodes with server and worker processes for processing queries; an additional auxiliary component called an ESP server which provides interfaces for external client access to the cluster; and additional common components which are shared with a Thor cluster in an HPCC environment. Although a Thor processing cluster can be implemented and used without a Roxie cluster, an HPCC environment which includes a Roxie cluster should also include a Thor cluster. The Thor cluster is used to build the distributed index files used by the Roxie cluster and to develop online queries which will be deployed with the index files to the Roxie cluster.

Figure 3. Roxie processing cluster

Software architecture

The HPCC software architecture incorporates the Thor and Roxie clusters as well as common middleware components, an external communications layer, client interfaces which provide both end-user services and system management tools, and auxiliary components to support monitoring and to facilitate loading and storing of filesystem data from external sources. Usually an HPCC environment includes only Thor clusters, or both Thor and Roxie clusters, although Roxie occasionally is used to build its own indexes. The overall HPCC software architecture is shown in Figure 4.

Figure 4. HPCC software architecture

HPCC Systems

HPCC Systems (High Performance Computing Cluster) is part of LexisNexis Risk Solutions and was formed to promote and sell the HPCC software. In June 2011, it announced the offering of the software under an open source dual license model. HPCC Systems offers both a Community Edition and an Enterprise Edition. The Community Edition is free to download, includes the source code and is released under the Apache License 2.0. The Enterprise Edition is available under a paid commercial license and includes training, support, indemnification and additional modules. In November 2011, HPCC Systems announced the availability of its Thor Data Refinery Cluster on Amazon Web Services. In January 2012, HPCC Systems announced distributed machine learning algorithms.

See also

Apache Hadoop
Apache Spark
Aster Data Systems
ECL (data-centric programming language)
ElasticSearch
Sector/Sphere
Machine learning
MapReduce

External links

Sandia sees data management challenges spiral
Sandia National Laboratories Leverages the Data Analytics Supercomputer (DAS) by LexisNexis Risk & Information Analytics Group, Which Offers Breakthrough High Performance Computing to Address Data Management and Analysis Challenges
Programming models for the LexisNexis High Performance Computing Cluster
LexisNexis Data Analytics Supercomputer
LexisNexis HPCC Systems
Reference to the term BORPS (Billions of Records Per Second)
LexisNexis Brings Its Data Management Magic To Bear on Scientific Data
High Performance Computing Clusters (HPCC) and Big Data Analytics Certificate - Stand-Alone
FAU Receives National Science Foundation Rapid Response Grant to Develop Innovative Computer Model for Ebola Spread
CPL Online delivers added value for clients through its Big Data Platform
HPCC Systems
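
Illustrative sketch

The division of labour described under System architecture, where a batch data refinery (Thor) cleanses raw data and builds keyed indexes and an online engine (Roxie) answers keyed queries from those indexes, can be sketched in a few lines of Python. This is an illustration only: it is not ECL and does not use any HPCC API. NUM_NODES, node_for, batch_build_index and query are hypothetical names chosen for the example, and in-memory dictionaries stand in for distributed index files.

```python
from collections import defaultdict
from zlib import crc32

NUM_NODES = 3  # hypothetical cluster size, chosen only for this example


def node_for(key: str) -> int:
    """Pick the partition that owns a key, using a stable hash (crc32)."""
    return crc32(key.encode("utf-8")) % NUM_NODES


def batch_build_index(raw_records):
    """Batch step (Thor-like role): cleanse records, then build one keyed index per partition."""
    partitions = [defaultdict(list) for _ in range(NUM_NODES)]
    for name, city in raw_records:
        # Cleansing: normalise whitespace and letter case.
        name, city = name.strip().lower(), city.strip().title()
        if not name:
            continue  # drop records that have no usable key
        partitions[node_for(name)][name].append(city)
    return partitions


def query(partitions, name: str):
    """Online step (Roxie-like role): keyed lookup that touches only the owning partition."""
    key = name.strip().lower()
    return sorted(partitions[node_for(key)].get(key, []))


raw = [(" Alice ", "boca raton"), ("BOB", "miami"), ("alice", "atlanta"), ("", "nowhere")]
index = batch_build_index(raw)
print(query(index, "Alice"))  # ['Atlanta', 'Boca Raton']
print(query(index, "carol"))  # []
```

The point of the sketch is the ordering of work: the expensive cleansing and index construction happen once in the batch step, so each online query reduces to a hash and a dictionary lookup on a single partition.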


Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.