Quick Start: Using Apache Spark for Large-Scale Data Processing

Xiao-Yong Jin, Argonne National Laboratory
ALCF Dev Sessions

Join us for an interactive webinar on using Apache Spark, a framework for parallel data processing, on ALCF computing resources. The webinar will present a brief tutorial on Apache Spark, provide instructions for running the framework on ALCF systems, discuss the unique characteristics of the ALCF's Theta system, and recommend a few tuning parameters for improving performance.

This month's session will be led by the ALCF's Xiao-Yong Jin.