As the most active open-source project in the big data community, Apache SparkTM has become the de-facto standard for big data processing and analytics. Spark’s ease of use, versatility, and speed has ...