Wednesday, June 22, 2016

Getting the best performance with PySpark

Some good tips on PySpark and DataFrame performance from Holden Karau at Spark Summit 2016.

Code examples:
https://github.com/high-performance-spark/high-performance-spark-examples

