Partitioning is one of the basic building blocks of the Spark framework. How data is partitioned at each stage of a program plays a major role in its reliability, scalability, and efficiency. In fact, many Spark programs can be optimized right away just by setting the right partitioning across their stages. This book will help you understand the various aspects of Spark partitioning in depth. Armed with that knowledge, you will be able to set the right partitioning in your Spark jobs for large datasets.