Fundamentals of Spark with Python (using PySpark), code examples
But, PySpark+Jupyter combo needs a little bit more love :-)
Resilient Distributed Datasets (RDD) is a fundamental data structure of Spark. It is an immutable distributed collection of objects. Each d…