Tag: Spark: The Definitive Guide
All the articles with the tag "Spark: The Definitive Guide".
-
Matei Zaharia - Spark: The Definitive Guide - Architecture of a Spark Application
The Architecture of a Spark Application. The Spark driver: The driver is the process “in the driver seat” of your Spark Application. It is the controller of the execution of a Spark Application and…
-
Matei Zaharia - Spark: The Definitive Guide - Life Cycle of a Spark Application
The Life Cycle of a Spark Application (Inside Spark). The SparkSession: The first step of any Spark Application is creating a SparkSession. In many interactive modes, this is done for you, but in an…
-
Matei Zaharia - Spark: The Definitive Guide - Common Operations
Define schemas manually: When using Spark for production Extract, Transform, and Load (ETL), it is often a good idea to define your schemas manually, especially when working with untyped data sources…