Spark Cluster Manager
Lesson objectives
In this lesson, we will explain the following topics:
- Understand the role of the cluster manager in Spark applications.
- Learn about the different cluster managers supported by Spark, including Standalone, Hadoop YARN, Apache Mesos, and Kubernetes.
- Explore the resource allocation and management responsibilities of the cluster manager.
Cluster Manager
Introduction to Spark Cluster Managers
- The cluster manager allocates resources for Spark applications.
- Supports several managers: Standalone, Hadoop YARN, Apache Mesos, and Kubernetes.
Role of the Cluster Manager
- The Spark Driver and Executors do not exist in a void, and this is where the cluster manager comes in.
- The cluster manager is important for managing a cluster of machines intended to run Spark Applications.
- Maintains a
driver (or master)
andworker
nodes, tied to physical machines.
Cluster Manager Components
Figure 1: A cluster driver and worker (no Spark Application yet).
Execution of Spark Applications
- The user requests resources from the cluster manager to initiate Spark applications.
- The user configures the application to specify resources for the driver or only for executors.
- The cluster manager directly manages the machines during the execution of the application.
Watch on Youtube
Watch on our Servers
You can download the videog the link and chose save link as: Download Video