Jul 16, 2024
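map and filter are transformations. collect, show, and save as file are actions.

Transformations are lazy: nothing runs until an action (collect, show, or save as file) is called.

map and filter are narrow transformations, with no shuffle. reduceByKey and groupByKey are wide transformations that shuffle data.

Consider a job that chains read, flatMap, map, reduceByKey, and collect:

read, flatMap, map (recorded lazily and pipelined in order; no job appears in the Spark UI until an action is performed).
reduceByKey (creates a new stage due to shuffling).
collect (triggers the execution of the DAG).

This yields two stages: read, map, and flatMap in one stage, and reduceByKey in another due to shuffling.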
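
To make the lazy-then-action behaviour and the stage split concrete, here is a minimal sketch in plain Python. It is a toy model and not the Spark API: the class LazyPipeline, the WIDE set, and the helper to_pair are hypothetical names invented for illustration, and a real Spark scheduler does far more than this. The sketch only assumes what the text above describes: transformations are recorded, a wide transformation starts a new stage, and collect runs the recorded plan.

```python
# Toy model only: NOT the Spark API. All names here are hypothetical.
WIDE = {"reduceByKey"}  # operations assumed to require a shuffle


class LazyPipeline:
    def __init__(self, source, ops=()):
        self.source = source   # stands in for `read`
        self.ops = tuple(ops)  # recorded (name, function) pairs; nothing has run yet

    def _add(self, name, fn):
        # A transformation only records itself and returns a new pipeline.
        return LazyPipeline(self.source, self.ops + ((name, fn),))

    def map(self, fn):
        return self._add("map", fn)

    def flatMap(self, fn):
        return self._add("flatMap", fn)

    def filter(self, fn):
        return self._add("filter", fn)

    def reduceByKey(self, fn):
        return self._add("reduceByKey", fn)

    def stages(self):
        """Group recorded operations, starting a new stage at each wide one."""
        result, current = [], ["read"]
        for name, _ in self.ops:
            if name in WIDE:
                result.append(current)
                current = []
            current.append(name)
        result.append(current)
        return result

    def collect(self):
        """Action: only now is the recorded plan actually executed."""
        data = list(self.source)
        for name, fn in self.ops:
            if name == "map":
                data = [fn(x) for x in data]
            elif name == "flatMap":
                data = [y for x in data for y in fn(x)]
            elif name == "filter":
                data = [x for x in data if fn(x)]
            elif name == "reduceByKey":
                merged = {}
                for key, value in data:
                    merged[key] = fn(merged[key], value) if key in merged else value
                data = list(merged.items())
        return data


calls = []  # records every time the map function really runs


def to_pair(word):
    calls.append(word)
    return (word, 1)


pipeline = (
    LazyPipeline(["a b a", "b a"])  # read
    .flatMap(str.split)
    .map(to_pair)
    .reduceByKey(lambda x, y: x + y)
)

calls_before_action = list(calls)  # still empty: no transformation has run
result = pipeline.collect()        # the action triggers execution

print(calls_before_action)  # []
print(pipeline.stages())    # [['read', 'flatMap', 'map'], ['reduceByKey']]
print(sorted(result))       # [('a', 3), ('b', 2)]
```

The empty calls_before_action list mirrors the point that no work happens until an action is called, and stages() mirrors the two-stage split described above. In real Spark the equivalent evidence would be the Jobs and Stages tabs of the Spark UI.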