Comparison between Transformation and Action in Spark

      What is the difference between transformation and action operations in Apache Spark?
      Compare Transformation vs Action

      Transformations create a new RDD from an existing RDD.
      Transformations are lazy: Spark only records them in the lineage graph and computes them when an action needs the result.
      Ex: filter(), union()

      An action returns a non-RDD value (usually an ordinary value such as a number or a collection).
      Actions trigger execution: Spark uses the lineage graph to load the source data and compute the RDD the action was called on.
      Ex: count(), first()
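
      The difference can be sketched without a Spark cluster. The snippet below is a toy model in plain Python, not Spark's API: ToyRDD, is_even and the calls list are hypothetical names made up for illustration. It assumes only that a transformation records a step in the lineage and that an action replays the lineage and returns an ordinary value.

```python
class ToyRDD:
    """Toy stand-in for an RDD (hypothetical, not Spark): holds a recipe, not results."""

    def __init__(self, source, lineage=()):
        self._source = source    # the original data
        self._lineage = lineage  # recorded transformation steps, in order

    # Transformations: return a new ToyRDD and compute nothing.
    def filter(self, predicate):
        return ToyRDD(self._source, self._lineage + (("filter", predicate),))

    def map(self, func):
        return ToyRDD(self._source, self._lineage + (("map", func),))

    # Helper used by actions: replay the lineage over the source data.
    def _compute(self):
        items = iter(self._source)
        for kind, fn in self._lineage:
            items = filter(fn, items) if kind == "filter" else map(fn, items)
        return items

    # Actions: run the computation and return a non-RDD value.
    def count(self):
        return sum(1 for _ in self._compute())

    def first(self):
        return next(self._compute())


calls = []  # records every element the predicate is actually applied to


def is_even(n):
    calls.append(n)
    return n % 2 == 0


numbers = ToyRDD(range(1, 11))
evens = numbers.filter(is_even)    # transformation: only recorded, nothing runs
calls_before_action = len(calls)   # predicate has not been called yet

even_count = evens.count()         # action: replays the lineage over all elements
calls_after_action = len(calls)

first_even = evens.first()         # action: returns a plain int, not a ToyRDD
```

      In this sketch filter() hands back another ToyRDD straight away, while count() and first() are the calls that actually run is_even. Real Spark additionally splits the work across partitions and executors, which this toy leaves out.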

      For more on RDD operations, refer to RDD Transformation and Action.

