1. Why is Spark faster than Hadoop?
  2. RDD vs DataFrame vs Datasets
  3. Components of Apace Spark Eco-system
  4. Fault tolerant and Lazily Evaluated
  5. Types of Transformations in Spark
  6. Map vs FlatMap
  7. Persistence vs cache
  8. Repartition vs Coalesce
  9. Types of Shared variables
  10. Run time architecture of Spark

spark.png


spark.png

Advanced Spark Concepts for Job Interviews

Untitled