Online Course


What is the course about?

Spark is an online course provided by Udacity at the Intermediate skill level. It may be possible to receive a verified certificate or to use the course to prepare for a degree.

In this course, you’ll learn how to use Spark to work with big data and build machine learning models at scale, including how to wrangle and model massive datasets with PySpark, the Python library for interacting with Spark. In the first lesson, you will learn about big data and how Spark fits into the big data ecosystem. In lesson two, you will practice processing and cleaning datasets to get comfortable with Spark’s SQL and DataFrame APIs. In lesson three, you will debug and optimize Spark code running on a cluster. In lesson four, you will use Spark’s Machine Learning Library to train machine learning models at scale.

Course description
  • Spark
  • 10 hours
  • Master how to work with big data and build machine learning models at scale using Spark!
  • The Power of Spark
  • Data Wrangling with Spark
  • Debugging and Optimization
  • Machine Learning with Spark
  • Understand the big data ecosystem
  • Understand when to use Spark and when not to use it
  • Manipulate data with Spark SQL and Spark DataFrames
  • Use Spark for wrangling massive datasets
  • Troubleshoot common errors and optimize your code using the Spark Web UI
  • Use Spark’s Machine Learning Library to train machine learning models at scale
  • This course is ideal for students with programming and data analysis experience.
  • See the Technology Requirements for using Udacity.
  • Spark is a top open source project used by the largest companies and startups around the world to efficiently analyze messy data sets.

Prerequisites & Facts


Course Topic

Data Scientist

University, College, Institution

Udacity

Course Skill Level

Intermediate

Course Language


Place of class

Online, self-paced (see curriculum for more information)



Degree & Cost


To obtain a verified certificate from Udacity, you have to finish this course or, if there is a new edition, its latest version. The class may be free of charge, but there may be a cost to receive a verified certificate or to access the learning materials. The specifics of the course may have changed; please consult the provider for the latest prices and news.
provided by Udacity


This page collects information, reviews, and user experiences for the course “Spark”. The provider of the course, Udacity, can answer questions about the class through its official support channels. If you have taken the course, an honest review will help others choose the right class.
School: Udacity
Topic: Data Scientist