25.8.20

Databricks Certified Associate Developer for Apache Spark

Tristen Wentling

The Databricks Certified Associate Developer for Apache Spark certification exam assesses
understanding of the Apache Spark architecture and components and the ability to apply the
Spark DataFrame API to complete basic data manipulation tasks within a Spark session. These
tasks include selecting, renaming, and manipulating columns; filtering, dropping, sorting, and
aggregating rows; handling missing data; combining, reading, writing, and partitioning DataFrames
with schemas; and working with UDFs and Spark SQL functions. In addition, the exam assesses
the basics of the Spark architecture, such as execution/deployment modes, the execution hierarchy,
fault tolerance, garbage collection, lazy evaluation, shuffling, the use of actions, and
broadcasting, as well as Structured Streaming, Spark Connect, and common troubleshooting and tuning
techniques. Individuals who pass this certification exam can be expected to complete basic Spark
DataFrame tasks using Python.

Skills / Knowledge

  • Apache Spark
  • Apache Spark Architecture and Components
  • Spark SQL
  • Apache Spark™ DataFrame/DataSet API Applications
  • Apache Spark DataFrame API Applications
  • Structured Streaming
  • Spark Connect
  • Pandas API

Issued on

April 1, 2025

Expires on

March 31, 2027