- Tristen WentlingApril 1, 2025https://credentials.databricks.com/4a0bf711-6ea8-4392-abed-54d2eb1812d8139110688Tristen WentlingApril 1, 2025March 31, 2027Tristen Wentling139110688April 1, 2025March 31, 2027
Tristen Wentling
The Databricks Certified Associate Developer for Apache Spark certification exam assesses the
understanding of the Apache Spark Architecture and Components and the ability to apply the
Spark DataFrame API to complete basic data manipulation tasks within a Spark session. These
tasks include selecting, renaming and manipulating columns; filtering, dropping, sorting, and
aggregating rows; handling missing data; combining, reading, writing and partitioning DataFrames
with schemas; and working with UDFs and Spark SQL functions. In addition, the exam will assess
the basics of the Spark architecture like execution/deployment modes, the execution hierarchy,
fault tolerance, garbage collection, lazy evaluation, Shuffling and usage of Actions and
broadcasting, Structured Streaming, Spark Connect, and common troubleshooting and tuning
techniques, Individuals who pass this certification exam can be expected to complete basic Spark
DataFrame tasks using Python.
Skills / Knowledge
- Apache Spark
- Apache Spark Architecture and Components
- Spark SQL
- Apache Spark™ DataFrame/DataSet API Applications
- Apache Spark DataFrame API Applications
- Structured Streaming
- Spark Connect
- Pandas API