https://www.databricks.com/learn/certification/data-engineer-associate
Exam Details
Key details about the certification exam are provided below.
Minimally Qualified Candidate
The minimally qualified candidate should be able to:
- Understand how to use and the benefits of using the Databricks Lakehouse Platform and its tools, including:
  - Data Lakehouse (architecture, descriptions, benefits)
  - Data Science and Engineering workspace (clusters, notebooks, data storage)
  - Delta Lake (general concepts, table management and manipulation, optimizations)
- Build ETL pipelines using Apache Spark SQL and Python, including:
  - Relational entities (databases, tables, views)
  - ELT (creating tables, writing data to tables, cleaning data, combining and reshaping tables, SQL UDFs)
  - Python (facilitating Spark SQL with string manipulation and control flow, passing data between PySpark and Spark SQL)
- Incrementally process data, including:
  - Structured Streaming (general concepts, triggers, watermarks)
  - Auto Loader (streaming reads)
  - Multi-hop Architecture (bronze-silver-gold, streaming applications)
  - Delta Live Tables (benefits and features)
- Build production pipelines for data engineering applications and Databricks SQL queries and dashboards, including:
  - Jobs (scheduling, task orchestration, UI)
  - Dashboards (endpoints, scheduling, alerting, refreshing)
- Understand and follow best security practices, including:
  - Unity Catalog (benefits and features)
  - Entity Permissions (team-based permissions, user-based permissions)
Duration
Testers will have 90 minutes to complete the certification exam.
Questions
There are 45 multiple-choice questions on the certification exam. The questions will be distributed by high-level topic in the following way:
- Databricks Lakehouse Platform – 24% (11/45)
- ELT with Spark SQL and Python – 29% (13/45)
- Incremental Data Processing – 22% (10/45)
- Production Pipelines – 16% (7/45)
- Data Governance – 9% (4/45)
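As a quick sanity check on the distribution above, the following sketch recomputes each topic's share from its question count and derives the average time available per question. The variable names are illustrative only; the counts, the 45-question total, and the 90-minute duration are taken from the exam details above.

```python
# Question counts per topic, copied from the list above.
questions_per_topic = {
    "Databricks Lakehouse Platform": 11,
    "ELT with Spark SQL and Python": 13,
    "Incremental Data Processing": 10,
    "Production Pipelines": 7,
    "Data Governance": 4,
}

TOTAL_QUESTIONS = 45   # stated number of multiple-choice questions
DURATION_MINUTES = 90  # stated exam duration

# The per-topic counts should add up to the stated total.
total = sum(questions_per_topic.values())

# Share of the exam per topic, rounded to a whole percent.
percent_per_topic = {
    topic: round(100 * count / TOTAL_QUESTIONS)
    for topic, count in questions_per_topic.items()
}

# Average time budget per question.
minutes_per_question = DURATION_MINUTES / TOTAL_QUESTIONS

for topic, pct in percent_per_topic.items():
    print(f"{topic}: {questions_per_topic[topic]}/{TOTAL_QUESTIONS} = {pct}%")
print(f"Average time per question: {minutes_per_question:.0f} minutes")
```

The rounded shares match the published percentages, and the time budget works out to about two minutes per question on average.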
Preparation
In order to learn the content assessed by the certification exam, candidates should take one of the following Databricks Academy courses:
- Instructor-led: Data Engineering with Databricks
- Self-paced: Data Engineering with Databricks (available in Databricks Academy)
Candidates can also learn more about the certification exam by taking the exam's overview course (coming soon).
Before taking the exam, it is recommended that candidates complete the practice exam.
https://community.cloud.databricks.com/
---
layout: post
title: "Databricks Certified Data Engineer Associate Journey"
comments: true
date: "2023-01-08 03:36:13.288000+00:00"
---
https://www.linkedin.com/posts/jihen-trabelsi_databricks-dataengineering-clouddataengineer-ugcPost-7015682578137391104-NQ4Y?utm_source=share&utm_medium=member_desktop
- Practice exam: https://lnkd.in/ePY3AeD4 https://files.training.databricks.com/assessments/practice-exams/PracticeExam-DataEngineerAssociate.pdf
- Complete course repo: https://lnkd.in/eKhmhBAJ https://github.com/databricks-academy/data-engineering-with-databricks-english
- Databricks Community Edition: https://lnkd.in/eYAyW_r7 https://community.cloud.databricks.com/login.html
- Data Engineering on Databricks demo video: https://lnkd.in/ecHmAPUq
- Databricks training platform link: https://lnkd.in/eFKcrptB
https://www.linkedin.com/posts/aviralb_how-to-crack-databricks-data-engineer-associate-activity-7015641266625372160-8MDR?utm_source=share&utm_medium=member_desktop
https://medium.com/knowledgelens/how-to-crack-databricks-data-engineer-associate-exam-2023-41cb3eb6d29d
How to prepare for the certification:
- Complete the Data Engineering with Databricks course (Databricks Academy)
- Complete the Data Engineering notebooks (Link)
- Read the Databricks documentation (recommended)
Additional resources: