Databricks
Databricks Logo

Advanced Data Engineering with Databricks 

  • Offered byDatabricks

Advanced Data Engineering with Databricks
 at 
Databricks 
Overview

The course delves into advanced concepts and practices essential for designing, building, and maintaining scalable data systems

Duration

Total fee

1.27 Lakh

Mode of learning

Online

Difficulty level

Advanced

Official Website

Go to Website External Link Icon

Credential

Certificate

Advanced Data Engineering with Databricks
Table of content
Accordion Icon V3

Advanced Data Engineering with Databricks
 at 
Databricks 
Highlights

  • Earn a certificate after completion of course from Databricks
Details Icon

Advanced Data Engineering with Databricks
 at 
Databricks 
Course details

Skills you will learn
More about this course

The "Advanced Data Engineering" course is designed for professionals who are looking to deepen their expertise in the field of data engineering and tackle sophisticated data challenges

In this course, students will build upon their existing knowledge of Apache Spark, Structured Streaming, and Delta Lake to unlock the full potential of the data lakehouse by utilizing the suite of tools provided by Databricks

This course places a heavy emphasis on designs favoring incremental data processing, enabling systems optimized to continuously ingest and analyze ever-growing data

 

 

Advanced Data Engineering with Databricks
 at 
Databricks 
Curriculum

Incremental Processing with Spark Structured Streaming and Delta Lake

Streaming Data Concepts

Introduction to Structured Streaming

Aggregations, Time Windows, Watermarks

Delta Live Tables Review

Auto Loader

 

Streaming ETL Patterns with DLT

Data Ingestion Patterns

Data Quality Enforcement Patterns

Data Modeling

Streaming Joins and Statefulness

 

Data Privacy Patterns

Store Data Securely

Streaming Data and CDF

Deleting Data in Databricks

 

Performance Optimization with Spark and Delta Lake

Spark Architecture

Designing the Foundation

Introduction of Spark UI

Fine-Tuning - Choosing the Right Cluster

Code Optimization

Shuffles

Spill
Skew

 Serialization

 

SWE Practices for Delta Live Tables Pipelines

 

Automate Production Workflows

Introduction to REST API and CLI

Deploy Batch and Streaming Jobs

Working with Terraform

Other courses offered by Databricks

1.27 L
16 hours
– / –
– / –
– / –
– / –
– / –
– / –
– / –
– / –
– / –
– / –
View Other 32 CoursesRight Arrow Icon
qna

Advanced Data Engineering with Databricks
 at 
Databricks 

Student Forum

chatAnything you would want to ask experts?
Write here...