Course Outline

Introduction

  • Overview of Databricks and Apache Spark
  • Understanding the Databricks architecture

Getting Started

  • Setting up the Environment
  • Setting up and configuring Databricks
  • Navigating the Databricks user interface
  • Creating a Databricks workspace

Working with Data in Databricks

  • Connecting to an Apache Spark data source
  • Understanding the basics columns and datatypes
  • Managing file system into Notebooks

Managing Jobs and Clusters

  • Creating and configuring clusters
  • Creating jobs using Notebook
  • Running jobs
  • Viewing jobs and job details

Using Delta Lake in Databricks

  • Loading data into Delta Lake
  • Managing data in Delta Lake

Securing Databricks

  • Managing Databricks security
  • Managing backup and recovery

Troubleshooting

Summary and Next Steps

Requirements

  • Basic understanding of data analytics
  • Knowledge of Apache Spark

Audience

  • Data Engineers
  • Data Scientists
  • Developers
 14 Hours

Number of participants



Price per participant

Testimonials (2)

Related Courses

Analytic Functions Fundamentals

21 Hours

Apache Arrow for Data Analysis across Disparate Data Sources

14 Hours

AWS Glue Fundamentals

14 Hours

Azure for Data Engineer

35 Hours

A Practical Introduction to Data Analysis and Big Data

35 Hours

Data and Analytics - from the ground up

42 Hours

Scaling Data Analysis with Python and Dask

14 Hours

Data Analysis for Marketers

14 Hours

Data Analytics With R

21 Hours

Datameer for Data Analysts

14 Hours

Data Analysis with Python, Pandas and Numpy

14 Hours

A Practical Introduction to Data Science

35 Hours

Introduction to dbt Cloud

21 Hours

Dremio for Self-Service Data Analysis

21 Hours

Elasticsearch for Developers

14 Hours

Related Categories

1