Skip to content

Databricks Project including understanding of Data Lake, DataFrames. SQL, Python

Notifications You must be signed in to change notification settings

sonya-stefanova/formula1_data_engineering_project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 

Repository files navigation

Formula 1 Data Engineering Project

Real World Project on Formula1 Racing. The technology used: Azure Databricks, Delta Lake, Unity Catalog, Azure Data Factory The project includes:

  • application of design patterns: full data load, incremental load;
  • working with delta and parquet files;
  • working with Azure storage blob containers;
  • Data vacuuming, deletion, time travel and restoration; dashboard_dominant_teams databricks_containers_partitionBy report_on_f1_drivers report_dashboard

About

Databricks Project including understanding of Data Lake, DataFrames. SQL, Python

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages