Formula-1 Data Analysis with Azure Databricks

View on GitHub

  • Built a real world data project using Azure Databricks and Spark Core - Formula1 Data Analysis.
  • Used Azure Databricks, Delta Lake, Spark Core, Azure Data Lake Gen2 and Azure Data Factory (ADF).
  • Created notebooks, dashboards, clusters, cluster pools and jobs in Azure Databricks.
  • Ingested and transformed data using PySpark in Azure Databricks.
  • Transformed and analysed data using Spark SQL in Azure Databricks
  • Implemented a Lakehouse architecture using Delta Lake.
  • Created Azure Data Factory pipelines to execute Databricks notebooks.
  • Created dashboards using databricks notebooks.
  • Created Azure Data Factory triggers to schedule pipelines as well as monitor them.
  • Connected to Azure Databricks from PowerBI to create reports.
  • Gained a comprehensive understanding about Unity Catalog and the data governance capabilities offered by Unity Catalog.
  • Implemented a data governance solution using Unity Catalog enabled Databricks workspace.

drivers constructors