● Built a complete Formula 1 data pipeline using Databricks and Delta Lake on Azure Data Lake, ingesting diverse data
formats and metadata/images via APIs.
● Implemented incremental loading using merge techniques and used Azure Data Factory event-based triggers to detect new
weekly races, applying PySpark for transformations and aggregations.
● Developed a rich Power BI dashboard connected to the Gold layer, visualizing top drivers, constructors, circuits, wins and
points with filters by year, decade, and overall.