Project Details

This project builds a complete data analytics pipeline for Netflix content using PySpark.

The pipeline covers ingestion, cleaning, and transformation of a dataset of Netflix movies and TV shows. After processing, it generates several insights, such as the distribution of content, the countries producing the most content, and yearly trends.

The project demonstrates end-to-end Data Engineering skills, including ETL processes, big data processing with Spark, and preparing data for visualization.

Technologies used:

- Python

- PySpark

- Pandas

- SQL

- Parquet storage

This project is part of my Data Engineering portfolio.
