تفاصيل العمل

A complete PySpark pipeline that processes raw retail sales data and produces key business insights. The project includes reading CSV files, cleaning and transforming data, handling missing values, and performing aggregations to compute metrics such as top-selling products, monthly sales trends, average order value per customer, and regional sales performance. The final results are written in optimized formats like Parquet/CSV for downstream analytics.

ملفات مرفقة

بطاقة العمل

اسم المستقل
عدد الإعجابات
0
عدد المشاهدات
14
تاريخ الإضافة
تاريخ الإنجاز
المهارات