This project focused on the critical process of data cleaning and transformation using a real-world dataset of Nashville housing records and SQL. The goal was to convert raw, messy data into a clean, structured and reliable dataset suitable for analysis.
Key steps included:
-Data Transformation: SQL queries were used to clean and standardise the dataset, including handling date conversions and populating missing address information.
-Data Parsing: Advanced SQL functions were applied to extract and split address details (street, city, state) from single columns into multiple, structured fields.
-Data Quality: The project improved data consistency by standardising categorical values and eliminating duplicate records, ensuring the dataset was accurate for subsequent analysis.