Get the data clean it then visualize it
To understand it, then clean it remove outliers deal with over fitting
Then predict values