split training and testing data to be applied to model i use the logistic regression (sigmoid function )algorithm implemented from scikit learn library as it deals successfully with our case of binary classification