As you can see from above, usually, most of your time (some estimate 70% or more) is spent on getting and preparing your data. So get used to the "janitor work", it's good for your data science muscles!
A simple yet powerful machine learning algorithm is logistic regression. It's sort of the linear equation you learned in high school, $y = mx + b$. You feed your input, $x$, into the linear function and it spits out an output, $y$.
Instead of a linear function, logistic regression uses a sigmoid function to give a binary output, 1 or 0. In reality, it produces a probability. As you also remember from your high school math, probability is between 0 and 1. In logistic regression, the default threshold is set at 0.5. This means that if the probability is less than 0.5, well, the model outputs 0. If equal or greater than 0.5, it outputs 1.
Easy enough. If you are still curious, here's what the sigmoid function looks like.