# Logistic Regression: a quick introduction

Logistic regression is a very popular machine learning method, used to predict how likely something is. (The outputs are not exact probabilities, but approximate values.)

Basically, logistic regression (LR) builds on three ideas: regression, linear regression, and the logistic function.

## Regression

Regression means estimating the unknown parameters of a function whose form is given.

For example, in $y = a \times x + b$ the unknown parameters are $a$ and $b$. If we have a large set of $(x, y)$ pairs, we can estimate $a$ and $b$ approximately, for instance by drawing a line through the points.
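As a minimal sketch of this idea, the snippet below estimates $a$ and $b$ from a handful of made-up $(x, y)$ pairs using the closed-form least-squares formulas for a straight line. The function name `fit_line` and the sample data are assumptions for illustration only.

```python
def fit_line(xs, ys):
    """Estimate a and b in y = a * x + b by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope: co-variation of x and y divided by the variation of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    a = sxy / sxx
    # Intercept: the fitted line passes through the point of means.
    b = mean_y - a * mean_x
    return a, b


# Made-up points that lie roughly on y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.1, 2.9, 5.2, 6.8, 9.1]
a, b = fit_line(xs, ys)
print(f"a = {a:.2f}, b = {b:.2f}")  # a = 1.99, b = 1.04
```

The more points we have, the less a single noisy measurement affects the estimate.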

But in the real world, we are rarely handed a “certain function” such as $G = m \times g$ (who knows where it came from…). Instead, we assume a structure for the function by analysing a large dataset. Whether the regression is linear or non-linear depends on the form of that function.

## Linear Regression

With linear regression, we can easily solve the $y = a \times x + b$ problem described above.

If a function has more than one variable, such as $f(x_0, x_1, x_2, \ldots)$, we can also leave it to linear regression and use it as a tool to solve for the parameters.

But keep in mind that the assumed formula needs to be the right one: if the real relation is not linear, the fit will be poor.
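With several variables the model is still a weighted sum. Here is a minimal sketch, assuming the linear form $f(x_0, x_1, \ldots) = w_0 x_0 + w_1 x_1 + \ldots + b$. The names `linear_model`, `weights` and `bias`, and all the numbers, are hypothetical.

```python
def linear_model(weights, bias, features):
    """Evaluate f(x0, x1, ...) = w0*x0 + w1*x1 + ... + bias."""
    if len(weights) != len(features):
        raise ValueError("weights and features must have the same length")
    return sum(w * x for w, x in zip(weights, features)) + bias


# Hypothetical parameters; in practice a regression tool estimates them from data.
weights = [0.5, -1.0, 2.0]
bias = 0.25
value = linear_model(weights, bias, [2.0, 1.0, 3.0])
print(value)  # 6.25
```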

## Logistic Function

We need some values to lie in the range 0 to 1; we call this normalisation.

Logistic regression is a linear regression whose output is normalised by a logistic function.

To some extent, the logistic function can also help us reduce the effect of noise, because it squashes extreme values into a bounded range.
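The logistic function commonly used for this is $\sigma(z) = 1 / (1 + e^{-z})$. The sketch below applies it to the output of a linear function. The names `logistic` and `predict_probability` are assumptions chosen for illustration.

```python
import math


def logistic(z):
    """Squash any real number z into the range 0 to 1."""
    # Branch on the sign so math.exp never receives a large positive argument.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predict_probability(weights, bias, features):
    """Linear function of the features, normalised by the logistic function."""
    z = sum(w * x for w, x in zip(weights, features)) + bias
    return logistic(z)


print(logistic(0.0))  # 0.5, the midpoint of the curve
print(logistic(-50.0), logistic(50.0))  # very close to 0 and to 1
```

However large or small the linear part becomes, the result stays between 0 and 1.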

## When do we use Logistic Regression?

1) Predictions on probabilities, or classification

Not all ML methods can predict probabilities directly; an SVM, for example, does not. For classification, a threshold on the predicted probability is usually needed.

2) Only for linear problems

When the features and the target have a linear relation (more precisely, when the log-odds of the target is linear in the features), logistic regression can be useful.

3) Dependence of features

The features do not all have to be conditionally independent; the contribution of each feature is calculated independently. No posterior probabilities are needed. LR will not build combinations of features on its own.
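For the first point above, turning a predicted probability into a class can be sketched as follows. The function name `classify`, the default threshold of 0.5 and the sample probabilities are all assumptions; the right threshold depends on the problem.

```python
def classify(probability, threshold=0.5):
    """Return class 1 if the predicted probability reaches the threshold, else 0."""
    return 1 if probability >= threshold else 0


# Hypothetical predicted probabilities for four samples.
probabilities = [0.08, 0.47, 0.50, 0.93]
labels = [classify(p) for p in probabilities]
print(labels)  # [0, 0, 1, 1]

# A stricter threshold turns borderline cases into class 0.
strict_labels = [classify(p, threshold=0.9) for p in probabilities]
print(strict_labels)  # [0, 0, 0, 1]
```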

## Logistic Regression Methods

Cost function: all the points $(x_i, y_i)$ should lie very close to the final line. So we define a cost function and look for its minimum. We use the least squares method: $\sum_i \left( f(x_i) - y_i \right)^2$. (For logistic regression itself, the cost that is usually minimised is the log loss rather than the squared error.)
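A minimal sketch of the least-squares cost, assuming `f` is any candidate function and `points` is a list of $(x_i, y_i)$ pairs. The data and the two candidate lines are made up for illustration.

```python
def squared_error_cost(f, points):
    """Sum of squared differences between f(x_i) and y_i over all points."""
    return sum((f(x) - y) ** 2 for x, y in points)


def good_line(x):
    # Hypothetical candidate that passes through every sample point.
    return 2.0 * x + 1.0


def poor_line(x):
    # Hypothetical candidate with the wrong slope and intercept.
    return 1.0 * x + 0.0


points = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0)]
print(squared_error_cost(good_line, points))  # 0.0
print(squared_error_cost(poor_line, points))  # 14.0
```

A lower cost means the candidate sits closer to the points, so the best parameters are the ones that minimise this value.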

But that only defines the function; how do we find the parameter values that minimise it? We use gradient descent.
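Gradient descent on the least-squares cost can be sketched as follows for the simple line $y = a \times x + b$. The function name `gradient_descent_line`, the learning rate, the number of steps and the data are assumptions chosen so that this small example converges. Other data would need different settings.

```python
def gradient_descent_line(points, learning_rate=0.05, steps=2000):
    """Minimise sum((a*x + b - y)**2) over a and b by gradient descent."""
    a, b = 0.0, 0.0  # arbitrary starting guess
    for _ in range(steps):
        # Partial derivatives of the cost with respect to a and b.
        grad_a = sum(2.0 * (a * x + b - y) * x for x, y in points)
        grad_b = sum(2.0 * (a * x + b - y) for x, y in points)
        # Step against the gradient to reduce the cost.
        a -= learning_rate * grad_a
        b -= learning_rate * grad_b
    return a, b


# Made-up points that lie exactly on y = 2x + 1.
points = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0)]
a, b = gradient_descent_line(points)
print(f"a = {a:.3f}, b = {b:.3f}")  # a = 2.000, b = 1.000
```

If the learning rate is too large the updates overshoot and diverge; if it is too small, many more steps are needed.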