Cost Function of Logistic Regression
Recall the cost function of logistic regression:
$$J(\theta)=-\frac{1}{m}\left[\sum_{i=1}^my^{(i)}\log\left(h_{\theta}(x^{(i)})\right)+(1-y^{(i)})\log\left(1-h_{\theta}(x^{(i)})\right)\right]\tag{1}$$
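As a quick illustration of equation (1), here is a minimal NumPy sketch; the names `X`, `y`, `theta`, and the `sigmoid` helper are assumptions for this example rather than anything defined in the post.

```python
import numpy as np

def sigmoid(z):
    # Logistic function g(z) = 1 / (1 + e^(-z))
    return 1.0 / (1.0 + np.exp(-z))

def logistic_cost(theta, X, y):
    # Equation (1): average cross-entropy over the m training examples,
    # where X is an (m, n) design matrix and y holds 0/1 labels.
    m = y.shape[0]
    h = sigmoid(X @ theta)  # h_theta(x^(i)) for every example at once
    return -(1.0 / m) * np.sum(y * np.log(h) + (1 - y) * np.log(1 - h))
```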
For linear regression, we have learned two learning algorithms: one based on gradient descent, and another based on the normal equation.
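As a reminder of the second approach, here is a minimal sketch of the normal equation; the design matrix `X` (with a leading column of ones) and target vector `y` are assumed names for this illustration.

```python
import numpy as np

def normal_equation(X, y):
    # Closed-form least-squares solution: theta = (X^T X)^(-1) X^T y.
    # pinv is used so the computation still works when X^T X is singular.
    return np.linalg.pinv(X.T @ X) @ X.T @ y
```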
In the context of regularization, both underfitting (high bias) and overfitting (high variance) are undesirable.
To compute $J(\theta)$ and $\frac{\partial}{\partial\theta_j}J(\theta)$ more efficiently for a given $\theta$, there are several algorithms available.
Recall from the previous post that
$$\begin{aligned}J(\theta)&=\frac{1}{m}\sum_{i=1}^m\frac{1}{2}\left(h_{\theta}(x^{(i)})-y^{(i)}\right)^2\\&=\frac{1}{m}\sum_{i=1}^m\mathrm{Cost}\left(h_{\theta}(x^{(i)}),y^{(i)}\right)\end{aligned}$$
$$\lbrace(x^{(1)},y^{(1)}),(x^{(2)},y^{(2)}),\dots,(x^{(m)},y^{(m)})\rbrace$$
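As a quick sketch of this squared-error cost over the $m$ training examples above, reusing the assumed names `X`, `y`, and `theta` from the earlier snippet:

```python
import numpy as np

def linear_cost(theta, X, y):
    # Mean of the per-example term Cost(h_theta(x), y) = (1/2) * (h - y)^2.
    m = y.shape[0]
    h = X @ theta  # linear hypothesis h_theta(x^(i)) for every example
    return (1.0 / m) * np.sum(0.5 * (h - y) ** 2)
```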
Assume $h_{\theta}(x)=g(\theta_0+\theta_1x_1+\theta_2x_2)$, and
$$\boldsymbol{\theta}=
\begin{bmatrix}
\theta_0 \\
\theta_1 \\
\theta_2 \\
\end{bmatrix}=
\begin{bmatrix}
-3 \\
1 \\
1 \\
\end{bmatrix}$$
$$h_{\theta}(x)=g(\boldsymbol{\theta}^\top\mathbf{x})$$
where $g(z)$ is the sigmoid function (i.e., the logistic function):
$$g(z)=\frac{1}{1+e^{-z}}$$
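Putting the pieces together, here is a small sketch that evaluates $h_{\theta}(x)=g(\boldsymbol{\theta}^\top\mathbf{x})$ with the parameter vector $\theta=[-3,1,1]$ assumed above; the two test points are made up purely for illustration.

```python
import numpy as np

theta = np.array([-3.0, 1.0, 1.0])  # [theta_0, theta_1, theta_2] from above

def h(x1, x2):
    # h_theta(x) = g(theta^T x) with x = [1, x1, x2]
    x = np.array([1.0, x1, x2])
    return 1.0 / (1.0 + np.exp(-(theta @ x)))

print(h(1.0, 1.0))  # theta^T x = -1 -> g(-1) ~ 0.27, so predict y = 0
print(h(3.0, 3.0))  # theta^T x =  3 -> g(3)  ~ 0.95, so predict y = 1
```

With these parameters the model predicts $y=1$ exactly when $\boldsymbol{\theta}^\top\mathbf{x}\ge0$, i.e. when $x_1+x_2\ge3$, so the decision boundary is the line $x_1+x_2=3$.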