Last lecture

Linear classification

Logistic regression

Optimization

Given a function $f(w)$, find its minimum (or maximum), where $f$ is called the objective function. Maximizing $f$ is the same as minimizing $-f$.

Gradient Descent

Stationary points: where $\nabla J(\theta) = \vec 0$. When a function is “well-behaved,” (e.g. convex) then we have simple global maximum. Use gradient descent to observe it.

Procedure