在阅读这篇文章之前,先阅读这一篇文章:
Partial Derivatives and Vector Fields
1. Gradient
The gradient of a scalar-valued multivariable function f ( x , y , … ) f(x, y, \dots) f(x,y,…), denoted ∇ f \nabla{f} ∇f, packages all its partial derivative information into a vector:
∇ f = [ ∂ f ∂ x ∂ f ∂ y … ] \nabla{f}=\begin{bmatrix} \\\frac{\partial f}{\partial x} \\ \\\frac{\partial f}{\partial y} \\\dots \end{bmatrix} ∇f=⎣⎢⎢⎢⎢⎡∂x∂f∂y∂f…⎦⎥⎥⎥⎥⎤
In particular, this means ∇ f \nabla {f} ∇f, f f f is a vector-valued function.
-
If you imagine standing at a point ( x 0 , y 0 , … ) (x_{0}, y_{0}, \dots) (x0,y0,…) in the input space of f f f, the vector ∇ f \nabla f ∇f tells you which direction you should travel to increase the value of f f f most rapidly.
-
These gradient vectors ∇ f \nabla f ∇f are also perpendicular to the Contour lines of f f f.
In the case of scalar-valued multivariable functions, those with a multidimensional input but a one-dimensional output, the full derivative of such a function is the gradient.
Credit To: The gradient
2. Directional Derivatives
Consider some multivariable function: