Gradient of a functionThe gradient of a differentiable function contains the first derivatives of the function with respect to each variable. As seen here, the gradient is useful to find the linear approximation of the function near a point.
DefinitionThe gradient of at , denoted , is the vector in given by Examples:
The function is differentiable, provided , which we assume. Then
The gradient of at is where , . More generally, the gradient of the function with values is given by where , and . Composition rule with an affine functionIf is a matrix, and is a vector, the function with values is called the composition of the affine map with . Its gradient is given by (see here for a proof) Geometric interpretationGeometrically, the gradient can be read on the plot of the level set of the function. Specifically, at any point , the gradient is perpendicular to the level set, and points outwards from the sub-level set (that is, it points towards higher values of the function). |