Directional derivative

From Wikipedia, the free encyclopedia
(Redirected from Normal derivative)
Jump to navigation Jump to search

In multivariable calculus, the directional derivative measures the rate at which a function changes in a particular direction at a given point.[citation needed]

The directional derivative of a multivariable differentiable scalar function along a given vector v at a given point x represents the instantaneous rate of change of the function in the direction v through x.

Many mathematical texts assume that the directional vector is normalized (a unit vector), meaning that its magnitude is equivalent to one. This is by convention and not required for proper calculation. In order to adjust a formula for the directional derivative to work for any vector, one must divide the expression by the magnitude of the vector. Normalized vectors are denoted with a circumflex (hat) symbol: ^.

The directional derivative of a scalar function f with respect to a vector v (denoted as 𝐯^ when normalized) at a point (e.g., position) (x,f(x)) may be denoted by any of the following: 𝐯f(𝐱)=f'𝐯(𝐱)=D𝐯f(𝐱)=Df(𝐱)(𝐯)=𝐯f(𝐱)=f(𝐱)𝐯=𝐯^f(𝐱)=𝐯^f(𝐱)𝐱.

It therefore generalizes the notion of a partial derivative, in which the rate of change is taken along one of the curvilinear coordinate curves, all other coordinates being constant. The directional derivative is a special case of the Gateaux derivative.

Definition

[edit | edit source]
File:Directional derivative contour plot.svg
A contour plot of f(x,y)=x2+y2, showing the gradient vector in black, and the unit vector 𝐮 scaled by the directional derivative in the direction of 𝐮 in orange. The gradient vector is longer because the gradient points in the direction of greatest rate of increase of a function.

The directional derivative of a scalar function f(𝐱)=f(x1,x2,,xn) along a vector 𝐯=(v1,,vn) is the function 𝐯f defined by the limit[1] 𝐯f(𝐱)=limh0f(𝐱+h𝐯)f(𝐱)h||𝐯||=1||𝐯||ddtf(𝐱+t𝐯)|t=0.

This definition is valid in a broad range of contexts, for example, where the norm of a vector (and hence a unit vector) is defined.[2]

For differentiable functions

[edit | edit source]

If the function f is differentiable at x, then the directional derivative exists along any vector v at x, and one has

𝐯f(𝐱)=f(𝐱)𝐯||𝐯||

where the on the right denotes the gradient and is the dot product.[3]

It can be derived by using the property that all directional derivatives at a point make up a single tangent plane which can be defined using partial derivatives. This can be used to find a formula for the gradient vector and an alternative formula for the directional derivative, the latter of which can be rewritten as shown above for convenience.

It also follows from defining a path h(t)=x+tv and using the definition of the derivative as a limit which can be calculated along this path to get: 0=limt0f(x+tv^)f(x)tf(x)v^t=limt0f(x+tv^)f(x)tf(x)v^=vf(x)f(x)v^.f(x)v^=vf(x)

Using only direction of vector

[edit | edit source]
File:Geometrical interpretation of a directional derivative.svg
The angle α between the tangent A and the horizontal will be maximum if the cutting plane contains the direction of the gradient A.

In a Euclidean space, some authors[4] define the directional derivative to be with respect to an arbitrary nonzero vector v after normalization, thus being independent of its magnitude and depending only on its direction.[5]

This definition gives the rate of increase of f per unit of distance moved in the direction given by v. In this case, one has 𝐯f(𝐱)=limh0f(𝐱+h𝐯)f(𝐱)h||𝐯||, or in case f is differentiable at x, 𝐯f(𝐱)=f(𝐱)𝐯||𝐯||.

Restriction to a unit vector

[edit | edit source]

In the context of a function on a Euclidean space, some texts restrict the vector v to being a unit vector for convention. Both of the above equations remain true, though redundant, when a vector is normalized.[6]

Properties

[edit | edit source]

Many of the familiar properties of the ordinary derivative hold for the directional derivative. These include, for any functions f and g defined in a neighborhood of, and differentiable at, p:

  1. sum rule: 𝐯(f+g)=𝐯f+𝐯g.
  2. constant factor rule: For any constant c, 𝐯(cf)=c𝐯f.
  3. product rule (or Leibniz's rule): 𝐯(fg)=g𝐯f+f𝐯g.
  4. chain rule: If g is differentiable at p and h is differentiable at g(p), then 𝐯(hg)(𝐩)=h(g(𝐩))𝐯g(𝐩).

In differential geometry

[edit | edit source]

Let M be a differentiable manifold and p a point of M. Suppose that f is a function defined in a neighborhood of p, and differentiable at p. If v is a tangent vector to M at p, then the directional derivative of f along v, denoted variously as df(v) (see Exterior derivative), 𝐯f(𝐩) (see Covariant derivative), L𝐯f(𝐩) (see Lie derivative), or 𝐯𝐩(f) (see Tangent space § Definition via derivations), can be defined as follows. Let γ : [−1, 1] → M be a differentiable curve with γ(0) = p and γ′(0) = v. Then the directional derivative is defined by 𝐯f(𝐩)=ddτfγ(τ)|τ=0. This definition can be proven independent of the choice of γ, provided γ is selected in the prescribed manner so that γ(0) = p and γ′(0) = v.

The Lie derivative

[edit | edit source]

The Lie derivative of a vector field Wμ(x) along a vector field Vμ(x) is given by the difference of two directional derivatives (with vanishing torsion): VWμ=(V)Wμ(W)Vμ. In particular, for a scalar field ϕ(x), the Lie derivative reduces to the standard directional derivative: Vϕ=(V)ϕ.

The Riemann tensor

[edit | edit source]

Directional derivatives are often used in introductory derivations of the Riemann curvature tensor. Consider a curved rectangle with an infinitesimal vector δ along one edge and δ along the other. We translate a covector S along δ then δ and then subtract the translation along δ and then δ. Instead of building the directional derivative using partial derivatives, we use the covariant derivative. The translation operator for δ is thus 1+νδνDν=1+δD, and for δ, 1+μδ'μDμ=1+δD. The difference between the two paths is then (1+δD)(1+δD)Sρ(1+δD)(1+δD)Sρ=μ,νδ'μδν[Dμ,Dν]Sρ. It can be argued[7] that the noncommutativity of the covariant derivatives measures the curvature of the manifold: [Dμ,Dν]Sρ=±σRσρμνSσ, where R is the Riemann curvature tensor and the sign depends on the sign convention of the author.

In group theory

[edit | edit source]

Translations

[edit | edit source]

In the Poincaré algebra, we can define an infinitesimal translation operator P as 𝐏=i. (the i ensures that P is a self-adjoint operator) For a finite displacement λ, the unitary Hilbert space representation for translations is[8] U(𝝀)=exp(i𝝀𝐏). By using the above definition of the infinitesimal translation operator, we see that the finite translation operator is an exponentiated directional derivative: U(𝝀)=exp(𝝀). This is a translation operator in the sense that it acts on multivariable functions f(x) as U(𝝀)f(𝐱)=exp(𝝀)f(𝐱)=f(𝐱+𝝀).

Proof of the last equation

In standard single-variable calculus, the derivative of a smooth function f(x) is defined by (for small ε) dfdx=f(x+ε)f(x)ε. This can be rearranged to find f(x+ε): f(x+ε)=f(x)+εdfdx=(1+εddx)f(x). It follows that [1+ε(d/dx)] is a translation operator. This is instantly generalized[9] to multivariable functions f(x) f(𝐱+𝜺)=(1+𝜺)f(𝐱). Here 𝜺 is the directional derivative along the infinitesimal displacement ε. We have found the infinitesimal version of the translation operator: U(𝜺)=1+𝜺. It is evident that the group multiplication law[10] U(g)U(f)=U(gf) takes the form U(𝐚)U(𝐛)=U(𝐚+𝐛). So suppose that we take the finite displacement λ and divide it into N parts (N→∞ is implied everywhere), so that λ/N=ε. In other words, 𝝀=N𝜺. Then by applying U(ε) N times, we can construct U(λ): [U(𝜺)]N=U(N𝜺)=U(𝝀). We can now plug in our above expression for U(ε): [U(𝜺)]N=[1+𝜺]N=[1+𝝀N]N. Using the identity[11] exp(x)=[1+xN]N, we have U(𝝀)=exp(𝝀). And since U(ε)f(x) = f(x+ε) we have [U(𝜺)]Nf(𝐱)=f(𝐱+N𝜺)=f(𝐱+𝝀)=U(𝝀)f(𝐱)=exp(𝝀)f(𝐱), Q.E.D.

As a technical note, this procedure is only possible because the translation group forms an Abelian subgroup (Cartan subalgebra) in the Poincaré algebra. In particular, the group multiplication law U(a)U(b) = U(a+b) should not be taken for granted. We also note that Poincaré is a connected Lie group. It is a group of transformations T(ξ) that are described by a continuous set of real parameters ξa. The group multiplication law takes the form T(ξ¯)T(ξ)=T(f(ξ¯,ξ)). Taking ξa=0 as the coordinates of the identity, we must have fa(ξ,0)=fa(0,ξ)=ξa. The actual operators on the Hilbert space are represented by unitary operators U(T(ξ)). In the above notation we suppressed the T; we now write U(λ) as U(P(λ)). For a small neighborhood around the identity, the power series representation U(T(ξ))=1+iaξata+12b,cξbξctbc+ is quite good. Suppose that U(T(ξ)) form a non-projective representation, i.e., U(T(ξ¯))U(T(ξ))=U(T(f(ξ¯,ξ))). The expansion of f to second power is fa(ξ¯,ξ)=ξa+ξ¯a+b,cfabcξ¯bξc. After expanding the representation multiplication equation and equating coefficients, we have the nontrivial condition tbc=tbtciafabcta. Since tab is by definition symmetric in its indices, we have the standard Lie algebra commutator: [tb,tc]=ia(fabc+facb)ta=iaCabcta, with C the structure constant. The generators for translations are partial derivative operators, which commute: [xb,xc]=0. This implies that the structure constants vanish and thus the quadratic coefficients in the f expansion vanish as well. This means that f is simply additive: fabeliana(ξ¯,ξ)=ξa+ξ¯a, and thus for abelian groups, U(T(ξ¯))U(T(ξ))=U(T(ξ¯+ξ)). Q.E.D.

Rotations

[edit | edit source]

The rotation operator also contains a directional derivative. The rotation operator for an angle θ, i.e. by an amount θ = |θ| about an axis parallel to θ^=𝜽/θ is U(R(𝜽))=exp(i𝜽𝐋). Here L is the vector operator that generates SO(3): 𝐋=(000001010)𝐢+(001000100)𝐣+(010100000)𝐤. It may be shown geometrically that an infinitesimal right-handed rotation changes the position vector x by 𝐱𝐱δ𝜽×𝐱. So we would expect under infinitesimal rotation: U(R(δ𝜽))f(𝐱)=f(𝐱δ𝜽×𝐱)=f(𝐱)(δ𝜽×𝐱)f. It follows that U(R(δ𝜽))=1(δ𝜽×𝐱). Following the same exponentiation procedure as above, we arrive at the rotation operator in the position basis, which is an exponentiated directional derivative:[12] U(R(𝜽))=exp((𝜽×𝐱)).

Normal derivative

[edit | edit source]

A normal derivative is a directional derivative taken in the direction normal (that is, orthogonal) to some surface in space, or more generally along a normal vector field orthogonal to some hypersurface. See for example Neumann boundary condition. If the normal direction is denoted by 𝐧, then the normal derivative of a function f is sometimes denoted as f𝐧. In other notations, f𝐧=f(𝐱)𝐧=𝐧f(𝐱)=f𝐱𝐧=Df(𝐱)[𝐧].

In the continuum mechanics of solids

[edit | edit source]

Several important results in continuum mechanics require the derivatives of vectors with respect to vectors and of tensors with respect to vectors and tensors.[13] The directional directive provides a systematic way of finding these derivatives.

The definitions of directional derivatives for various situations are given below. It is assumed that the functions are sufficiently smooth that derivatives can be taken.

Derivatives of scalar valued functions of vectors

[edit source]

Let f(v) be a real valued function of the vector v. Then the derivative of f(v) with respect to v (or at v) is the vector defined through its dot product with any vector u being

f𝐯𝐮=Df(𝐯)[𝐮]=[ddαf(𝐯+α𝐮)]α=0

for all vectors u. The above dot product yields a scalar, and if u is a unit vector gives the directional derivative of f at v, in the u direction.

Properties:

  1. If f(𝐯)=f1(𝐯)+f2(𝐯) then f𝐯𝐮=(f1𝐯+f2𝐯)𝐮
  2. If f(𝐯)=f1(𝐯)f2(𝐯) then f𝐯𝐮=(f1𝐯𝐮)f2(𝐯)+f1(𝐯)(f2𝐯𝐮)
  3. If f(𝐯)=f1(f2(𝐯)) then f𝐯𝐮=f1f2f2𝐯𝐮

Derivatives of vector valued functions of vectors

[edit source]

Let f(v) be a vector valued function of the vector v. Then the derivative of f(v) with respect to v (or at v) is the second order tensor defined through its dot product with any vector u being

𝐟𝐯𝐮=D𝐟(𝐯)[𝐮]=[ddα𝐟(𝐯+α𝐮)]α=0

for all vectors u. The above dot product yields a vector, and if u is a unit vector gives the direction derivative of f at v, in the directional u.

Properties:

  1. If 𝐟(𝐯)=𝐟1(𝐯)+𝐟2(𝐯) then 𝐟𝐯𝐮=(𝐟1𝐯+𝐟2𝐯)𝐮
  2. If 𝐟(𝐯)=𝐟1(𝐯)×𝐟2(𝐯) then 𝐟𝐯𝐮=(𝐟1𝐯𝐮)×𝐟2(𝐯)+𝐟1(𝐯)×(𝐟2𝐯𝐮)
  3. If 𝐟(𝐯)=𝐟1(𝐟2(𝐯)) then 𝐟𝐯𝐮=𝐟1𝐟2(𝐟2𝐯𝐮)

Derivatives of scalar valued functions of second-order tensors

[edit source]

Let f(𝑺) be a real valued function of the second order tensor 𝑺. Then the derivative of f(𝑺) with respect to 𝑺 (or at 𝑺) in the direction 𝑻 is the second order tensor defined as f𝑺:𝑻=Df(𝑺)[𝑻]=[ddαf(𝑺+α𝑻)]α=0 for all second order tensors 𝑻.

Properties:

  1. If f(𝑺)=f1(𝑺)+f2(𝑺) then f𝑺:𝑻=(f1𝑺+f2𝑺):𝑻
  2. If f(𝑺)=f1(𝑺)f2(𝑺) then f𝑺:𝑻=(f1𝑺:𝑻)f2(𝑺)+f1(𝑺)(f2𝑺:𝑻)
  3. If f(𝑺)=f1(f2(𝑺)) then f𝑺:𝑻=f1f2(f2𝑺:𝑻)

Derivatives of tensor valued functions of second-order tensors

[edit source]

Let 𝑭(𝑺) be a second order tensor valued function of the second order tensor 𝑺. Then the derivative of 𝑭(𝑺) with respect to 𝑺 (or at 𝑺) in the direction 𝑻 is the fourth order tensor defined as 𝑭𝑺:𝑻=D𝑭(𝑺)[𝑻]=[ddα𝑭(𝑺+α𝑻)]α=0 for all second order tensors 𝑻.

Properties:

  1. If 𝑭(𝑺)=𝑭1(𝑺)+𝑭2(𝑺) then 𝑭𝑺:𝑻=(𝑭1𝑺+𝑭2𝑺):𝑻
  2. If 𝑭(𝑺)=𝑭1(𝑺)𝑭2(𝑺) then 𝑭𝑺:𝑻=(𝑭1𝑺:𝑻)𝑭2(𝑺)+𝑭1(𝑺)(𝑭2𝑺:𝑻)
  3. If 𝑭(𝑺)=𝑭1(𝑭2(𝑺)) then 𝑭𝑺:𝑻=𝑭1𝑭2:(𝑭2𝑺:𝑻)
  4. If f(𝑺)=f1(𝑭2(𝑺)) then f𝑺:𝑻=f1𝑭2:(𝑭2𝑺:𝑻)

See also

[edit | edit source]


Notes

[edit | edit source]
  1. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  2. ^ The applicability extends to functions over spaces without a metric and to differentiable manifolds, such as in general relativity.
  3. ^ If the dot product is undefined, the gradient is also undefined; however, for differentiable f, the directional derivative is still defined, and a similar relation exists with the exterior derivative.
  4. ^ Thomas, George B. Jr.; and Finney, Ross L. (1979) Calculus and Analytic Geometry, Addison-Wesley Publ. Co., fifth edition, p. 593.
  5. ^ This typically assumes a Euclidean space – for example, a function of several variables typically has no definition of the magnitude of a vector, and hence of a unit vector.
  6. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  7. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  8. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  9. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  10. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  11. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  12. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  13. ^ J. E. Marsden and T. J. R. Hughes, 2000, Mathematical Foundations of Elasticity, Dover.

References

[edit | edit source]
  • Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  • Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
  • Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
[edit | edit source]

Error creating thumbnail: File missing Media related to Lua error in Module:Commons_link at line 62: attempt to index field 'wikibase' (a nil value). at Wikimedia Commons