Chain Rule

One advantage of working with differentials is that the chain rule becomes automatic. For example, if you know the temperature $T$ of a metal girder as a function of position $x$, and you know your position as a function of time $t$, then you can of course obtain temperature as a function of time by substitution. The resulting expression could be differentiated to determine how quickly the temperature at your location is changing as you move along the girder. But you could also use the chain rule, starting from the differential expression \begin{equation} dT = \frac{dT}{dx}\,dx \end{equation} and then “dividing” by $dt$ to obtain \begin{equation} \frac{dT}{dt} = \frac{dT}{dx}\,\frac{dx}{dt} \label{chainrule} \end{equation} Equation (\ref{chainrule}) is the traditional statement of the single-variable chain rule.

Functions of several variables can be handled just as easily. For example, if you know the termperature $T$ as a function of position in space, and you know your position as a function of time, then a similar computation would start with \begin{equation} dT = \Partial{T}{x}\,dx + \Partial{T}{y}\,dy + \Partial{T}{z}\,dz \label{dT} \end{equation} and then “divide” by $dt$ to obtain \begin{equation} \frac{dT}{dt} = \Partial{T}{x}\,\frac{dx}{dt} + \Partial{T}{y}\,\frac{dx}{dt} + \Partial{T}{z}\,\frac{dz}{dt} \end{equation}

Now suppose that your position in space depends on latitude and longitude, rather than on time. So \begin{equation} dx = \Partial{x}{\theta}\,d\theta + \Partial{x}{\phi}\,d\phi \label{dx} \end{equation} where ($\theta$,$\phi$) are spherical coordinates, with similar expressions holding for $dy$ and $dz$. Suppose you want to know how the temperature changes as you walk along a line of constant latitude, that is, with $\phi$ held constant. We can substitute ($\ref{dx}$) into ($\ref{dT}$), along with the similar expressions for $dy$ and $dz$, obtaining \begin{eqnarray} dT &=& \Partial{T}{x}\, \left(\Partial{x}{\theta}\,d\theta + \Partial{x}{\phi}\,d\phi\right) + \Partial{T}{y}\, \left(\Partial{y}{\theta}\,d\theta + \Partial{y}{\phi}\,d\phi\right) + \Partial{T}{z}\, \left(\Partial{z}{\theta}\,d\theta + \Partial{z}{\phi}\,d\phi\right) \\ &=& \left( \Partial{T}{x}\,\Partial{x}{\theta} + \Partial{T}{y}\,\Partial{y}{\theta} + \Partial{T}{z}\,\Partial{z}{\theta} \right)\,d\theta + \left( \Partial{T}{x}\,\Partial{x}{\phi} + \Partial{T}{y}\,\Partial{y}{\phi} + \Partial{T}{z}\,\Partial{z}{\phi} \right)\,d\phi \end{eqnarray} But we also have \begin{equation} dT = \Partial{T}{\theta}\,d\theta + \Partial{T}{\phi}\,d\phi \end{equation} and by comparing coefficients we obtain \begin{equation} \Partial{T}{\theta} = \Partial{T}{x}\,\Partial{x}{\theta} + \Partial{T}{y}\,\Partial{y}{\theta} + \Partial{T}{z}\,\Partial{z}{\theta} \label{mchain} \end{equation} as well as a similar expression for $\Partial{T}{\phi}$.

Equation ($\ref{mchain}$) is the traditional statement of the multivariable chain rule. You can easily reconstruct this expression from ($\ref{dT}$) by “dividing” by $d\theta$, so long as you remember to replace ordinary derivatives by partial derivatives.


Personal Tools