In mathematics, particularly in the study of Lie groups and Lie algebras, the matrix exponential is the function on square matrices A defined by
Motivational ideas
This concept has applications to systems of linear differential equations. Recall that a differential equation of the form
- y′ = Cy
has solution eCx. If we consider the vector
-
we can express a system of coupled linear differential equations as
-
If we make an ansatz and use an integrating factor of e−Ax and multiply throughout, we obtain
-
-
If we can calculate eAx, then we can obtain the solution to the system.
Consider then the power series expansion of the exponential function:
-
If we replace x with A, then we have a means of calculating the matrix exponential. The issue of convergence arises, which will be dealt with in functions of matrices .
Properties
Two properties are important in the calculation of the exponential of a matrix via the Jordan form. Suppose J is the Jordan form of A, with P the transition matrix. Then
-
Also, since
-
-
In some cases we have
but when A and B commute with each other, the familiar identity holds.
If A is a skew-symmetric matrix then eA is an orthogonal matrix.
If a matrix is diagonal
then its exponential can be obtained by just exponentiating every entry:
This also allows to exponentiate diagonalizable matrices. If A = U−1DU and D is diagonal, then eA = U−1eDU.
Calculations
Consider the matrix
-
which has Jordan form
-
and transition matrix
-
Now,
-
and
-
Recall that each Jordan block can be written as λI+N where N is nilpotent (suppose nilpotent with index k). The fact that N is nilpotent will aid us greatly:
-
which is a calculation with only finitely many terms!
So,
-
The exponential calculation for a 1×1 matrix is clearly trivial, with eJ1(4)=e4 so,
-
-
Clearly, to calculate the Jordan form and to evaluate the exponential this way is very tedious. Often, it will often suffice to calculate the action of the exponential matrix upon some vector in applications, and there are other methods available to achieve this.
Column method
If we have a matrix A with one eigenvalue λ, we can examine the generalized eigenspace of λ, and we can determine a constant k such that
-
However, observe
-
also
-
which, again, is a calculation with only finitely many terms.
If we firstly determine a basis of generalized eigenvectors for A, {v1,v2,...,vn}, we can form a matrix B with these vectors as columns. Using the above we can calculate the action of eA upon each of the vi, say, eAvi = wi, and likewise,
We have
also, which will be of use later in applications.
Example
Consider the matrix
We need the eigenvectors and generalized eigenvectors for this matrix. From the characteristic polynomial of M, we get (x-2)2(x-4), and so we have eigenvalues 2, 2, and 4.
One eigenvector is v1=(-1, 1, 1)T corresponding to one of the eigenvalues 2. Since
Another is v2=(1, -1, 1)T corresponding to eigenvalue 4. Again,
The generalized eigenspace corresponding to eigenvalue 2,
-
Choose the second vector, ie., v3=(0,1,0)T. Then
-
-
-
Now,
-
and
-
so
-
-
Applications
One application of the matrix exponential is its use in solving systems of differential equations. Say we have a system of differential equations
Clearly, we can represent this system in matrix form:
-
-
defining the derivative of a vector to be the derivative of its components. If b=0, the system is termed homogeneous, otherwise, inhomogeneous.
The general solution of the homogeneous system is given by a linear combination of the columns of exp(tA). Say xi is one of these columns, then
-
Example (homogeneous)
Say we have the system
We have the associated matrix
In the example above, we have calculated the matrix exponential
so the general solution of the system is
-
that is,
Inhomogeneous case - Variation of parameters
For the inhomogeneous case, we can use a method akin to variation of parameters. We seek a particular solution of the form yp(t)=exp(tA)z(t) :
-
-
-
For yp to be a solution:
-
-
-
So,
-
-
where c is determined by the initial conditions of the problem.
Example (inhomogeneous)
Say we have the system
So we then have
and
From before, we have the general solution to the homogeneous equation, Since the sum of the homogeneous and particular solutions give the general solution to the inhomogeneous problem, now we only need to find the particular solution (via variation of parameters).
We have, above:
-
-
-
which can be further simplified to get the requisite particular solution determined through variation of parameters.