Oscillations, Symmetry and Diagonalization
From a single mass on a spring to a vibrating string, oscillatory systems are often taught in the typical classroom via a sequence of seemingly ad-hoc guesses: exponentials in time, normal modes in space, and Fourier series in the continuum limit. In this essay, we look at oscillations through a linear-operator lens, and use the symmetries with which those operators commute to show how diagonalization naturally decouples the dynamics. We show how those familiar solutions arise in a principled fashion. The focus throughout is on symmetry, linear algebra and operator structure, making the discussion applicable well beyond mechanics: to spectral methods, linear dynamical systems and kernel methods, which are common in Statistics/Machine Learning.
1: Introduction and Motivation
Anyone who has taken physics in high school or university has likely seen the following equation many times:
$$\ddot{x}(t) = -\omega^2\, x(t).$$
This is the equation of motion of a Simple Harmonic Oscillator (SHO): the acceleration of the particle as a function of time, $\ddot{x}(t)$, is proportional to its displacement from the mean position, $x(t)$, and directed towards the mean position. If you were taught the way I was, the typical treatment is to introduce the following as a "guess" solution:
$$x(t) = A\cos(\omega t + \phi).$$
We check, and indeed, it satisfies the given differential equation. Then the physics teacher would claim "it is the most general solution for an SHO", and the class would proceed to discuss various properties of this solution.
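For completeness, here is what that check amounts to, using the cosine form written above (the constants $A$ and $\phi$ are arbitrary): differentiate twice and compare.
$$\dot{x}(t) = -A\omega\sin(\omega t + \phi), \qquad \ddot{x}(t) = -A\omega^{2}\cos(\omega t + \phi) = -\omega^{2}\,x(t). \quad\checkmark$$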
If you are anything like I was in high school, you would be dissatisfied with this business. I was used to solving equations "in a complete manner". When solving a quadratic equation, we derive the roots. We complete the square, do some algebra and the two roots emerge inevitably. We don't "guess" the solution... then why should the situation be any different here?
In this blog, I show how linear algebra and symmetry provide a principled framework for understanding oscillatory systems. We first reformulate the SHO as a linear operator equation and, using time-translation invariance, see how exponential solutions emerge as irreducible temporal behaviors of the particle. Next, we extend this same operator-theoretic viewpoint to systems of finitely many coupled oscillators, where the coupling structure leads to normal modes via diagonalization. Lastly, we take the continuum limit of a system of infinitely many coupled oscillators that obey spatial translation symmetry, and show that waves on a string arise as simultaneous eigenfunctions of the spatial symmetry generator and the underlying oscillatory operator.
I write this blog to share a few key insights that emerged for me only after revisiting ideas from several courses at MIT. These were concepts that were not immediately obvious or enlightening to me the first time I encountered them, but ones whose value I realized only in retrospect, once viewed through a unifying lens. I think I would have enjoyed reading such a blog the first time I encountered 8.03, 18.03 or 18.06, simply because I find these abstractions very beautiful and helpful for appreciating the core ideas underlying the subject.
I have tried to keep this blog technically accessible to a first-year undergraduate or advanced high-school student. To that end, I emphasize the intuition and ideas behind the concepts rather than a fully formal, mathematical treatment, and I leave references to resources where the math is discussed more concretely.
2: 1D SHO via linear operators
2.1 The SHO operator
As a first step towards a more principled framework for understanding oscillations, we look at a single particle oscillating in one dimension from an operator viewpoint.
Informally, an operator is a function of functions: it takes in a function as input, and produces another function as output. Define the differential operator
$$D \equiv \frac{d}{dt},$$
so that our SHO equation can be written compactly as
$$D^2 x + \omega^2 x = 0,$$
where I write $D^2$ to denote $D \circ D = \frac{d^2}{dt^2}$, for notational convenience later on. If we define $L \equiv D^2 + \omega^2$, then our SHO equation can be written simply as
$$L\,x = 0,$$
where we view $x$ as a function of time.
The operator $L$ itself is linear, meaning that it acts linearly on functions:
$$L(\alpha f + \beta g) = \alpha\, L f + \beta\, L g,$$
for all functions $f, g$ and scalars $\alpha, \beta$.
Note that if $x_1(t)$ and $x_2(t)$ both represent valid motions of the particle, then any linear combination $c_1 x_1(t) + c_2 x_2(t)$, with constants $c_1, c_2$, is also a valid solution. Thus, the set of solutions is closed under linear combinations, and therefore forms a vector space!
This is the first perspective shift I would like to emphasize. Rather than thinking of $x(t)$ as a literal distance, we instead treat it as an abstract function that takes a time $t$ as input and returns a real number. Further, these functions inhabit a vector space and behave just like vectors. From this viewpoint, solving the simple harmonic oscillator amounts to finding all functions that lie in the null space (or kernel) of the linear operator $L$. This sounds much more like a linear algebra problem!
Recall that in solving a matrix nullspace problem $A\mathbf{v} = \mathbf{0}$, we would typically do Gaussian elimination or row reduction to find a set of linearly independent vectors $\mathbf{v}_1, \ldots, \mathbf{v}_k$ that span the nullspace of the matrix $A$. Then, the most general solution is some linear combination:
$$\mathbf{v} = c_1 \mathbf{v}_1 + c_2 \mathbf{v}_2 + \cdots + c_k \mathbf{v}_k,$$
where the $c_i$ are constants.
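As a small numerical aside (my own addition, not part of the original argument), here is how this looks in code; the matrix `A` below is a hypothetical example chosen to have a two-dimensional nullspace:

```python
import numpy as np
from scipy.linalg import null_space

# A hypothetical 3x3 matrix of rank 1, so its nullspace is 2-dimensional.
A = np.array([[1.0, 2.0, 3.0],
              [2.0, 4.0, 6.0],
              [3.0, 6.0, 9.0]])

# The columns of N form an orthonormal basis of the nullspace of A.
N = null_space(A)
print(N.shape)                    # (3, 2): two independent nullspace directions

# Any linear combination of the basis vectors is again in the nullspace.
c1, c2 = 1.5, -0.7                # arbitrary constants
v = c1 * N[:, 0] + c2 * N[:, 1]
print(np.allclose(A @ v, 0))      # True
```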
To solve our SHO equation, we will try doing something similar, but here we have two important differences:
- We are working in the vector space of arbitrary functions $x(t)$, which is an infinite-dimensional vector space!
- Our linear operator $L$ isn't a matrix with concrete numbers; it has the derivative operator $D$ inside it.
Regardless, the core idea remains the same: finding a set of independent vectors that span the nullspace of the linear operator $L$ will give us a general solution to the SHO equation. We deal with these differences one by one.
2.2 Dimension of nullspace
First, we show that the dimension of the nullspace of $L$ is $2$, via a bijection to $\mathbb{R}^2$. So, we need not be concerned with the entire infinite-dimensional space of functions; we are only interested in a $2$-dimensional subspace of this vector space of all functions.
We know that two initial conditions are required to fully specify the motion of the particle in SHO, typically the initial displacement $x(0)$ and the initial velocity $\dot{x}(0)$. One way of seeing why this fully constrains the particle's motion for all time is the following: if we fix $x(0)$ and $\dot{x}(0)$, then all higher-order derivatives at $t=0$ are fixed too: the second derivative is $\ddot{x}(0) = -\omega^2 x(0)$, the third derivative is $\dddot{x}(0) = -\omega^2 \dot{x}(0)$, and so on. Then, by Taylor's expansion around $t=0$:
$$x(t) = \sum_{n=0}^{\infty} \frac{x^{(n)}(0)}{n!}\, t^n.$$
As all the higher-order derivatives at $t=0$ are fixed, so is $x(t)$ (for the well-behaved functions we care about here, fixing all orders of derivatives of a function at a particular time fully determines the function).
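In fact (this is my own addition, not needed for the argument), iterating $\ddot{x} = -\omega^2 x$ gives every derivative at $t=0$ in terms of $x_0 \equiv x(0)$ and $v_0 \equiv \dot{x}(0)$, and the Taylor series can be summed explicitly:
$$x^{(2k)}(0) = (-\omega^2)^k\, x_0, \qquad x^{(2k+1)}(0) = (-\omega^2)^k\, v_0,$$
$$x(t) = x_0 \sum_{k \ge 0} \frac{(-1)^k (\omega t)^{2k}}{(2k)!} \;+\; \frac{v_0}{\omega} \sum_{k \ge 0} \frac{(-1)^k (\omega t)^{2k+1}}{(2k+1)!} \;=\; x_0 \cos\omega t + \frac{v_0}{\omega}\sin\omega t.$$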
One obvious fact that needs to be stated for completeness: given any $x(t)$, there is a unique $x(0)$ and a unique $\dot{x}(0)$.
Thus, we have shown that given a particular $(x(0), \dot{x}(0))$ we obtain a unique $x(t)$, and given a particular $x(t)$ we get a unique $(x(0), \dot{x}(0))$. So there is a bijection $x(t) \leftrightarrow (x(0), \dot{x}(0))$. Further, this is a linear map:
$$\alpha\, x_1(t) + \beta\, x_2(t) \;\longleftrightarrow\; \alpha\,\big(x_1(0), \dot{x}_1(0)\big) + \beta\,\big(x_2(0), \dot{x}_2(0)\big).$$
A bijective, linear map is an isomorphism, implying that the dimensions of the spaces related by this map are the same (isomorphism literally means same-shape). So, the dimension of the nullspace of $L$ is the dimension of the space inhabited by all such pairs $(x(0), \dot{x}(0))$. Clearly, any two real numbers can be valid initial conditions, so this space is $\mathbb{R}^2$, which has dimension $2$. So, the dimension of the nullspace of $L$ is $2$.
This yields another beautiful picture that I wish to emphasize: each solution to the SHO is indexable by a unique vector on the real number plane that specifies its initial conditions $(x(0), \dot{x}(0))$. Further, choosing $t=0$ to establish this bijection was arbitrary; you could take any time $t_0$ and index the path by $(x(t_0), \dot{x}(t_0))$ by a similar analysis as above. The set of all such points thus corresponds to the same trajectory $x(t)$, and as you slide through all values of $t_0$, we get a "trajectory" on $\mathbb{R}^2$ traced by this particle. You might have already seen this as the phase-space picture of the SHO.
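Here is a minimal sketch of that picture (my own aside, using the cosine solution we derive below and arbitrarily chosen parameter values):

```python
import numpy as np
import matplotlib.pyplot as plt

# Hypothetical parameter values, just for illustration.
omega, A, phi = 2.0, 1.0, 0.3

t = np.linspace(0.0, 2 * np.pi / omega, 400)   # one full period
x = A * np.cos(omega * t + phi)                # displacement x(t)
v = -A * omega * np.sin(omega * t + phi)       # velocity x'(t)

# Each instant t0 gives a point (x(t0), x'(t0)); the full motion traces an ellipse.
plt.plot(x, v)
plt.xlabel("x(t)")
plt.ylabel("x'(t)")
plt.title("Phase-space trajectory of the SHO")
plt.show()
```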
2.3 Finding two basis vectors
Having proved that the nullspace of $L$ is $2$-dimensional, we need to find only two independent functions to write down a general solution: any $x(t)$ in the nullspace is some linear combination of these two.
In linear algebra, it often helps to reorient ourselves to a convenient basis. The same problem that is very difficult in the default basis can become very simple if you exploit the structure of the matrix in question to perform a basis change.
So let's take a closer look at our linear operator $L$:
$$L = D^2 + \omega^2,$$
and specifically, let's focus on the derivative operator $D$. One key property of $D$ is that it commutes with time translation. This is just a formal way of saying
$$\frac{d}{dt}\,x(t+s) = \dot{x}(t+s) \quad \text{for any shift } s,$$
i.e. it does not matter whether you first shift a function in time and then differentiate, or differentiate first and then shift; but understanding this symmetry better will be very helpful later on.
Symmetry is described mathematically by members of some symmetry group. When I say that $D$ obeys a symmetry in time, I mean that $D$ interacts in a certain fashion with elements of the symmetry group of time translations. In general, we can label a time translation by the amount of shift $s$ it causes:
$$T_s, \qquad s \in \mathbb{R}.$$
These can also be represented as operators on our vector space of functions (i.e. this is one of the representations of our symmetry group). Their action on any $x(t)$ is:
$$(T_s\, x)(t) = x(t+s),$$
i.e. a mapping from the function $x(t)$ to the shifted function $x(t+s)$. Note that $T_s$ is also a linear operator.
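As a quick sanity check (my own addition, not strictly needed for what follows), these operators compose exactly the way time shifts should, and each one acts linearly:
$$(T_s\, T_{s'}\, x)(t) = (T_{s'}\,x)(t+s) = x(t+s+s') = (T_{s+s'}\, x)(t), \qquad \text{so } T_s T_{s'} = T_{s+s'},\;\; T_0 = \mathrm{id},\;\; T_s^{-1} = T_{-s};$$
$$\big(T_s(\alpha f + \beta g)\big)(t) = \alpha f(t+s) + \beta g(t+s) = \alpha\,(T_s f)(t) + \beta\,(T_s g)(t).$$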
Note that $D$ commutes with any such $T_s$:
$$D\,(T_s\, x) = T_s\,(D\, x) \quad \text{for every function } x(t),$$
which we compactly write as $D\,T_s = T_s\,D$. This can be seen explicitly by the action on any $x(t)$:
$$(D\,T_s\, x)(t) = \frac{d}{dt}\big[x(t+s)\big] = \dot{x}(t+s) = (T_s\,D\, x)(t).$$
Similarly, $D^2$ also commutes with time translations: $D^2\,T_s = T_s\,D^2$. And consequently, as $L = D^2 + \omega^2$:
$$L\,T_s = T_s\,L,$$
because $\omega^2$ commutes with any operator (it's just a scalar times the identity operator).
Now, the key step that will justify all this hard work:
If two linear operators commute, then we can find a common set of eigenfunctions for both of them.
That sounds like a mouthful, but it is a key result taught in Quantum Mechanics and Linear Algebra courses. To break it down, I discuss the underlying intuitive picture below.
In this context, eigenfunctions are simply the solutions to operator eigenvalue equations, akin to eigenvectors in finite-dimensional spaces. If a linear operator $A$ has an eigenfunction $f$ with eigenvalue $\lambda$, then:
$$A\,f = \lambda\,f.$$
So, applying the operator $A$ to the eigenfunction $f$ results in the function being scaled by the eigenvalue $\lambda$. Importantly, eigenfunctions are directions (in the function space) that remain unchanged except for scaling when acted upon by $A$. This is not true for a general function. For example, if $A$ has eigenfunctions $f_1$ and $f_2$ with distinct eigenvalues $\lambda_1$ and $\lambda_2$, then:
$$A\,(f_1 + f_2) = \lambda_1 f_1 + \lambda_2 f_2 \neq \lambda\,(f_1 + f_2) \quad \text{for any single scalar } \lambda,$$
unless one of $f_1$ or $f_2$ is zero. Eigenfunctions are special directions where the action of the operator is simple scaling.
Our claim is that if two operators commute, then these special directions align. The full proof is found in the references, but I prove it for the case where these operators don't have degenerate eigenvalues (i.e. for each eigenvalue, there is a unique associated direction). Let $A$ and $B$ commute, and take any eigenfunction $f$ of $A$ with eigenvalue $\lambda$. Here, I will assume that $f$ is (up to scaling) the only eigenfunction of $A$ with the eigenvalue $\lambda$. Then:
$$A\,(B f) = (A B)\, f = (B A)\, f = B\,(A f),$$
as $A B = B A$. But then:
$$A\,(B f) = B\,(\lambda f) = \lambda\,(B f),$$
so, $B f$ is also an eigenfunction of $A$ with eigenvalue $\lambda$. But as these are non-degenerate operators, $B f$ must be in the same direction as $f$, and so for some constant $\mu$:
$$B f = \mu\, f,$$
so, $f$ is also an eigenfunction of $B$.
The underlying picture is this: because $A$ and $B$ commute, the order in which you apply them to an eigenfunction of either of these operators does not matter, as they both simply scale that eigenfunction. Their actions on these common directions do not interfere.
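To make this concrete in finite dimensions (a numerical aside of my own, anticipating the translation operators above): a cyclic shift matrix and a periodic second-difference matrix commute, and the discrete Fourier modes, which are eigenvectors of the shift, diagonalize both.

```python
import numpy as np

n = 8
S = np.roll(np.eye(n), 1, axis=1)    # cyclic shift: (S x)[i] = x[(i+1) mod n]
K = S + S.T - 2 * np.eye(n)          # periodic second difference (discrete D^2)

print(np.allclose(S @ K, K @ S))     # True: the two operators commute

# The columns of F are the discrete Fourier modes, the eigenvectors of S.
F = np.array([[np.exp(2j * np.pi * j * k / n) for k in range(n)]
              for j in range(n)]) / np.sqrt(n)

def is_diagonal(M):
    # True if M is (numerically) diagonal.
    return np.allclose(M, np.diag(np.diag(M)))

print(is_diagonal(F.conj().T @ S @ F))   # True: F diagonalizes the shift
print(is_diagonal(F.conj().T @ K @ F))   # True: the same F diagonalizes K
```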
Going back to our SHO problem, as $L$ commutes with $T_s$, we must have that any eigenfunction of one is also an eigenfunction of the other. As we are interested in finding $2$ distinct functions in the nullspace of $L$:
$$L\,x = 0,$$
this is equivalent to finding two distinct eigenfunctions of $L$ with eigenvalue $0$. And using the result on commuting operators, all eigenfunctions of $T_s$ are eigenfunctions of $L$!
The eigenfunctions of the time-translation operator are much more fundamental objects, and there is a much more satisfying answer to why they are eigenfunctions of time translation in the first place, as we see below.
We seek functions $f(t)$ such that $T_s f = \mu_s f$ for some $\mu_s \in \mathbb{C}$, the set of complex numbers. We use complex numbers instead of real numbers because they are algebraically closed, meaning that we need not be concerned about missing roots when we solve the eigenvalue problem (which, in general, gives a polynomial in the eigenvalue):
$$f(t+s) = \mu_s\, f(t) \quad \text{for all } t.$$
This already makes the answer fairly clear, but I finish the derivation for completeness. Taking the derivative with respect to $s$ and setting $s = 0$, this gives us a differential equation for $f$:
$$f'(t) = \lambda\, f(t), \qquad \lambda \equiv \frac{d\mu_s}{ds}\bigg|_{s=0}, \qquad \Longrightarrow \qquad f(t) = f(0)\,e^{\lambda t}.$$
Thus, the eigenfunctions of the time-translation operator are exponentials!
It is very satisfying to see this result: exponentials are precisely those functions that, if you shift their argument by $s$, get scaled by a constant factor ($e^{\lambda s}$ in the notation above). Exponentials are the unique class of functions that survive (i.e. are scaled and don't change direction) under the action of translation!
Now that we know that all eigenfunctions of $T_s$ are exponentials, and that all eigenfunctions of $T_s$ are eigenfunctions of $L$, all that remains is to find two distinct exponentials that lie in the nullspace of $L$. This brings us to the place where most college courses start: plug in $x(t) = e^{\lambda t}$ as a guess solution, where $\lambda \in \mathbb{C}$:
$$L\,e^{\lambda t} = \left(\lambda^2 + \omega^2\right) e^{\lambda t} = 0 \quad \Longrightarrow \quad \lambda = \pm\, i\omega.$$
Thus, any general solution to the SHO equation can be written as a linear combination of these two nullspace-spanning eigenfunctions:
$$x(t) = c_+\, e^{i\omega t} + c_-\, e^{-i\omega t},$$
where $c_+, c_- \in \mathbb{C}$. But as we are dealing with functions that describe the motion of our particle, we require that $x(t)$ and $\dot{x}(t)$ are real valued for all $t$. We ensure that the imaginary parts of these two functions are zero, and after some algebra we end up with the general solution to the SHO:
$$x(t) = A\cos(\omega t + \phi),$$
with real constants $A$ and $\phi$.
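For completeness, here is one way to carry out that algebra (the constant names $A$ and $\phi$ are my choice): requiring $x(t)$ to be real for all $t$ forces the two coefficients to be complex conjugates, and writing them in polar form recovers the cosine:
$$x(t) = \overline{x(t)} \;\;\forall t \;\Longrightarrow\; c_- = \overline{c_+}; \qquad c_+ = \tfrac{A}{2}e^{i\phi} \;\Longrightarrow\; x(t) = \tfrac{A}{2}e^{i(\omega t+\phi)} + \tfrac{A}{2}e^{-i(\omega t+\phi)} = A\cos(\omega t + \phi),$$
which can equivalently be expanded as $A\cos\phi\,\cos\omega t - A\sin\phi\,\sin\omega t$.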