The Science of Everything Podcast - Episode 158: Quantum Electodynamics Part 1

Episode Date: March 1, 2026

A detailed look into the physics and mathematics of quantum electrodynamics, the theory of how light and matter interact. We discuss the generalisation of the Schrodinger equation to the Klein-Gordon ...and Dirac equations, and howw these describe the propogation of light and fermions respectively. We then consider the process of computing transition probabilities between quantum states, including the S matrix and perturbation theory, and Wick's theorem. Recommended pre-listening is Episode 85: Introduction to Quantum Field Theory. If you enjoyed the podcast please consider supporting the show by making a PayPal donation or becoming a Patreon supporter. https://www.patreon.com/jamesfodor https://www.paypal.me/ScienceofEverything

Transcript
Discussion (0)
Starting point is 00:00:33 listening to The Science of Everything podcast episode 158, quantum electrodynamics. I'm your host, James Fodor. In today's episode, we are going to discuss the quantum field theory of quantum electrodynamics, or QED, as it's often called. So this theory describes the relativistic interactions between light and ordinary matter, so that's like electrons and similar types of particles. This episode follows on from one I did quite a few years ago now on an introduction to quantum field theory. So that's the recommended pre-listening episode 85, introduction to quantum field theory.
Starting point is 00:01:15 So that was an introductory conceptual overview of quantum field theory. This episode is going to be building on that and we'll review some of the concepts, but you should really listen to that first or have some background in quantum field theory to appreciate what we're going to be discussing today. So here we're going to go through the mathematics and the formal in much more detail, and I'm going to discuss particularly how we use the formulism, including Feynman diagrams, to calculate experimentally measurable quantities in QED. This episode will also form the basis for a subsequent episode, which hopefully will come out
Starting point is 00:01:53 within the next few months, where we'll talk about the standard model, so we'll look at other interactions beyond. So as I said, Recommended Prelisting is episode 85 on introduction to quantum field theory. The two episodes immediately prior to that 83 and 84, advanced quantum mechanics would also be useful. I mean, they're in turn prerequisites for 85. So hopefully you'll listen to that first. So just be warned, this is going to be a more technical episode. And prerequisites will be important here.
Starting point is 00:02:18 So, for example, the first thing I'm going to do is start by talking about how we can make the Schrodinger equation apply at relativistic energy scale. So if you don't understand what that means, then the episode's going to be. bit hard to get into. So do check out those earlier episodes if you are not really sure what any of that is about. So that being said, let's dive in and start with the good old Schrodinger equation from non-relativistic quantum mechanics. We know that the Schrodinger equation describes how a quantum system, or like a particle, for example, evolves over time. And specifically, what the Schroenger equation actually is, is it says that the evolution of the wave function, so that the partial derivative with respect to time of the wave function,
Starting point is 00:03:11 is equal to the energy operator applied to the wave function. So the energy of the particle determines how it evolves over time effectively, or energy of the particle system, because it can be more than one particle. You can actually derive the Stroudanager equation by starting with the classical total energy momentum relationship, which says that the energy of a system is equal to the mass energy, so MC squared, plus the kinetic energy. So that's one-half mv squared. That's the non-relativistic formula for the kinetic energy. If you then effectively transcribe those energy relations into the quantum mechanics length,
Starting point is 00:03:56 So you take the energy to refer to the partial derivative with the system with respect to time and the momentum referring to the partial derivative with respect to space, multiplied by Planck's constant, of course. If you make that transcription of the energy terms, substitute them in and rearrange, you end up with Schrodinger's equation. So that's kind of how he came up with the equation. But we should be emphasized that that's sort of a heuristic derivation because there's no, formal set of principles that says you should sort of start with that. Ultimately, Schroenger's equation is justified experimentally in terms of how useful it is. The point is, though, that this blueprint provides us with a way to understand how we can make the Schroeniger equation apply in relativistic cases, because as it is, the Schrodinger equation does not incorporate relativistic effects.
Starting point is 00:04:47 So when the velocity of the particle becomes very large, and we start looking at velocities close to the speed of light, relativistic effects become important, and and the Schrodinger equation is not going to appropriately describe those. So one attempt that we can make to develop a relativistically invariant form of the Schroenger equation is to, instead of starting with the classical energy momentum relationship, which remember says that energy is equal to like mass energy plus kinetic energy, E equals MC squared plus half mv squared, right? Instead of starting with that and then translating that into quantum realm and getting the Schrodinger equation,
Starting point is 00:05:26 we can start with the relativistic energy equation and then translate that into quantum realm and see what we get. So the relativistic energy equation, which I've discussed in episodes we talked about relativity, e squared equals m squared c to the power of four plus c squared p squared. So note that this is actually slightly different because this is the equation that gives energy squared. So that's sort of that will become important later. So the mc squared part is the same. what's different is that instead of having the kinetic energy, we now have a term for the total momentum. So C squared, p squared is the momentum, the magnitude of the momentum, and we multiply that by C squared.
Starting point is 00:06:09 The difference between the two, the half mv squared transforming into C squared p squared, the difference there is entirely due to relativistic effects. And you can actually prove that if you restrict two speeds much, much less than the speed of light, that the relativistic form of the energy relationship at the limit with low velocities turns back into the classical energy relationship. So if it's sort of not obvious to you how you go from C squared p squared down to half mv squared, it's just due to relativistic effects. And you can you can prove that using some fairly simple, if a bit tedious algebra. So the point of all this is that the relationship between momentum and energy changes in classical compared to relativistic
Starting point is 00:06:54 context. And so now when we do our conversion into quantum mechanics realm, we replace energy with the partial derivative with respect to time times planks constant, and momentum with the partial derivative with respect to space times blanks constant. We substitute that into the relativistic energy equation and rearrange. What we come up with is a new equation, which is now relativistically invariant, which means it's applicable to relativistic context, and it's called the Klein Gordon equation. The Klein-Gordon equation, if I just sort of read it out mathematically, it says that some of the partial derivatives of the wave function is equal to the mass squared times the wave function with some extra planks constants and c squared's thrown in. Note that this is a relativistically invariant formulation,
Starting point is 00:07:42 which means that it doesn't single out and differentiate derivatives with respect to time with derivatives with respect to space. And this is important because, as you may recall, relativity, we treat space and time as connected in four-dimensional space time. And when you change frames of reference, components that in one frame of reference contributed to spatial terms can, in another frame of reference, contribute to time-like terms and vice versa. They transform into each other. There's no strict separation between space and time. It depends on your, it depends on your velocity and your frame of reference. So that's as expected when we move from the Schroen equation, which separates out space in time, you know, it's got a partial derivative with respect
Starting point is 00:08:26 to time and a second partial derivative of respect to space on the other side of the equation, which gives rise to the kinetic energy. That won't do as a relativistic description. We're going to need a formulation which merges them together, and that's what we get with the Klein-Gordon equation. We actually have an operation which kind of sums together all of the partial derivatives and treats them on equal footing. So that's good. Now we, so the Klein-Gordon equation is what we want. it's relativistically invariant, so we can transform between different frames of reference appropriately, and it will obey the Lorentzian variance that we need from relativity theory. It turns out that the Klein-Gordon equation actually does correctly describe the dynamics of a
Starting point is 00:09:07 spinless particle. So remember, spin is this intrinsic property that most particles have that describes some kind of intrinsic angular momentum. It doesn't really have a classic, analog, so it's sort of hard to give an intuition for what it is, but it affects how it interacts with other types of angular momentum as well as a range of other types of interactions, which we'll discuss further in this episode. But the point is the Klein-Gordon equation describes a spinless particle. So you may recall in previous episodes I've talked about spin. The electrons have a spin of one-half. So the Klein-Gordon equation does not describe the dynamics of electrons. However, photons have no spin, so photons are spinless. So the Klein-Gordon equation does
Starting point is 00:09:55 describe the evolution of the electromagnetic field. We have to just make a couple of minor modifications because the electromagnetic field is a vector field. It has four components. So typically the way the Klein-Gordon equation is written is just for a scalar field that has only one component, but that's a minor modification. The point is the Klein-Gordon equation was sort of developed as a way to generalize the Schroenger equation to become relativistically invariant to incorporate time and space together and use the proper energy relations that we have in relativity. And it turns out that when we do that, we actually get an equation that describes with some modifications the evolution of, or the electromagnetic
Starting point is 00:10:33 field in free space, so how it evolves over time. The problem is the Klein-Gordon equation is not suitable for describing particles with spin one-half, which includes electrons. Or any other fermions as well, including quarks. But, mostly I'll focus on electrons just because they're a bit simpler to understand. We'll talk about quarks more in a future episode. So we need a different equation if we're going to describe in a relativistically invariant way the motion of electrons. So our quest to find a relativistic replacement to the Schrodinger equation that's applicable to electrons is still ongoing. It turns out that the equation we're looking for is called the Dirac equation. And this was developed
Starting point is 00:11:14 shockingly by Dirac. Now, the way he developed it's sort of interesting. What he did is that he was sort of thinking that the Klein-Gordon equation is on the right track, but the problem is that it is an equation in the second partial derivatives of the wave function. I mentioned that before, and the reason that that comes about is because we started with the squared energy equation, the relativistic equation, which is energy squared equals mass squared times C to the four, like that's MC squared, all squared, plus C-squared, right? So everything is squared. And that is how energy is described relativistically.
Starting point is 00:11:50 But Darac was thinking that, you know, that's not what the Schrodinger equation looks like. The Schrodinger equation has first order derivatives with time particularly. And he was thinking that maybe the way to describe thermion, like electrons, is going to need to look a bit more like that. So kind of what he did is found a way to sort of take the square root of the Klein-Gordon equation. I've got to put that in quotes because you can't exactly take the square root of the whole equation. But what he did is he found a way to sort of break it up into two and then cancel one part out so that he was left with sort of the square root. But in order to do that, though, he needed to introduce this new mathematical device, which is a matrix. So it's a four by four matrix.
Starting point is 00:12:32 It's called the gamma matrices. There's actually several. There's actually four of them. And that was necessary in order to make the algebra work in order to sort of do this square root operation. you can't do that with ordinary numbers. He needed to introduce this matrix. So he turned the Klein-Gordon equation into a matrix equation. And so what you end up with is this equation that instead of, instead of in being second-order derivatives, it's now first-order derivatives. So the Dirac equation essentially says that the sum of the first-order derivatives of the wave function is
Starting point is 00:13:04 equal to the mass times the wave function. But there's this extra component in the gamma matrices. So it's not just the derivatives, like the partial derivatives with respect to time and all of the spatial dimensions, it's these partial derivatives multiplied by the gamma matrices. Now, why do we need to introduce this sort of weird four-by-four matrices into the picture? Well, essentially, it turns out that these matrices are necessary for describing spin and how the spin of an electron changes in different frames of reference and also as the velocity of the system changes. So this isn't necessary in the Klein-Gordon equation.
Starting point is 00:13:41 We don't have the gamma matrices there because that describes the evolution of spinless particles, so they don't have spin. It's also not necessary in the Schrodinger equation because the Schrodinger equation ignores spin. There are ways of incorporating spin into the Schrodinger equation, but it's sort of done separately. And critically, when you're not dealing with relativistic effects, spin is easier because, again, you can separate out time and space, and spin doesn't have as complicated interrelated, actions as it does when we go into relativity land, right? So these gamma matrices are four by four matrices. You can write them out in explicitly. It gets technical because you can't write the matrices
Starting point is 00:14:22 out in an explicit form without choosing a basis for it. And I just described it in the Dirac basis, which is one particular way you can write them out. There's other ways to write them. But what it all comes down to is that it doesn't actually matter which basis you choose. The physics all comes out the same. In the end, it's just sort of different ways of doing the calculations. And in fact, often you don't even need to write out the explicit form at all. It's just a way to understand what it is that you're dealing with in terms of the nature of these mathematical beasts. You can write them out explicitly if that helps. The point is, though, that these gamma matrices are going to be very important going forward
Starting point is 00:14:57 because the gamma matrices describe effectively how the spin of an electron transforms in different frames of reference and with changing our velocity. I mean, that comes down to the same thing because different frames of reference differ in terms of the velocity that they ascribe different particles because velocity is all relative, right? You know, speed relative to what different reference frames will ascribe different velocities to things, but the physics all is the same in the end. That's what relativity tells us, right? So we need these gamma matrices in order to make sure that the spin of an electron is correctly described when we make these changes between reference frames. And that's their job. And so we will see them crop up throughout the different calculations that we do. I mean, I'm not always going to describe exactly the mathematics.
Starting point is 00:15:42 But the point is that you need to know about these gamma matrices because they play an important role in the formalism. So I mentioned before that our Dirac equation describes the evolution of, say, an electron over time, and it does so by introducing these gamma matrices, which helps explain how the spin transforms in different reference reps and keeps track of that. It turns out that in order for this to work, the equation is now going to describe the evolution of not just a single field value like we had in the Tronager equation. The wave function, at least in the simple case for a single particle system for the Troniger equation, it just gives the probability of finding the electron in a single particular location, right? You square the wave function, you get a probability of finding the electron in a certain location. So the wave function is a scalar. It just has one value.
Starting point is 00:16:35 It's a complex number. You square it, you get a probability. That's no longer true in the Dirac equation. The wave function is not a scalar anymore. It's actually a special mathematical object called a spinner. S-P-I-N-O-R, spinner. The spinner has four elements. So you can write it as a vector that has like four elements,
Starting point is 00:16:56 but technically it's a spinner, not a vector, because it transforms a bit differently. I won't try to describe the exact sort of mathematical formalism behind that, but it's a known mathematical construct. And the reason it takes this form is effectively because there are two spin polarizations of an electron. There's spin up and they're spin down. Also, and it turns out that this is essential to the formalism, which is interesting, there are two different types of electrons, or two different versions, if you like.
Starting point is 00:17:25 There's electrons and positrons, or the negatively charged electrons and polytron. positively charged electrons, matter and anti-matter versions. You've probably heard of that before, that there are positrons that are positively charged versions of electrons, but have all the same, all the other properties the same. Now, what's interesting is that at the time Dirac formulated his equation, I can't remember if they had just, if positrons had just been discovered or if it was just after that, but it was around the same time. And Dirac's equation essentially predicts that positrons must exist because the spinors have four elements to them, and two of describe the spin of the electron, like essentially whether it's a spin up or a spin down.
Starting point is 00:18:03 One element will be you spin up and the other is spin down. And then the other two elements describe the spin status of your anti-electron, your positron, the antimatter version. And you can't actually get the direct equation to work if you ignore the antimatter. Effectively, this is because matter and antimatter are affected by relativistic transformations. And so we need to fit them all into the same formulism in order to make the equation relativistically invariant. So loosely, you can think that this spinner that we have here, this mathematical object that describes the spin state of an electron, has four elements. And those four elements essentially say how much of the spin state of that electron is in the spin upstate, how much is it in the spin down state, and then how much is in the antimatter spin upstate and the antimatter spin down state? That's not quite right, but it's sort of close enough for our purposes here.
Starting point is 00:18:57 The spin of an electron doesn't change as you change different frames of reference, but the way you describe it does. And so the spinners keep track of that, how the different components change in different frames of reference, because different frames of reference describe the same underlying physics in different ways. So in our formalism, we need to keep track of that, and that's what the spinners do. Now, you may recall from the previous episode, the introduction to quantum field theory, that's the prerequisite. I talked about how in quantum field theory we change the interpretation of the, effectively what is the wave function in non-relativistic quantum mechanics. In that context, we think of the wave function as describing a particular particle or group
Starting point is 00:19:37 of particles, and we square the wave function, we get a probability that the particles are in a particular configuration. In quantum field theory, we don't really work with a wave function anymore. We work with these field states, and a field is actually an operator. So we say that the fields are now operator valued. And what the field does is it acts on some sort of quantum. quantum state, which is more the analog of the wave function, that the quantum state is an abstract description of the system, basically based on how many particles it has. So the vacuum quantum state, which is often just written as like a zero in a bra rocket notation. I talked about that in the previous episode as well, the advanced quantum mechanics ones. In quantum field theory, an operator value
Starting point is 00:20:19 field will act on that vacuum state and either create or destroy a particle. I mean, in the case of the vacuum state, there is no particles to destroy, so the destruction operator won't do anything. The creation operator, however, will create one particle of a specific momentum in that state. And so a field in quantum field theory is actually constructed of a bunch of these operators, each of which will create a particle of a particular momentum when acting on a given quantum state. And you can think of this in terms of the idea that there's like these springs that are oscillating at each point in space, and the operator acts on a particular spring. exciting it to a particular value and number of degrees of excitation that it has
Starting point is 00:21:01 corresponds to the number of particles in that energy state. Again, I talked all about this idea in the episode 85, so go back to that one if you're sort of not really familiar with this, but this idea of a quantum field as an operator that acts on a given state to create a certain number of particles of a given momentum, we're going to be appealing to that again. We're going to still use that idea. The only difference is last time I didn't talk about spin. Here we're going to add spin into the mix because as we've seen that that's essential in order to describe relativistically the behavior of an electron. So we need to add in spin, and that comes in the form of our spinners. So we still have these creation and destruction operators. They're separate, right? They're not the
Starting point is 00:21:44 spinners. They're a different thing. So now we have a field operator, which is the solution to the Dirac equation. The Durac equation explains how the field operator evolves over time and over space. The field operator includes the creation and destruction operators, each, you know, one for each momentum state. And it also includes the spinners, which specify for each of the momentum states, what spin it has. And so overall, what the field operator does is it allows us to describe different quantum states in terms of a momentum basis and also incorporate spin into the description in an appropriate way. Now these quantum fields are again different from the wave function of the Schroenger equation because the way function is not directly observable,
Starting point is 00:22:38 but pretty close to it because the square of the wave function gives a probability of observing the particle in a particular configuration. However, in quantum field theory, the field operator that we now have doesn't actually describe a quantum state at all. It's an operator, which really just allows us to express a quantum state in a momentum basis. What is actually observable is not the field state, or even the square of the field state. Rather, what we can usually observe in it's relevant to quantum field theory are probabilities of particles interacting in a particular context, like scattering off each other in a particle accelerator, or the decay amplitudes, like the rate of decay of a massive particle. And so the way the formalism works is that we can describe the initial
Starting point is 00:23:23 and the final states of whatever it is, whatever process we're interested in. Let's suppose it's a scattering state. So we collide some electrons together and then we sort of see what happens. There'll be some initial particles that collide or not even necessarily collide, but they could just interact with each other. They could come near each other and then repel each other if they're two electrons, for example. Or there could be some interaction which produces, which leads to one particle decaying into other particles. Whatever the process is, we'll have some initial state, which describes the particles that existed initially, and there's some final state which describes the particles as they exist after the interaction or the decay process has occurred.
Starting point is 00:24:01 There is a special operator in quantum field theory called the S matrix. The S matrix is an evolution operator. It describes the probability amplitude of transitioning from this initial state to the final state. There's some extra caveats there that I'll get into in a second, But the point is that this S matrix will, at the end of the day, be made up of field operators, as well as some various physical constants and other numbers, but field operators form the basis of it. So if we consider a very simple example, suppose we just have a single electron with a particular momentum, and it's just cruising along, it's just propagating through space. That's our initial state.
Starting point is 00:24:45 So we can describe that KET formalism, but we talked about in advanced. quantum mechanics, that's just the vertical line and then you describe the state and then you have the angular bracket. So that's just a way of describing some sort of quantum state. So here you could just describe that by an electron with a certain momentum. And it's just propagating through space, right? It's not interacting with anything. So in order to describe that mathematically, what we can do is the, we don't actually need the full S matrix here. We just use a single field state. So the field operator applied to this initial state and then on the, the, the, the, the final state here will just be the vacuum state, effectively because what we're describing
Starting point is 00:25:24 as an initial state of a propagating electron and the final state is the vacuum. Like, it's not interacting with anything, it's just propagating through the vacuum, and you can imagine that it's eventually annihilated when it's measured. At some point, it will be annihilated. So, what we're doing is we're sort of sandwiching the field operator in between this initial singular momentum state and this final vacuum state, which doesn't have any electron in it. And it turns out what that actually is when you work out the mathematics is you apply the field operator and you sandwich it between these two state descriptions. That actually gives you a plain wave. That's a, that's a solution to the Dirac equation which describes an electron propagating through space without interacting
Starting point is 00:26:06 with anything. So a plain wave is just a simple way to describe a wave that propagates through space and gradually spreads out over time. So the Dirac equation describes the dynamics the dynamic of a free non-interacting fermions, like an electron, as it propagates through space. And we can also describe that using the matrix element formalism that I mentioned before. We have the vacuum state and we have the initial state of the propagating electron, and we sandwich either side of the field operator. And so the field operator then gives you the plane wave solution. The field operator is really just a way of, again, describing mathematically using a momentum basis.
Starting point is 00:26:49 a given quantum state. And so these field operators are solutions to the Dirac equation. So that's all good. So at this point, what we have is a method for describing the evolution of, say, electrons as they propagate through space, just like the Schroedinger equation did, except now we've turned it into the Dirac equation. We've made it relatively invariate. We've incorporated spin in doing so. So that's all good. The main thing that we still need to do here is we need to incorporate interaction effects, because so far we've just been talking about a free electron that's not interacting with anything. It's just propagating through space. It's described by a plane wave solution. That's not really very interesting. We need to be able to describe how it
Starting point is 00:27:32 interacts with other particles, either electrons or other photons as well. So quantum electrodynamics, which is the theory that we're looking at here, describes how fermions, like electrons, interact with photons, so light and matter. So how do we incorporate interaction effects? Well, non-relativistic quantum mechanics, we modify the Schrodinger equation effectively. I mean, it's not really modifying the equation, but what we do is we add a potential energy term, which describes, well, it can describe lots of things, but generally an interaction effect between one particle and another, for example, an electron, which is affected by the potential energy of another electron, or interacting with a photon field or something like that.
Starting point is 00:28:10 As we did that in the Schrodinger equation, we have a variety of ways of then solving the resulting equation so that we can have a way function that describes an interacting particle. In the episode I did on quantum, on computational chemistry a while back, I talked about how complicated this is to solve the Schrodenger equation when there are even more than one electron surrounding a central like atomic nucleus, like in an atom. So we can do that in the Schroenger equation, but it become quite complicated very quickly. What about in the Dirac equation? Well, the Durac equation is more complicated again than the Schrodenger equation, and the
Starting point is 00:28:40 trouble is that we can't modify it in quite the same way just to include a potential. Essentially, there's no solution to that. But there are other ways so that we can try to describe interactions. So the formalism is rather different here. This is where we diverge even further from non-relativistic quantum mechanics. So as I said before, usually in quantum field theory, we can't actually describe, or at least in general, we don't actually have the ability to describe bound states, like an electron orbiting an atom, orbiting an atom, orbiting as part of an atom. Instead, normally what we describe is transition processes, so like scattering processes or decay. where we have an initial state, there's a transition, and it converts into a final state.
Starting point is 00:29:19 We can describe the probability amplitude, you square the probability amplitude to get the probability. So we can describe the probabilities of transitioning from an initial state to a final state through various processes. And that's what the matrix elements that I mentioned before, that's what they do. But we have to be able to describe how you get from the initial state to the final state. And the Dirac equation is not sufficient for that because that just describes freely propagating electrical. We want to describe interacting electrons. Now there is a formulism for doing this, so let me explain a bit how it works. This is called the S matrix.
Starting point is 00:29:52 The S matrix allows us to describe the probability of transitioning from that initial state to that final state in the presence of interactions. And we do that by using a function called the Hamiltonian. I've talked about the Hamiltonian before. The Hamiltonian is the total energy function for a particular system or a particular particle. In this case it will actually be multiple particles because we're talking about particles that interact with each other. I mentioned at the very start of this episode that energy is directly related to change with respect to time and momentum changes with respect to space. And so we're applying that principle here as well. What we want to do is calculate how a system
Starting point is 00:30:30 changes over time, how it evolves from the initial state to the final state. So this S matrix that I just mentioned, this is a time evolution operator. It tells us how a system changes from an initial state to a final state, or it computes that probability. by taking the exponential of the Hamiltonian. So literally the exponential function of that Hamiltonian function, which describes the total energy of the system. By the way, if you're wondering where the exponential function comes in, that effectively arises because of time evolution.
Starting point is 00:31:01 The exponential function often occurs when we have continuous time evolution of something, or continuous evolution of one variable in terms of another. It's the solution to a simple differential equation, right? So that's where the exponential comes from. It's basically saying that the function that describes the energy of a system also describes how that system evolves over time, but continuously. So hence the exponential. The trouble, of course, with the exponential is that it makes it very hard to solve for. So that's why we need to use the series approximation, the Dyson series expansion.
Starting point is 00:31:31 Now, this is a very complicated, and this is why we can't solve these equations exactly because the exponential of a very complicated function is a very, very complicated function. And so we're going to need to use approximation methods. And the particular method that's typically used in quantum field theory is called perturbation theory. Again, I did discuss this in a previous episode as well. I think advanced quantum mechanics we discussed this as well as the quantum field theory episode. perturbation theory is when you expand your complicated function as a series of polynomials. So like increasing powers of, or powers of X in the simple case. But here it's going to be essentially powers of our Hamiltonian function.
Starting point is 00:32:09 It's technically an infinite series, so you have infinite many powers, but you truncated at some point, and so you just calculate the first few and use that as an approximation. How good the approximation is, well, that will depend on the theory in question. For quantum electrodynamics, the interaction strength is relatively low, like the strength of the interaction between electrons and photons. And so the perturbation theory gives a very accurate approximation to the full theory, which is, again, you can't compute. But in the case of perturbation theory, you only need the first few orders. a few terms of the infinite expansion to actually give quite accurate results. So this expansion of the S matrix in terms of increasing powers of the Hamiltonian is called
Starting point is 00:32:50 the Dyson series expansion. And so when I say powers of the Hamiltonian, effectively what I mean is we take one Hamiltonian function. That's like the first order, and then the second order will be two multiplied together, and then the third order would be like three multiplied together and so forth. And so this is a way of approximating the exponential function. Okay, that's fine, But where does this Hamiltonian function come from? Well, just to confuse you a little bit more, there's a bit of terminology here. There are two functions which are very similar to each other, the Lagrangian and the Hamiltonian. So so far I've been talking about the Hamiltonian, the Hamiltonian is the total energy function,
Starting point is 00:33:25 which generates evolution in time. The Lagrangian is very similar. As it turns out, in quantum electrodynamics, they're pretty much identical to each other. The Hamiltonia is actually just the negative of the Lagrangian, or the interaction components, I should say, one is the negative of the other. So look, we don't really need to worry about this negative sign very much, and I'm just kind of going to talk about them interchangeably. They're both described the energy of the system, and that's what's important for us here. So the Hamiltonian is technically what is used in the S-matrix and generates the motion over time.
Starting point is 00:33:56 The Lagrangian is something we'll talk in more detail about when we talk about quantum quantum field theories applied to the standard model, because the Lagrangian is what we use to describe the different interactions in the standard model. But for here, it's just a bit of terminology because interactions are usually described using the Lagrangian, not the Hamiltonian, but you can convert between the pretty easily. So we need to worry too much about this. At any rate, we need to know the form of this Hamiltonian, or Lagrangian, in order to be able to compute the terms of the Dyson series expansion in order to compute the transition probabilities, right? That's the whole point of this exercise that we're trying to do here. We want to be able to describe the probability of one state transitioning into another state.
Starting point is 00:34:36 for that we need the interaction Lagrangian. How do we get the form of that? Well, that is something that we'll need to postpone until I talk about the standard model in general. Although, to some extent, finding the right form of the Lagrangian is an exercise in like guessing and then, you know, testing experimentally whether that generates the correct predictions. But there are many principles that can be appealed to to narrow the search base. For our purposes here, I'm just going to tell you that the interaction Lagrangian, that the part of the energy that is related to the interaction between the electrons and photons is given by two Fermion fields multiplied by a gamma matrix, or the gamma matrices, I should say, all four of them, and a photon field. So it's four
Starting point is 00:35:22 things multiplied together. So we've got two Fermion fields and a photon field plus the gamma matrices there. Remember the gamma matrices? There are four by four matrices, which help to describe the transformations of the spinner terms in different frames of reference and ensure that the spin is properly incorporated into our description. So that's why that is here, because of the spin of the electrons. As to why there are two fermion fields and one photon field, the fundamental reason for that is because fermions and photons are quite different types of things. That photons are a type of particle, which is called a gauge boson, and they mediate the forces between particles. a type of energy that mediates forces, whereas the fermion fields, the electrons, they are matter,
Starting point is 00:36:12 which experience the forces that are mediated by the photons. So they're different types of things, and therefore they appear in our equations differently. That's the best I can do for the moment. I'll get into more detail about that in the standard model episode. For now, we'll just take it as a fact that the type of the interaction that appears in our Lagrangian is two Fermion, fields multiplied by two electron fields, multiplied by a photon field, and then the gamma matrices is thrown in as well. So that's the form of the energy function, which is going to appear in our Hamiltonian, which then in turn will appear in the S matrix and the Dyson series expansion, which will then use to calculate these transition probabilities. So let me summarize where we're
Starting point is 00:37:00 at here. I've just described to you the mathematical form that the Lagrangian, and hence the Hamiltonian takes for an interaction between electrons and photons. This whole Lagrangian, the two electron fields, the gamma matrices and the electromagnetic field, this is really one giant operator, because remember all of the fields in quantum field theory are operators. They don't describe a quantum system. They're mathematical functions that act on some description of a system. So remember you've got your bras and your kets, the angular brackets that I mentioned in previous episodes, and I mentioned before just now as well, what the Lagrangian-Hamiltonian functions will do is they'll act on these quantum states and return effectively a description of the energy
Starting point is 00:37:47 of that state. What we want to do here is use this Lagrangian function, or the Hamiltonian, in order to construct the S-matrix, which is really another operator, that acts on an initial state and generates a probability amplitude of transitioning from that initial state to the final state. So the initial state could be two electrons with certain momentum and the final state could be two electrons with different momentum states, for example. Although there could be more particles created, particles destroyed. There's all sorts of things that it could potentially be. We'll go through a specific example later about like the initial and final states. But that's what the S matrix is doing. It's telling us the probability amplitude of going from the initial
Starting point is 00:38:30 to the final, and it does that using our fields. I mentioned before that that's the ultimate purpose of fields that allows us to describe how our quantum states change over time, because the field doesn't describe the state. The field describes how a state changes, really, and how it can be represented in terms of a momentum basis. So, at this point, what we have is our S-matrix equation, which is basically a bunch of different fields multiplied together. Remember I said that a single Hamiltonian or the Grangian consists of four of these two electron fields, one electromagnetic field and the gamma matrices. But then in order to use the dice and series expansion, you have multiple terms in the series. So the first one has one Hamiltonian, then the next has two multiplied together,
Starting point is 00:39:15 and then the next has three, and so forth. Now, I should mention that here we're actually dealing with something called a Hamiltonian density, which means we need to integrate it over the three dimensions of space and one dimension of time. That's fine. We just have extra integrals that then appear in our S-matrix Dyson series expansion. That's okay. But it does introduce a complication because as we're integrating over the different dimensions, particularly the time dimension, we need to ensure that the Hamiltonian states evaluated at different points in time are placed in the correct order, specifically when we have more than one Hamiltonian term in our orders of our series expansion, like if we have two multiplied together, for example, at second order, or three in third order,
Starting point is 00:40:02 we need to make sure that we place the fields evaluated at different times in the correct temporal ordering. Because the whole point of this expression is that we're trying to describe the probability amplitude of an initial state transitioning into a final state. And that takes place through a sort of a series of intermediate, interactions, and we need to make sure those interactions occur in the right time ordering. And so we introduced something called a time ordering operator, which basically just rearranges the Hamiltonian terms in the integral, so that they always appear in the correct time order. So earlier is to the right, and then progressively later as you go towards the left in the order
Starting point is 00:40:44 that is written. And just a note on how that interacts with relativity, because you may recall in relativity, the order of events can actually change in different frames of reference. However, that's only true for space-like separated events. And basically that means it's only true for events that are so far apart from each other in space that light could not, even light, or anything traveling less than the speed of light, wouldn't be able to get from one event to the other. They're very far away effectively. Any events that are closer together than that always preserve the temporal ordering regardless of what frame of reference you're in.
Starting point is 00:41:20 And so in order to account for this, we actually make an additional postulate called microcausality in quantum field theory, which basically says that the Hamiltonian functions evaluated at events that have a space-like separation, always commute with each other, meaning that it doesn't matter what order they're in. So for the times when it does matter what order they're in, namely when the events are not space-like separated, they're like or time-like separated, we can always unambiguously specify what order. order there in because events that are not very far apart from each other in space relative to time always have the same ordering in every reference frame. It's only for those events that don't have an unambiguous ordering, that the ordering changes in different reference frames. In those cases, it doesn't matter what order the Hamiltonian is placed in. So it all works out. Either way, if the order matters, you can specify in an invariant way what order the events occur in, or if you can't specify in an unambiguous way, what order they're occurring, then the order
Starting point is 00:42:20 it doesn't matter. So this extra assumption allows us to always apply this time ordering operator when we're integrating over the different times that the events can occur. Now, the trouble with this time ordering operator, though, is that it does make the mathematics a bit more complicated. So what we have then is the time ordering of these multiple Hamiltonians multiplied together, two Hamiltonians for the second order level expansion three for the third, and then so forth in our Dyson series. And remember, each of these Hamiltonians consist of several different field operators multiplied together. So particularly two Fermion fields, electromagnetic field, plus the gamma matrices. So what we end up with is this time-ordering operator applied to this big long list
Starting point is 00:43:03 or big-long multiplication of all of these field operators. And that's very mathematically difficult to manage. So how do we then make progress here in calculating the transition amplitudes, the probability the amplitudes from the initial to the final state. Well, thankfully, there's a mathematical theorem that helps us out, and it's called Wix theorem. I mentioned this in the previous episode on quantum field theory, but I didn't really explain what it is. Wix theorem states that the time ordering of any number of field operators multiplied
Starting point is 00:43:35 together is equal to the normal ordering of all possible contractions of two of those fields together. So we need to break that down as to what? that means. First of all, the normal ordering is a way of ordering the operators, the creation and annihilation operators that we have. Remember, those are part of our fields. Each of our field operators, in turn, is built out of annihilation and creation operators, which create and annihilate particles at particular memento. In order for that to make sense, we need to make sure that we put those operators in the right order. So specifically, we have all the creation operators
Starting point is 00:44:14 put to the left and all the annihilation operators on the right. Basically, that means that then we do all the annihilations first, and then we do all the creations to make sure that we end up with essentially going from the initial state to the final state, which is what we want. Specifically, the way that we think about it is that you start with whatever you have in your initial state, you annihilate all those particles, and then you propagate them, and then you create them all back again at the very end. So that's why we want to.
Starting point is 00:44:39 That's actually how this description works. You sort of imagine starting with some initial state, you destroy everything there and then propagate the fields and then create everything back at the end. That's a bit of an oversimplification, but that's kind of how the formalism actually works. And so the normal ordering ensures that the order of the creation and annihilation operators is consistent with that description. Now, in terms of a contraction, a contraction is a special operation which just takes two fields, so there could be many, many fields multiplied together, but it just takes two of these
Starting point is 00:45:11 field operators and computes the time-ordered vacuum expectation value of those two fields. The vacuum expectation value was just when we sandwich the two fields in between a vacuum state. Remember, you've got your bras and your ket states, so the vacuum state is just the state with no particles in it. And so we put one of those on either side, a zero in the angular brackets on the right and one on the left as well. and then in between we put the time-ordered product of two different field states. This special construction here, a two-field vacuum expectation value, is what we get when we do a contraction,
Starting point is 00:45:51 and it has a special name. They're called propagators. So propagators play a very important role in quantum field theory. We've kind of been talking about a generalized form of propagators all along, as we've been talking about the computing the matrix elements because they're very similar ideas.
Starting point is 00:46:10 So what a propagator does is it gives the probability amplitude for a particle to be created at one location, propagate to another location and then be destroyed at that location. And so that's why there are two field operators in the propagator. One of them essentially does the creating at one location and the other
Starting point is 00:46:26 does the destroying at the other location. So that's what a contraction is. A contraction turns two fields. into a propagator into this vacuum expectation value that gives a probability amplitude. So coming back to Wix theorem, what Wix theorem does is it takes this time ordering of many, many fields all multiplied together, it turns it into the sum of all of the possible ways of creating a single propagator from all of these fields, plus all of the ways of creating two propagators from these fields, and all the ways of creating three propagators from all these
Starting point is 00:46:57 fields, and so on until we've exhausted all of the fields. Now, why would we want to do so? I mean, how is it advantageous to turn one product into a whole bunch of products? Well, the answer is because time ordering is very complicated to do when we have lots of fields, but it's much easier to do if you only have two fields. And particularly, we already have a formalism for this. It's the propagators, right? We can describe quite clearly what the propagator is. So it's very handy to turn the time ordering of this whole bunch of fields into the normal
Starting point is 00:47:26 ordering of two fields and then multiply by the other fields. And it turns out that we can actually readily interpret all of the elements in our WIC expansion in a very simple way. And so this will turn out to be a very handy theorem indeed. Don't worry if it's still a little unclear. All the pieces are coming together. So at this point, what we have is a way of describing the transition from an initial state to the final state.
Starting point is 00:47:55 That's our S matrix. We can compute the S matrix by using the Hamiltonian. so that's the energy of our system and taking the exponential of that Hamiltonian. But an exponential of a Hamiltonian is mathematically intractable, so we use the Dyson series expansion, which expands it out into one Hamiltonian, and then two Hamiltonians multiply together, three Hamiltonians, and then so forth. So there's an infinite series expansion there.
Starting point is 00:48:18 Usually we just focus on the first few terms in the series. But we do need to add a time ordering when we start multiplying these multiple Hamiltonians together because we need to make sure that the different interactions happen in the correct. ordering. The time evolution operator causes a bit of mathematical trouble, and so we use Wix theorem to simplify it, Wix theorem turning a time ordering of a bunch of field operators multiplied together into a series of contractions, where we have, well, no contractions, and then two fields contracted together, all the possible ways of doing that with two fields, and then two different contractions, so all possible ways of doing two contractions with two fields,
Starting point is 00:48:58 and then three contractions and then so forth into we've exhausted all of the fields that are appearing inside the time order. If it combined together the Dyson series expansion plus the WIC expansion, we actually turn a very, very complicated expression into relatively simple expressions. And we can put all these terms together and actually finally do the calculation to get our matrix element, the S matrix that gives the probability amplitude from transitioning from the initial to the final states. So the next task is to actually calculate the terms in the WIC expansion and use that to determine our transition probability amplitude. But we're going to put a pause in that and resume that discussion in the next part of the episode.
Starting point is 00:49:41 Originally this was going to be a single episode, but we ended up covering quite a lot of territory, and so I'll split it here. So in the next episode, we're going to be talking about Feynman diagrams and how we use those to calculate the transition matrix probabilities, and how we use those to predict the cross sections for the scattering amplitudes, which are the things that we can experimentally measure, and we'll also talk about renormalization. So stay tuned for that when we finish out this interesting discussion in quantum field theory. If you enjoy this episode, you can support the podcast by going to your aggregator of choice, such as Spotify, leaving a favorable review.
Starting point is 00:50:15 You can also check out the podcast on YouTube, Science of Everything podcast, and like some of the videos or leave comments on episodes that you've seen. and that helped to bring the episodes to a new audience on that platform. If you'd like to make a financial contribution, you can give a one-off donation through PayPal to my email, Fodd's 12 at gmail.com, or go to Patreon and become a Patreon supporter. I greatly appreciate all of the support I received from the General Stone is there.
Starting point is 00:50:39 If you'd like to leave feedback about the podcast or request future episodes or any other comments or questions, you can contact me, my email, Fodst12 at gmail.com. That's FODDS12 at gmail.com. Thank you very much for listening, I'll talk to you next time.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.