3 Interacting Quantum Fields

The theories we have looked at so far are described by Gaussian partition functions or, equivalently, actions that are quadratic in the fields. These theories are trivial, in the sense that all \(N\)-point functions (which collectively describe everything there is to know) decompose identically into \(2\)-point functions. To do useful calculations we need to consider interacting theories. The particles can either interact with themselves, or we can have multiple types of particle which can interact with one another.

3.1 \(\varphi^{4}\) theory

We will focus on the first case: a scalar field with self-interactions, called \(\varphi^{4}\) theory. The action is

\[S=S_{0}+\int\text{d}^{D+1}x\varphi_{x}J_{x}+S_{\text{int}} \tag{3.1}\]

where

\[\begin{aligned} S_{0} & \triangleq S_{\text{KG}}\\ S_{\text{int}} & \triangleq\frac{\lambda}{4!}\int\text{d}^{D+1}x\varphi^{4}_{x}. \end{aligned}\]

You may hazard a guess as to where the name comes from! This action does not describe any particles in the standard model (nor does Klein Gordon), but it does describe other physical theories. For example, phonons emerging from balls and springs with an anharmonic potential between atoms, or the Ising model of interacting spins. The normalisation \(1/4!\) turns out to be convenient later.

3.2 Perturbation Theory

To calculate the \(N\)-point functions for a non-Gaussian theory we can use perturbation theory. The basic idea is pretty simple. The generating functional is now given by

\[\begin{aligned} \mathscr{Z}_{\lambda}\left[J\right] & =\int\mathscr{D}\varphi\exp\left(iS_{0}+i\frac{\lambda}{4!}\int\text{d}^{D+1}x\varphi^{4}_{x}+i\int\text{d}^{D+1}x\varphi_{x}J_{x}\right)\\ & =\int\mathscr{D}\varphi\exp\left(iS_{0}+i\int\text{d}^{D+1}x\varphi_{x}J_{x}\right)\exp\left(i\frac{\lambda}{4!}\int\text{d}^{D+1}x\varphi^{4}_{x}\right) \end{aligned}\]

and we can expand the exponential of the interaction term as a Taylor series to give

\[\mathscr{Z}_{\lambda}\left[J\right]=\int\mathscr{D}\varphi\sum^{\infty}_{n=0}\frac{1}{n!}\left(i\frac{\lambda}{4!}\int\text{d}^{D+1}x\varphi^{4}_{x}\right)^{n}\exp\left(iS_{0}+i\int\text{d}^{D+1}x\varphi_{x}J_{x}\right).\]

At each order in \(\lambda\) this is simply the expectation value of a polynomial of the fields, and any such object can be calculated using Wick’s theorem. Let’s take a look at some specific examples.

3.2.1 The interacting vacuum

The non-interacting field theory had an empty vacuum. But when interactions are present, the vacuum changes its form. To see this, we can calculate the 0-point function [^8], which is simply \(\mathscr{Z}_{\lambda}\left[J\right]\) itself. To \(\mathcal{O}\left(\lambda^{2}\right)\) we have:

\[\mathscr{Z}_{\lambda}\left[J\right]=\int\mathscr{D}\varphi\left(1+i\frac{\lambda}{4!}\int\text{d}^{D+1}x\varphi^{4}_{x}+\frac{1}{2}\left(i\frac{\lambda}{4!}\right)^{2}\int\text{d}^{D+1}x\int\text{d}^{D+1}y\varphi^{4}_{x}\varphi^{4}_{y}\right)\exp\left(iS_{0}+i\int\text{d}^{D+1}x\varphi_{x}J_{x}\right).\]

We work order-by-order in \(\lambda\), and find

\[\frac{\mathscr{Z}_{\lambda}\left[J\right]}{\mathscr{Z}_{0}\left[0\right]}=\frac{\mathscr{Z}_{0}\left[J\right]}{\mathscr{Z}_{0}\left[0\right]}+i\frac{\lambda}{4!}\int\text{d}^{D+1}x\langle\varphi^{4}_{x}\rangle_{0}+\frac{1}{2}\left(i\frac{\lambda}{4!}\right)^{2}\int\text{d}^{D+1}x\int\text{d}^{D+1}y\langle\varphi^{4}_{x}\varphi^{4}_{y}\rangle_{0}+\mathcal{O}\left(\lambda^{3}\right)\]

where we have introduced the convenient notation

\[\left\langle A\right\rangle _{0}\triangleq\frac{\int\mathscr{D}\varphi A\exp\left(iS_{0}\right)}{\int\mathscr{D}\varphi\exp\left(iS_{0}\right)}\]

i.e. the familiar expectation value without the interaction term. We proceed using Wick’s theorem, which says that any such \(\left\langle A\right\rangle _{0}\) expectation value is equal to the sum of all possible products of 2-point functions. This gives

\[\begin{aligned} \frac{\mathscr{Z}_{\lambda}\left[J\right]}{\mathscr{Z}_{0}\left[0\right]} & =\frac{\mathscr{Z}_{0}\left[J\right]}{\mathscr{Z}_{0}\left[0\right]}+i\frac{\lambda}{4!}\int\text{d}^{D+1}x3\langle\varphi^{2}_{x}\rangle_{0}\langle\varphi^{2}_{x}\rangle_{0}\\ & +\frac{1}{2}\left(i\frac{\lambda}{4!}\right)^{2}\int\text{d}^{D+1}x\int\text{d}^{D+1}y\left\{ 3\times3\langle\varphi^{2}_{x}\rangle_{0}\langle\varphi^{2}_{x}\rangle_{0}\langle\varphi^{2}_{y}\rangle_{0}\langle\varphi^{2}_{y}\rangle_{0}\right.\\ & +\left.\phantom{}^{4}C_{2}\times\phantom{}^{4}C_{2}\times2\langle\varphi_{x}\varphi_{y}\rangle_{0}\langle\varphi_{x}\varphi_{y}\rangle_{0}\langle\varphi^{2}_{x}\rangle_{0}\langle\varphi^{2}_{y}\rangle_{0}+4!\langle\varphi_{x}\varphi_{y}\rangle^{4}_{0}\right\} \\ & +\mathcal{O}\left(\lambda^{3}\right). \end{aligned}\]

The counting is as follows. In the \(\mathcal{O}\left(\lambda\right)\) term, there are three ways for the first \(\varphi_{x}\) to pick a partner, and then the other pair is fixed. Similarly for the first \(\mathcal{O}\left(\lambda^{2}\right)\) term (for \(x\) and \(y\) separately). For the second \(\mathcal{O}\left(\lambda^{2}\right)\) term there are \(\phantom{}^{4}C_{2}\) ways to choose one \(xx\) pair; same for the \(yy\) pair; then there are two ways to choose the first \(xy\) pair and the second is fixed.

We now use the fact that the non-interacting 2-point function is the Green’s function to give

\[\begin{aligned} \frac{\mathscr{Z}_{\lambda}\left[J\right]}{\mathscr{Z}_{0}\left[0\right]} & =\frac{\mathscr{Z}_{0}\left[J\right]}{\mathscr{Z}_{0}\left[0\right]}+i\lambda\int\text{d}^{D+1}x\frac{1}{8}G^{2}_{xx}\\ & +\left(i\lambda\right)^{2}\int\text{d}^{D+1}x\int\text{d}^{D+1}y\left\{ \frac{1}{2\cdot8^{2}}G^{2}_{xx}G^{2}_{yy}+\frac{1}{16}G_{xx}G^{2}_{xy}G_{yy}+\frac{1}{2\cdot4!}G^{4}_{xy}\right\} \\ & +\mathcal{O}\left(\lambda^{3}\right). \end{aligned}\]

3.2.2 Interacting 2-point function

Next we can redo the calculation of the 2-point function from Section @subsec-2-point-functions-from-Z. The \(\mathcal{O}\left(\lambda^{0}\right)\) term is again the non-interacting expression. The \(\mathcal{O}\left(\lambda^{1}\right)\) expression is

\[\begin{aligned} \langle\mathcal{T}\varphi_{2}\varphi_{1}\rangle & =\left.\mathcal{T}\mathscr{Z}^{-1}_{\lambda}\left[0\right]\left(-i\frac{\delta}{\delta J_{1}}\right)\left(-i\frac{\delta}{\delta J_{2}}\right)\mathscr{Z}_{\lambda}\left[J\right]\right|_{J=0}\\ & =\left.\mathcal{T}\mathscr{Z}^{-1}_{\lambda}\left[0\right]\int\mathscr{D}\varphi\left(i\frac{\lambda}{4!}\int\text{d}^{D+1}x\varphi^{4}_{x}\right)\left(-i\frac{\delta}{\delta J_{1}}\right)\left(-i\frac{\delta}{\delta J_{2}}\right)\exp\left(iS_{0}+i\int\text{d}^{D+1}x\varphi_{x}J_{x}\right)\right|_{J=0}\\ & =\left.\mathcal{T}\mathscr{Z}^{-1}_{\lambda}\left[0\right]\int\mathscr{D}\varphi\left(i\frac{\lambda}{4!}\int\text{d}^{D+1}x\varphi^{4}_{x}\right)\left(\varphi_{2}\right)\left(\varphi_{1}\right)\exp\left(iS_{0}+i\int\text{d}^{D+1}x\varphi_{x}J_{x}\right)\right|_{J=0}\\ & =i\frac{\lambda}{4!}\int\text{d}^{D+1}x\left\langle \mathcal{T}\varphi^{4}_{x}\varphi_{2}\varphi_{1}\right\rangle _{0}. \end{aligned}\]

All possible contractions are shown schematically in Fig 3.2.

Figure 3.1: The possible pairs of terms when applying Wick’s theorem to \(\varphi^{4}\) theory to \(\mathcal{O}\left(\lambda\right)\).

Figure 3.2: The possible pairs of terms when applying Wick’s theorem to \(\varphi^{4}\) theory to \(\mathcal{O}\left(\lambda\right)\).

The result, counting all possibilities, is

\[\begin{aligned} \langle\mathcal{T}\varphi_{1}\varphi_{2}\rangle & =i\frac{\lambda}{4!}\int\text{d}^{D+1}x\left\{ 3\left\langle \varphi^{2}_{x}\right\rangle _{0}\left\langle \varphi^{2}_{x}\right\rangle _{0}\left\langle \varphi_{1}\varphi_{2}\right\rangle _{0}+4\times3\left\langle \varphi_{x}\varphi_{1}\right\rangle _{0}\left\langle \varphi_{x}\varphi_{2}\right\rangle _{0}\left\langle \varphi^{2}_{x}\right\rangle _{0}\right\} \\ & =i\lambda\int\text{d}^{D+1}x\left\{ \frac{1}{8}G^{2}_{xx}G_{12}+\frac{1}{2}G_{x1}G_{x2}G_{xx}\right\} . \end{aligned} \tag{3.2}\]

The counting of terms is as follows. For the \(G^{2}_{xx}G_{12}\) term, the first \(x\) has \(3\) ways to choose the second \(x\) with which it contracts. The remaining two \(x\)s are then forced. Hence, \(3\). For the \(G_{x1}G_{x2}G_{xx}\) term, there are \(4\) choices of \(x\) for the \(1\) to pair with, leaving \(3\) choices for the \(2\) to pair with, then the remaining pair of \(x\)s must go together. Hence \(4\times3\).

3.2.3 Interacting 4-point function

Since the 3-point function vanishes, the next simplest term is the 4-point function. At order \(\lambda^{0}\) this is again the same as the unperturbed case. At order \(\lambda^{1}\) it is given by

\[\begin{aligned} \langle\mathcal{T}\varphi_{1}\varphi_{2}\varphi_{3}\varphi_{4}\rangle_{1} & =i\frac{\lambda}{4!}\int\text{d}^{D+1}x\left\langle \mathcal{T}\varphi^{4}_{x}\varphi_{1}\varphi_{2}\varphi_{3}\varphi_{4}\right\rangle _{0}\\ & =i\frac{\lambda}{4!}\int\text{d}^{D+1}x\left\{ 4!G_{1x}G_{2x}G_{3x}G_{4x}+3G^{2}_{xx}\left(G_{12}G_{34}+G_{13}G_{24}+G_{14}G_{23}\right)\right.\\ & +\left.4\times3G_{xx}\left(G_{1x}G_{2x}G_{34}+G_{1x}G_{3x}G_{24}+G_{1x}G_{4x}G_{23}+G_{2x}G_{3x}G_{14}+G_{2x}G_{4x}G_{13}+G_{3x}G_{4x}G_{12}\right)\right\} \\ & =i\lambda\int\text{d}^{D+1}x\left\{ G_{1x}G_{2x}G_{3x}G_{4x}+\frac{1}{8}G^{2}_{xx}\left(G_{12}G_{34}+2\text{ perms}\right)+\frac{1}{2}G_{xx}\left(G_{1x}G_{2x}G_{34}+5\text{ perms}\right)\right. \end{aligned} \tag{3.3}\]

At higher orders of \(\lambda\) things quickly grow in combinatorial difficulty. There is a neat bookkeeping trick for dealing with this, to which we turn now.

3.3 Feynman Diagrams

3.3.1 Diagrams

Feynman invented his eponymous notation to simplify the rapidly growing complexity of the combinatorics just discussed.

The Feynman diagrams for the expansions in the previous sections appear in Fig 3.4 (the vacuum), Fig 3.6 (2-point function), and Fig 3.8 (4-point function).

Take the 2-point function. You start by drawing a vertex for each external field position (anything not integrated over, or equivalently anything explicitly labelled on the left): in our case, \(x_{1}\) and \(x_{2}\). Then draw a single point for each internal field position, in our case \(x\). Now find all the ways to draw lines connecting these points, so that each external point meets precisely one line, and each internal point meets precisely 4 lines.

The reason each internal point has 4 lines meeting is that this is \(\varphi^{4}\) theory. When you Taylor expand the action, every power of \(\lambda\) brings with it a product \(\varphi^{4}_{x}\), so there is no other possibility for lines to meet. E.g. you can’t have 2 lines meet at an internal point.

Figure 3.3: The Feynman diagrams for the vacuum of \(\varphi^{4}\) theory to \(\mathcal{O}\left(\lambda^{2}\right)\).

Figure 3.4: The Feynman diagrams for the vacuum of \(\varphi^{4}\) theory to \(\mathcal{O}\left(\lambda^{2}\right)\).

Figure 3.5: The Feynman diagrams for the 2-point functions in \(\varphi^{4}\) theory at \(\mathcal{O}\left(\lambda\right)\).

Figure 3.6: The Feynman diagrams for the 2-point functions in \(\varphi^{4}\) theory at \(\mathcal{O}\left(\lambda\right)\).

Figure 3.7: The Feynman diagrams for the 4-point functions in \(\varphi^{4}\) theory to \(\mathcal{O}\left(\lambda\right)\).

Figure 3.8: The Feynman diagrams for the 4-point functions in \(\varphi^{4}\) theory to \(\mathcal{O}\left(\lambda\right)\).

3.3.2 Symmetry factors

A remarkable result of the Feynman diagrams is that they instantly yield the difficult combinatorial pre-factors. We term the ‘symmetry factor’ of a Feynman diagram the number the number of symmetry-equivalent diagrams it has. The prefactor of each term in the perturbative expansion is then given by the reciprocal of the diagram’s symmetry factor. To see why this is, it’s easiest to look at some examples.

In Eq 3.2 for the 2-point functions to order \(\lambda\), the first term is \(\frac{1}{8}G^{2}_{xx}G_{12}\). Looking at the corresponding diagram in Fig 3.6 we see that the \(G_{12}\) line is fixed (\(x_{1}\) and \(x_{2}\) are fixed), but the \(G^{2}_{xx}\) has some freedom. It has two lobes. We can flip the left lobe (\(\times2\)), the right lobe (\(\times2\)), and we can interchange the lobes (\(\times2\)). The symmetry factor is \(2\times2\times2=8\), and so the prefactor is \(1/8\).

The second term is \(\frac{1}{2}G_{x1}G_{x2}G_{xx}\). The corresponding diagram has one internal loop, which can be flipped (\(\times2\)). Hence the diagram has a symmetry factor of 2, and the prefactor is \(1/2\).

Now look at the 4-point functions in Eq 3.3, and the corresponding Feynman diagrams in Fig 3.8. The term \(G_{1x}G_{2x}G_{3x}G_{4x}\) has a prefactor of 1. This is actually by construction: it is why we included the \(1/4!\) factor in the definition of \(S_{\text{int}}\). There are 4 ways for \(1\) to choose an \(x\), 3 ways for the \(2\), 2 ways for the \(3\), and one way for the 4, giving \(4!\) terms, cancelling the prefactor. Looking at the diagram, all legs connect to external events, so there is no freedom.

Other symmetry factors can be worked out using similar reasoning. Often it is quite tricky, in the general case; but practical QFTs such as QED are more restricted and typically have simpler symmetry factors.

3.3.3 Physical interpretation of Feynman diagrams

Feynman diagrams suggest a natural physical interpretation. Initially, it can be helpful to draw space and time axes (e.g. time heading up, space horizontal). Then you can interpret the lines (propagators) as worldlines of particles. Heading up, if lines meet (which must happen at a vertex), the particles annihilate. If lines emerge from a vertex, particles are created.

In \(\varphi^{4}\) theory each vertex must be the meeting point of 4 lines, so if two head in, then two head out, and this is like two particles annihilating and another two being created at the same event. The spacetime points of internal vertices are always integrated over: this means we create a quantum superposition of that event happening at any spacetime location.

Consider the 2-point function connecting events \(x_{1}\) and \(x_{2}\), which gives the probability amplitude to detect a particle at \(x_{1}\) and at \(x_{2}\) (recalling that this doesn’t necessarily mean the same particle travels between those events – as the amplitude is non-zero for spacelike separation). In non-interacting \(\varphi^{4}\) theory, there is an amplitude for the particle to propagate from \(x_{1}\) to \(x_{2}\). In the interacting theory, this is also possible. But inspecting Fig 3.6 (rightmost diagram), we see there is also an amplitude for the particle to start propagating from \(x_{1}\), then to emit a particle at \(x\), to catch that same particle also at \(x\), then to propagate to \(x_{2}\).

Giving a causal interpretation to intermediate events in Feynman diagrams is therefore tricky. Closed loops appear, and along any closed loop any spacetime axes must suggest a particle propagating forwards and backwards in time. That’s how the intermediate particle can depart from \(x\) then arrive at \(x\).

The spacetime axes are arbitrary: you could have drawn space heading up, time horizontal, and you would have found a different physical interpretation. This is a helpful reminder that the only measurable things occur at spacetime events corresponding to external vertices.

3.3.4 Feynman Rules

The Feynman diagrams are in one-to-one correspondence with the terms in the perturbation expansion of an interacting QFT. They are typically easy to draw, and it’s fairly easy to check they’ve all been included at a given order. Hence, QFT typically amounts to drawing Feynman diagrams first, then converting them into their analytic expressions. This conversion is carried out by applying ‘Feynman rules’.

For the case above, we have:

The Feynman rules for \(\varphi^{4}\) theory (position space):

To each propagator (line) assign \(G_{xy}\)

To each vertex assign \(i\lambda\int\text{d}^{D+1}x\)

Divide by the symmetry factor.

In high-energy physics it is more common to work with the Fourier transformed propagators, since the boundary conditions typically take the form of specifying the momentum of a particle before and after a collision (recalling that the position and momentum cannot be simultaenously specified). In this case we have

The Feynman rules for \(\varphi^{4}\) theory (momentum space):

To each propagator (line) assign \(\tilde{G}_{p}\)

To each vertex assign \(i\lambda\left(2\pi\right)^{4}\delta^{4}\left(p_{1}+p_{2}+p_{3}-p_{4}\right)\) (4-momentum conservation)

integrate over internal momenta

Divide by the symmetry factor.

3.3.5 The Connected Generating Functional

Inspecting and interpreting the Feynman diagrams, we see that many of them are rather trivial. For example, take the 2-point function \(G_{12}\). At zeroth order this is the usual propagator. The first first-order correction \(G_{12}G^{2}_{xx}\) is just this same propagator along with a ‘bubble diagram’, an event occuring elsewhere in spacetime (at \(x\)). Since fields were introduced to act locally, it would seem odd if we needed to include such unrelated spacetime events. And indeed we do not. What we are really interested in is connected diagrams: those in which the whole diagram is one connected piece. To find the connected diagrams, we need to subtract off all the bubble diagrams. Note that every bubble appears in the vacuum: in fact, the vacuum is precisely the sum of all bubbles.

There is a very elegant trick for calculating only connected diagrams. Motivated by the observation that

\[\frac{\text{d}}{\text{d}x}\ln f=f^{-1}\frac{\text{d}f}{\text{d}x}\]

and the desire to divide out the vacuum, we replace our generating functional \(\mathscr{Z}\left[J\right]\) with

The connected generating functional:

\[\mathscr{W}_{\lambda}\left[J\right]\triangleq\ln\left(\mathscr{Z}_{\lambda}\left[J\right]\right).\]

We use it precisely as we used \(\mathscr{Z}_{\lambda}\left[J\right]\), but now we find only connected diagrams! For example, consider the 2-point function to \(\mathcal{O}\left(\lambda\right)\). We have \[\mathscr{W}_{\lambda}\left[J\right]=\ln\int\mathscr{D}\varphi\sum^{\infty}_{n=0}\frac{1}{n!}\left(i\frac{\lambda}{4!}\int\text{d}^{D+1}x\varphi^{4}_{x}\right)^{n}\exp\left(iS_{0}+i\int\text{d}^{D+1}x\varphi_{x}J_{x}\right)\]

which to \(\mathcal{O}\left(\lambda\right)\) is

\[\begin{aligned} \mathscr{W}_{\lambda}\left[J\right] & =\ln\left[\mathscr{Z}_{0}\left[J\right]+i\frac{\lambda}{4!}\int\mathscr{D}\varphi\left(\int\text{d}^{D+1}x\varphi^{4}_{x}\right)\exp\left(iS_{0}+i\int\text{d}^{D+1}x\varphi_{x}J_{x}\right)\right]\\ & =\ln\left(\mathscr{Z}_{0}\left[J\right]\right)+\ln\left[1+i\frac{\lambda}{4!}\mathscr{Z}^{-1}_{0}\left[J\right]\int\mathscr{D}\varphi\left(\int\text{d}^{D+1}x\varphi^{4}_{x}\right)\exp\left(iS_{0}+i\int\text{d}^{D+1}x\varphi_{x}J_{x}\right)\right] \end{aligned}\]

which can itself be Taylor expanded in \(\lambda\), using

\[\ln\left(1+\lambda\epsilon\right)=\lambda\epsilon+\mathcal{O}\left(\lambda^{2}\right)\]

to give

\[\mathscr{W}_{\lambda}\left[J\right]=\ln\left(\mathscr{Z}_{0}\left[J\right]\right)+i\frac{\lambda}{4!}\mathscr{Z}^{-1}_{0}\left[J\right]\int\mathscr{D}\varphi\left(\int\text{d}^{D+1}x\varphi^{4}_{x}\right)\exp\left(iS_{0}+i\int\text{d}^{D+1}x\varphi_{x}J_{x}\right).\]

After a reasonable bit of algebra (make sure to drop any terms you can spot will be zero along the way, e.g. \(\langle\varphi_{1}\varphi^{4}_{x}\rangle_{0}\)) you will find for the connected 2-point function

\[\begin{aligned} \langle\mathcal{T}\varphi_{1}\varphi_{2}\rangle_{\text{connected}} & =\left.\left(-i\frac{\delta}{\delta J_{2}}\right)\left(-i\frac{\delta}{\delta J_{1}}\right)\mathscr{W}_{\lambda}\left[J\right]\right|_{J=0}\\ & =\langle\mathcal{T}\varphi_{1}\varphi_{2}\rangle_{0}+i\frac{\lambda}{4!}\int\text{d}^{D+1}x\left(\langle\mathcal{T}\varphi_{1}\varphi_{2}\varphi^{4}_{x}\rangle_{0}-\langle\mathcal{T}\varphi_{1}\varphi_{2}\rangle_{0}\langle\mathcal{T}\varphi^{4}_{x}\rangle_{0}\right). \end{aligned}\]

The second term in parentheses subtracts off every bubble. The connected generating functional similarly leads to connected diagrams at all orders, and for all correlation functions.

3.4 Regularisation and Renormalisation

The point of all this perturbation theory is to find how interactions alter the ‘bare’, non-interacting \(N\)-point functions. Broadly, the answer is that the bare functions become renormalised. There are unfortunately two essentially totally unrelated meanings of the word ‘renormalised’, both introduced in QFT, and both of which are relevant here! Let’s look at them in turn.

3.4.1 Renormalisation (Meaning I)

One of the main things we’d like to know about is the interacting 2-point function. The non-interacting 2-point function is the bare propagator for our theory. Adding interactions ‘dresses’ the propagator, changing its form. Often we will be interested to see what happens to one type of particle when we introduce a second type of particle with which it can interact. Typically we will expect that interactions will ‘renormalise’ the properties of both particles, meaning for example that their effective masses or charges change.

A simple picture to hold in mind is to imagine trying to measure the mass of a tennis ball: you can do this by trying to accelerate the ball with a tennis racket, and measuring how much force it takes to impart a given acceleration. Now try the same experiment under water. It will take more force to impart that same acceleration, since the tennis ball is now interacting with the water molecules. Hence, your experiment tells you that the tennis ball’s mass has been renormalised. This analogy also captures the fact that the renormalisation is accompanied by a change in the nature of the ‘vacuum’ itself, from air to water.

This idea applies wherever QFT is used. For example, the bare electron propagator in the standard model corresponds to a particle whose mass does not match the measured mass of the electron, because the electrons you can measure have already been renormalised by interactions with other fields (notably photons, but also \(W\) bosons, Higgs bosons, and anything that couples to electromagnetism or mass).

Often in condensed matter physics we are interested in precisely this type of scenario: two fields, with their associated particle types, where the particles of one field are much lighter than those of the other. In such cases, we typically carry out the functional integral of the lighter particles using perturbation theory, leading to a renormalised effective free (non-interacting) theory of the heavier particles. We call this process ‘integrating out’ a field, and it’s a vital procedure in QFT. Let’s look at a real example now.

3.4.2 The random phase approximation (Renormalisation I)

Consider the interaction of fast electrons with slow phonons in a crystal. Here, we have the QFT

\[\begin{aligned} S & =S_{0}+S_{\text{int}}\\ S_{0} & =\int\text{d}^{4}q\varphi_{q}D^{-1}_{q}\varphi_{-q}+\int\text{d}^{4}k\psi^{\dagger}_{k}G^{-1}_{k}\psi_{k}\\ S_{\text{int}} & =g\int\text{d}^{4}q\int\text{d}^{4}k\varphi_{q}\psi^{\dagger}_{k}\psi_{k+q} \end{aligned}\]

where \(k=\left(\boldsymbol{k},\omega\right)\) (with Euclidean metric),

\[D_{q}=\frac{-2\Omega_{\boldsymbol{q}}}{\omega^{2}-\Omega^{2}_{\boldsymbol{q}}}\]

is the bare phonon propagator for phonons with energy \(\Omega_{\boldsymbol{q}}\) at wavevector \(\boldsymbol{q}\), and

\[G_{k}=\frac{1}{\omega-\xi_{\boldsymbol{k}}}\]

is the bare electron propagator, where \(\xi_{\boldsymbol{k}}\) is the energy of an electron with crystal momentum \(\boldsymbol{k}\). The energy of a particle is given by the locations of the poles in its Green’s function. The positive energy pole of the bare phonon is therefore

\[\omega=\Omega_{\boldsymbol{q}}.\]

We can look at how the interactions with the electron field change this. If you try drawing connected Feynman diagrams to correct the phonon propagator, you will find that the lowest order diagram appears at order \(g^{2}\). It is shown in Fig 3.10. However, there is a neat trick by which we can include an infinite number of Feynman diagrams of a similar form, also shown in that figure. We call this the random phase approximation or RPA (the name is for historical, and rather unconvincing, reasons).

Figure 3.9: The lowest order diagram contributing to the renormalisation of the phonon propagator \(D_{q}\). Here, \(G_{k}\) is the electron propagator, and each vertex contributes a coupling \(g\).

Figure 3.10: The lowest order diagram contributing to the renormalisation of the phonon propagator \(D_{q}\). Here, \(G_{k}\) is the electron propagator, and each vertex contributes a coupling \(g\).

It is important to note that the set of diagrams in the RPA is not all possible diagrams renormalising the phonon propagator. Rather, it is a mathematically convenient subset to consider, but there is no physical justification for discarding the other connected diagrams (the first of which appears at order \(g^{4}\)). RPA is an example of what is called an uncontrolled approximation: no effort is made to check it captures everything, and indeed it does not! This caveat in mind, let’s proceed to calculate the RPA corrected phonon energy.

The loop which appears inside the propagator is called the polarisation bubble, or Lindhard function. It is

\[\chi_{q}=\int\text{d}^{4}kG_{k}G_{k+q}.\]

The frequency integral can be carried out using a contour integral, with the result

\[\chi_{q}=\sum_{\boldsymbol{k}}\frac{f\left(\xi_{\boldsymbol{k}+\boldsymbol{q}}\right)-f\left(\xi_{\boldsymbol{k}}\right)}{\omega+\xi_{\boldsymbol{k}+\boldsymbol{q}}-\xi_{\boldsymbol{k}}}\]

where

\[f\left(\xi\right)=\frac{1}{\exp\left(\frac{\xi}{T}\right)+1}\]

is the Fermi Dirac distribution. The \(\boldsymbol{k}\) integral has been replaced by a discrete sum (since this is a crystal). The sum requires \(\xi_{\boldsymbol{k}}\) to be specified in order to be evaluated, and this is generally done numerically. The interacting phonon propagator is then

\[D^{\text{RPA}}_{q}=D_{q}+D_{q}\left(g^{2}\chi_{q}\right)D_{q}+D_{q}\left(g^{2}\chi_{q}\right)D_{q}\left(g^{2}\chi_{q}\right)D_{q}+D_{q}\left(g^{2}\chi_{q}\right)D_{q}\left(g^{2}\chi_{q}\right)D_{q}\left(g^{2}\chi_{q}\right)D_{q}+\ldots\]

This general form is called a Dyson series. It is simply a geometric series of Feynman diagrams. As such, the infinite sum can be carried out exactly, by noticing that

\[\begin{aligned} D^{\text{RPA}}_{q} & =D_{q}+D_{q}\left(g^{2}\chi_{q}\right)\left(D_{q}+D_{q}\left(g^{2}\chi_{q}\right)D_{q}+D_{q}\left(g^{2}\chi_{q}\right)D_{q}\left(g^{2}\chi_{q}\right)D_{q}+\ldots\right)\\ & =D_{q}+D_{q}\left(g^{2}\chi_{q}\right)D^{\text{RPA}}_{q} \end{aligned}\]

and so, subtracting the second term and gathering things together:

\[\left(1-D_{q}g^{2}\chi_{q}\right)D^{\text{RPA}}_{q}=D_{q}\]

giving the final result

\[\begin{aligned} D^{\text{RPA}}_{q} & =\left(1-D_{q}g^{2}\chi_{q}\right)^{-1}D_{q}\\ & =\frac{-2\Omega_{\boldsymbol{q}}}{\omega^{2}-\Omega^{2}_{\boldsymbol{q}}+2g^{2}\Omega_{\boldsymbol{q}}\chi_{q}}. \end{aligned}\]

This is shown diagrammatically in Fig 3.12.

Figure 3.11: The evaluation of a Dyson series diagrammatically.

Figure 3.12: The evaluation of a Dyson series diagrammatically.

The positive pole of this renormalised propagator occurs at

\[\omega=\sqrt{\Omega^{2}_{\boldsymbol{q}}-2g^{2}\Omega_{\boldsymbol{q}}\chi_{q}}.\]

That is, by interacting with the electrons, the energy \(\omega\) of the phonon at a given wavevector has decreased.

3.4.3 1PI Diagrams and Self-Energies

The Dyson series used in the RPA diagram summation is so useful that it has become a standard method of organising Feynman diagrams in QFT. We define a ‘1-Particle Irreducible’ (1PI) diagram to be:

1PI diagram: any Feynman diagram which remains connected after cutting any one internal line.

A connected diagram is one in which, placing your pen on any line in the diagram, all points can be reached without removing your pen from the page.

An internal line is any line which connects only to internal vertices (those which are integrated over).

With this definition, we can always decompose any Feynman diagram into 1PI diagrams connected with single lines, and we can similarly decompose the infinite set of Feynman diagrams into a Dyson series. How do we know this is always possible? It is by definition! The Dyson series is defined to be the individual 1PI parts connected together with single edges (cutting any one of which will disconnect the diagram). So the Dyson series captures everything that is not 1PI, and since we’re summing everything, the bits in the 1PI blobs have to be all the bits of diagrams which remain connected after a single cut. You can think of 1PI diagrams as a bit like the prime numbers of Feynman diagrams: just as any integer can be factored into a product of primes, any Feynman diagram can be factored into a product of 1PI diagrams.

As in the RPA, we can now deduce the renormalisation of our propagator (or any other \(N\)-point function) by an infinite Dyson series of 1PI diagrams. When renormalising a propagator we define the 1PI blob to be a ‘self energy’ \(\Sigma\). Then, as in RPA, we can argue that

\[\begin{aligned} G^{\text{ren}} & =G+G\Sigma G^{\text{ren}}\\ & \downarrow\nonumber \\ G^{\text{ren}} & =\left(1-G\Sigma\right)^{-1}G. \end{aligned}\]

Example: non-relativistic electron propagator

The clearest example of this effect is with the (spinless, non-relativistic) electron propagator which is sometimes used in condensed matter physics. Here we have

\[G=\frac{1}{\omega-\xi_{\boldsymbol{k}}+i\epsilon}\]

where \(\epsilon\) is an infinitesimal positive real number (specifying how the pole shifts off the real axis in contour integrals). Ignoring \(\epsilon\), the Green’s function has a pole at \(\omega=\xi_{\boldsymbol{k}}\), showing that the energy \(\omega\) of the electron with crystal momentum \(\boldsymbol{k}\) is \(\xi_{\boldsymbol{k}}\).

The probability to find the electron with wavevector \(\boldsymbol{k}\) and energy \(\omega\) is given by the ‘spectral function’

\[\begin{aligned} A\left(\boldsymbol{k},\omega\right) & \triangleq-\frac{1}{\pi}\mathfrak{Im}G\left(\boldsymbol{k},\omega\right)\\ & =\frac{1}{\pi}\frac{\epsilon}{\left(\omega-\xi_{\boldsymbol{k}}\right)^{2}+\epsilon^{2}}. \end{aligned}\]

Noting that this is simply a Lorentzian of width \(\epsilon\) we see that

\[\lim_{\epsilon\rightarrow0^{+}}A\left(\boldsymbol{k},\omega\right)=\delta\left(\omega-\xi_{\boldsymbol{k}}\right)\]

which says that the electron has 100% probability to be found with energy \(\omega=\xi_{\boldsymbol{k}}\). As a quick check that this makes sense, we can confirm that the electron must exist at some energy, and so

\[\int\text{d}\omega A\left(\boldsymbol{k},\omega\right)=1\]

which you can confirm by doing the Lorentzian integral. (This is called a ‘sum rule’). What happens when the electron propagator is renormalised by an infinite Dyson series? We find

\[\begin{aligned} G^{\text{ren}} & =\left(1-G\Sigma\right)^{-1}G\\ & =\frac{1}{\omega-\xi_{\boldsymbol{k}}-\Sigma+i\epsilon}. \end{aligned}\]

Writing

\[\Sigma\triangleq\Sigma'+i\Sigma''\]

for real \(\Sigma',\)\(\Sigma''\), the spectral function becomes

\[\begin{aligned} A\left(\boldsymbol{k},\omega\right) & =-\frac{1}{\pi}\mathfrak{Im}G\left(\boldsymbol{k},\omega\right)\\ & =\frac{1}{\pi}\frac{\Sigma''+\epsilon}{\left(\omega-\xi_{\boldsymbol{k}}-\Sigma'\right)^{2}+\left(\epsilon-\Sigma''\right)^{2}} \end{aligned}\]

where we can now drop the \(\epsilon\ll\Sigma''\) to give

\[A\left(\boldsymbol{k},\omega\right)=\frac{1}{\pi}\frac{\Sigma''}{\left(\omega-\xi_{\boldsymbol{k}}-\Sigma'\right)^{2}+\Sigma''^{2}}.\]

Two things have happened. First, the real part of the self energy \(\Sigma'\) has shifted the peak of \(A\) to \(\omega=\xi_{\boldsymbol{k}}+\Sigma'\), renormalising the energy. Second, the imaginary part \(\Sigma''\) has given a finite width to the spectral function Lorentzian. The electron of wavevector \(\boldsymbol{k}\) is no longer certain to be at the peak in \(A\); rather, it can now have any energy (with the probability for it to have energy between \(\omega\) and \(\omega+\text{d}\omega\) given by \(A\text{d}\omega\)). Note that the sum rule is still obeyed: the total probability to find the electron with some energy is still \(1\).

3.4.4 Renormalisation II

In \(\varphi^{4}\) theory, the field interacts with itself. This self-interaction will renormalise (I) the \(N\)-point functions. To order \(\lambda\), with connected diagrams, the propagator is renormalised as

\[\begin{aligned} G_{12} & \rightarrow G_{12}+i\frac{\lambda}{2}\int\text{d}^{4}xG_{1x}G_{xx}G_{x2} \end{aligned}\]

This time, however, there is a problem. The term

\[G_{xx}=G\left(x-x\right)=G\left(0\right)\propto\delta^{4}\left(0\right)\]

is infinite. We saw similar infinities before, which we eliminated by requiring operators to be normal ordered. This new type of infinity is sometimes called an ‘ultraviolet divergence’, since it has occured in the high energy / high momentum / small distance limit \(x-x=0\). (Ultraviolet is used by analogy to light, where UV is high energy compared to the visible spectrum; similarly Infrared, or IR, is used to describe long wavelength / low momentum / low energy properties). To deal with UV divergences, we use a method called... renormalisation! But despite appearing at the same time in these notes, the two uses are essentially unrelated. Let’s look at what I’ll call Renormalisation II. First, let’s write out the problematic integral explicitly using the expression for the Green’s function

\[G_{xy}=\int\text{d}^{4}p\frac{i}{p^{2}-m^{2}}\exp\left(-2\pi ip\cdot\left(x-y\right)\right)\]

to give

\[i\frac{\lambda}{2}\int\text{d}^{4}xG_{1x}G_{xx}G_{x2}=-i\frac{\lambda}{2}\left(\int\text{d}^{4}p\frac{i}{p^{2}-m^{2}}\right)\int\text{d}^{4}q\left(q^{2}-m^{2}\right)^{-2}\exp\left(2\pi iq\cdot\left(x_{1}-x_{2}\right)\right).\]

To get an idea of how these integrals behave, we can use a simple dimensional analysis. Roughly, we expect an integral of the form

\[\int^{\infty}_{0}\text{d}^{D+1}pp^{n}\]

to be convergent for \(n<-D-1\) and divergent for \(n>-D-1\). For \(n=-D-1\) we expect to get a logarithm, which will diverge with infinite limits (log-divergent: the slowest type of divergence).

The first step to dealing with it is regularisation. The integrals diverge because of the infinite limts. But in reality, momenta cannot be inifinitely large, and distances cannot be infinitely small. In an extreme limit, distances cannot be smaller than the Planck length \(l_{D}=\sqrt{G}\) (where \(G\) is Newton’s constant, in natural units). But we don’t need to go to such extremes: our field theory is a coarse-graining, and will only be valid down to some distance / up to some momentum. Call the upper momentum cutoff \(\Lambda\), which is vastly larger than any momentum we care about but is not infinite (and is much smaller than the ultimate limit set by Planckian units). Returning to our divergent integral

\[\int\text{d}^{4}p\frac{i}{p^{2}-m^{2}}\]

it is convenient to Wick rotate to Euclidean space:

\[\left(E,\boldsymbol{p}\right)\rightarrow\left(-ip^{0},\boldsymbol{p}\right)\]

giving

\[-\int\text{d}^{4}\boldsymbol{p}\frac{1}{\boldsymbol{p}^{2}+m^{2}}\]

and use 4D spherical polar co-ordinates to give

\[\begin{aligned} -2\pi^{2}\int^{\Lambda}_{0}\text{d}p\frac{p^{3}}{p^{2}+m^{2}} & =-\pi^{2}\left[\Lambda^{2}-m^{2}\ln\left(m^{2}+\Lambda^{2}\right)\right]\\ & \approx-\pi^{2}\left[\Lambda^{2}-2m^{2}\ln\left(\Lambda\right)\right] \end{aligned}\]

which contains a quadratic divergence and a log divergence. From here, we simply renormalise (II) by subtracting off this infinite amount. We do this by introducing a ‘counterterm’ in the Lagrange density:

\[\mathscr{L}=\frac{1}{2}\partial^{\mu}\varphi\partial_{\mu}\varphi-\frac{1}{2}\left(m^{2}+\delta^{2}_{m}\right)\varphi^{2}-\frac{\lambda}{4!}\varphi^{4}\label{eq-L-counterterm}\]

so that when the integral is redone, we find \(m^{2}\rightarrow m^{2}+\delta^{2}_{m}\) and

\[\begin{aligned} \int\text{d}^{4}p\frac{i}{p^{2}-m^{2}} & \rightarrow\int\text{d}^{4}p\frac{i}{p^{2}-m^{2}-\delta^{2}_{m}}\\ & =-\pi^{2}\left[\Lambda^{2}-2\left(m^{2}+\delta^{2}_{m}\right)\ln\left(\Lambda\right)\right]. \end{aligned}\]

We’d like this to be finite; let’s chose it to be \(-\pi^{2}\mu^{2}\) for some \(\mu\). This requires

\[\mu^{2}=\Lambda^{2}-2\left(m^{2}+\delta^{2}_{m}\right)\ln\left(\Lambda\right)\]

\[\delta^{2}_{m}=\frac{\mu^{2}-\Lambda^{2}}{-2\ln\left(\Lambda\right)}-m^{2}.\]

Putting it all together, we find that the corrected Lagrange density in Eq ?eq-L-counterterm gives the first-order correction to the 2-point function to be

\[i\frac{\lambda}{2}\int\text{d}^{4}xG_{1x}G_{xx}G_{x2}=i\frac{\lambda}{2}\pi^{2}\mu^{2}\int\text{d}^{4}q\left(q^{2}-m^{2}\right)^{-2}\exp\left(2\pi iq\cdot\left(x_{1}-x_{2}\right)\right).\]

By construction, the UV divergence has been subtracted off, by renormalising the mass (through an infinite amount!). In general, there is a process for identifying \(\mu\) self-consistently by looking at how it changes the parameters of the theory (specifically the coupling \(\lambda\)). In the particular case above, the 1-loop correction to the 2-point function in \(\varphi^{4}\) theory, we can actually choose \(\mu=0\). That is, the propagator does not renormalise at 1st order. The reason is that \(G_{xx}\) is special, as it has no momentum. Its Feynman diagram is called a ‘tadpole’ (when drawn correctly). In general, tadpoles can be renormalised away for free. If we look at the philosophy of Renormalisation II, we see that it is secretly quite similar to Renormalisation I. The idea is that the bare parameters we started with – in this case \(m^{2}\) – are not the physical parameters. The things we measure have already been renormalised, and so it is \(m^{2}+\delta m^{2}\) which is physical. Still, the mathematical process and reasoning behind Renormalisation I and II are appear rather different.