Notes on distributions
$ \newcommand{\deltaComplex}[1]{\delta_{c}(#1)} \newcommand{\deltaComplexWithPi}[1]{2\pi\delta\left(i(#1)\right)} $
Under construction.
Tempered distributions
In this section we introduce by examples the regularization and normalization of distributions. For simplicity, we focus on tempered distributions on \(\RR\).
A tempered distribution \(\phi\in \cS'(\RR)\) acting on the rapidly decreasing test function \(f\in \cS(\RR)\) can be formally written as an integral
with the kernel \(\phi(x)\). It is useful to conceptualize \(f\) as a Gaussian wave-packet and \(\phi\) as a possibly sharp observable.
Regularization
When the kernel \(\phi(x)\) contains singularities, the integral \eqref{eq: functional} is convergent only for a subspace \(V_{\phi}\subset \cS(\RR)\) of test functions. To extend the domain of \(\phi\) from \(V_{\phi}\) to \(\cS(\RR)\), we need to subtract off the divergences in \eqref{eq: functional} in a systematic way, and this procedure is called regularization of distributions.
The extension is highly nonunique. We are interested in the case where a family of distributions \(\phi_{a}(x)\) depends on the parameter \(a\) analytically. Then the analyticity of \(a\) can help choose a unique regularization of \(\phi_{a}(x)\).
Normalization
If \(\phi_{a}(x)\) is meromorphic w.r.t. \(a\in U\subset\CC\), we can cancel the poles of \(\phi_{a}(x)\) by another meromorphic function \(N(a)\) such that the normalized distribution \(N(a)\phi_{a}(x)\) is holomorphic in \(U\), called normalization of distributions.
Example 1: principal value
The principal values of \(x^{-1}\) and \(x^{-2}\) are
Notice that although \(x^{-2}\) as a function is positive-definite, as a distribution it is not.
Example
f[x_]:=1/(1+x^2);
Integrate[f[x],{x,-∞,∞}]
Integrate[(f[x]+f[-x]-2f[0])/x^2,{x,0,∞}]
Out[] = π
Out[] = -π
Example 2: Hilbert transform
The regularization of \(\phi(x)=\frac{1}{x-y}\) can be chosen as
This regularized distribution defines the Hilbert transform - a unitary operator on \(L^2(\RR)\) 1 with two eigenspaces
called the Hardy spaces. It is also identified as a specific Knapp-Stein intertwining operator of \(\slgroup(2,\RR)\). Consequently the ramified principal series representation at the end-point is reducible, with the decomposition \eqref{eq: Hardy spaces}.
Example 3: power-type distribution
The power-type distribution in \(\cS'(\RRgeq)\)
is convergent if \(\Re a> -1\) and hence is holomorphic in this region. If \(\Re a\leq -1\) the integral is divergent and acquires regularization at \(x=0\). Inserting a real-analytic test function \(f(x)=\sum_{n=0}^{\oo} \frac{f^{(n)}(0)}{n!} x^n\) and interchanging the order of integration and summation, we have
The first term is well-controlled due to the rapid decay of \(f(x)\) and is irrelevant to our discussion. The second term implies that \((x_{+}^{a},f(x))\) as a function of \(a\) is meromorphic in \(\CC\), with simple poles at \(a=-1,-2,\dots\) and residues being
Then using \((\delta^{(n)}(x),f(x))= (-1)^{n} f^{(n)}(0)\) to strip off the test function we find \(x_{+}^{a}\) is meromorphic w.r.t. \(a\in \CC\), with simple poles at \(a=-1,-2,\dots\) and residues being
The distribution \(x_{+}^{a}\) can be normalized as \(\frac{1}{\Gamma[a+1]}x_{+}^{a}\), then the values at the removed poles are
Singularity near infinity
Concretely, we choose the test function as \(f(x)=e^{-x}\), then
manifests the meromorphicity of \(x_{+}^{a}\) and provides the normalization above.
If choosing the test function as \(f(x)=(1+x)^{-1}\) however, there are additional poles appeared:
The reason is that this test function is not in \(\cS(\RRgeq)\) and should be divided into two parts:
-
one is in \(\cS(\RRgeq)\) and detects the singularity of \(x_{+}^{a}\) at \(x=0\), providing the factor \(\gm{a+1}\);
-
another is well-behaved as \(x\to \oo\) and decays sufficiently fast as \(x\to 0\), then detects the singularity of \(x_{+}^{a}\) near infinity, providing the factor \(\gm{-a}\).
To see this, we have for example
Regarding \(x\) as energy, in practice the test functions \(f(x)\) are not in \(\cS'(\RRgeq)\), and the singularities of \(x_{+}^{a}\) at \(0\) and \(\oo\) convert the IR and UV information of \(f(x)\) into analytic behaviors of \(a\). This is one of the reasons why the Mellin transform is commonly used in physics.
Homogeneous tempered distributions
In this section we consider distributional solutions to the functional equation
for \(x\in\RR,\, \lambda>0\). These solutions are called homogeneous tempered distributions.
Different bases
The first basis of solutions is
and we can recombine them according to parity as 2
These distributions are meromorphic functions of \(a\in\CC\), and the normalized versions are
Another basis is \((x \pm i\epsilon )^{a}\), which are boundary values of meromorphic functions. The relations to the previous one are
and the inverse relations are
The poles of \(x_{\pm}^{a}\) at \(a=-1,-2,\dots\) get canceled due to the phase factor \(e^{ia\pi}\), hence \((x \pm i\epsilon)^{a}\) is holomorphic w.r.t. \(a\in\CC\).
Fourier transform
The Riemann-Lebesgue lemma asserts that if \(f(x)\in L^{1}(\RR^d)\) then the Fourier transform \(f(p)\) vanishes at infinity,
The intuition here is that decay properties of distributions are Fourier-transformed into regularity properties.
Under the Fourier transform, the basis \(|x|_{\epsilon}^{a}\) remains invariant due to parity. In particular, the Dirac \(\delta\)-functions are the Fourier transforms of powers
The bases \(x_{\pm}^{a}\) and \((x\pm i \epsilon)^{a}\) are Fourier transform pairs, revealing the Paley-Wiener type theorems, which state that support properties of distributions are Fourier-transformed into analytic properties.
Higher dimensions
For higher dimensions, the spherical-symmetric solution of \(f(\lambda x)=\lambda^{a} f(x)\) for \(x\in\RR^{n},\, \lambda>0\) is \(|x|^{a}\). In the spherical coordinates \(x=r\hat{x},\, \hat{x}\in\sphere{d-1}\), the action of \(|x|^{a}\) on a test function is
hence the possible poles are at \(a=-n-d,\, n\in\NN\). Futhermore, the poles for odd \(n\)-s are absent, since the scalar differential operators constructed from \(\partial_x\) are \(\square^{n}\). This implies \(\frac{1}{\Gamma[\frac{a+d}{2}]}|x|^{a}\) is holomorphic w.r.t. \(a\in\CC\), and the values at the removed poles are
Especially for \(n=0\), we have
Two dimension with spin
Lacia-TimeStamp-2025-02-14-18:24:40
Summary
The three bases are summarized into the following table.
\(a\) | \(-2\) | \((-2,-1)\) | \(-1\) | \(\cdots\) |
---|---|---|---|---|
parity-even | \(\frac{1}{x^2}\) | \(\abs{x}^{a}\) | \(\delta(x) = \epsilon \abs{x}^{-1+2\epsilon}\) | \(\cdots\) |
parity-odd | \(\delta'(x) = -\epsilon \abs{x}^{-2+2\epsilon}_{1}\) | \(\abs{x}^{a}_{1}\) | \(\frac{1}{x}\) | \(\cdots\) |
half-line | \(-\) | \(x_{\pm}^{a}\) | \(-\) | \(\cdots\) |
\(i\epsilon\)-prescription | \((x \pm i\epsilon)^{-2}\) | \((x \pm i\epsilon)^{a}\) | \((x \pm i\epsilon)^{-1}\) | \(\cdots\) |
The regularized homogeneous distributions are summarized into the following table, where \(n\in \NN\).
Distributions | Values | Removed poles |
---|---|---|
\(\displaystyle \frac{1}{\Gamma[a+1]}x_{+}^{a}\) | \(\displaystyle \delta^{(n)}(x)\) | \(a = -n-1\) |
\(\displaystyle \frac{1}{\Gamma[a+1]}x_{-}^{a}\) | \(\displaystyle (-1)^{n}\delta^{(n)}(x)\) | \(a = -n-1\) |
\(\displaystyle \frac{1}{\Gamma[\frac{a+1}{2}]}\abs{x}^{a}_{0}\) | \(\displaystyle \frac{(-1)^{n} n!}{(2n)!}\delta^{(2n)}(x)\) | \(a = -2n-1\) |
\(\displaystyle \frac{1}{\Gamma[\frac{a+2}{2}]}\abs{x}^{a}_{1}\) | \(\displaystyle \frac{(-1)^{n+1} n!}{(2n+1)!}\delta^{(2n+1)}(x)\) | \(a = -2n-2\) |
\(\displaystyle (x+i\epsilon )^a\) | \(\displaystyle x^{-n-1}-i\pi \frac{(-1)^{n}}{n!} \delta^{(n)}(x)\) | \(a = -n-1\) |
\(\displaystyle (x-i\epsilon )^a\) | \(\displaystyle x^{-n-1}+i\pi \frac{(-1)^{n}}{n!} \delta^{(n)}(x)\) | \(a = -n-1\) |
\(\displaystyle \frac{1}{\Gamma[\frac{a+d}{2}]}\abs{x}^{a} \textInMath{ on } \mathbb{R}^{d}\) | \(\displaystyle \frac{(-1)^{n}\pi^{\frac{d}{2}}}{2^{2n}\Gamma[\frac{d}{2}+n]}\square^{n}\delta(x)\) | \(a = -2n-d\) |
Hyperfunctions
Hyperfunctions are another type of distributions related to analyticity, and share similar properties with other types of distributions: linearity, existence of derivatives, \etc.
Given a pair \((f,g)\) such that \(f\) and \(g\) are holomorphic on the upper and lower half planes respectively, the formal difference \(f-g\) specifies a hyperfunction on \(\RR\). This specification admits redundancy: for any holomorphic function \(h\) on the whole complex plane the two pairs \((f,g)\sim (f+h,g+h)\) are equivalent.
Formal distributions
As a formal distribution, the \(\delta\)-function is
which simply restates the contour deformation
Analytic functionals
Complex \(\delta\)-function
The \(\delta\)-function with complex argument \(\delta_{c}\) is an analytic functional belonging to the Gelfand–Shilov space \(Z'\), which is the dual space of analytic test functions.
It is defined as the identity of the Mellin transform
and formally can be written as
As a generalization of the usual Dirac \(\delta\)-function, when \(\Delta_{0}\) locates on the integration contour \(\Delta\in a + i\RR\) in \eqref{eq: complex Delta function and Mellin transform}, \(\delta_{c}\) reduces to
When \(\Delta_{0}\) leaves off the integration contour, \(\delta_{c}\) admits the following approximations:
Equation \eqref{eq: complex Delta function approximation a}
This can be shown by a contour deformation argument.
For \(\Re\Delta_{0}>a\), we consider a test function \(f(\Delta)\) that is analytic in the region \(a<\Re \Delta\) and decays sufficiently fast as \(\Delta\to +\oo\). As \(\epsilon^{\Delta-\Delta_{0}}\to 0\), we can enclose the contour to the right and pick up the poles at \(\Delta=\Delta_{0}+n,\, n\in\NN\), leading to
which justifies the defining property \eqref{eq: complex Delta function and Mellin transform}.
From the analysis above, we observe that only the term \(n = 0\) needs to be dominant. Consequently, the requirement on test functions can be relaxed: it suffices for \(f(\Delta)\) to be holomorphic in the strip \(a < \Re \Delta < \Re \Delta_{0}\), to be meromorphic or contain branch cuts in the region \(\Re \Delta_{0} < \Re \Delta\), and to decay sufficiently fast as \(\Delta \to +\infty\).
Principal value at \(\Re\Delta_{0}=a\)
Notice that if naively applying the approximation \eqref{eq: complex Delta function approximation a} to the case \(\Re\Delta_{0}=a\), there would appear an extra factor \(\half\).
The reason is as follows: in this case the leading pole \(\Delta=\Delta_{0}\) lies on the contour, hence the integral is divergent and should be understood as the principal value. When deforming the contour to the right, it is necessary to consider the contribution of a small semicircle \(C_{\Delta_{0},\delta}\eqqq\set{\Delta:\Delta=\Delta_{0}+\delta e^{i\theta}, \frac{\pi}{2}<\theta<\frac{3\pi}{2}}\) surrounding the leading pole:
Mellin transform
Lacia-TimeStamp-2025-02-14-18:28:08
Under the Mellin transform, we have the following pairs:
Now we check the consistency of the above Mellin pairs.
Miscellaneous
The Dirac \(\delta\)-function admits the following approximations
where \(\epsilon>0\) and the following properties
An identity
We consider the distribution \(\theta(z)\delta(i(z-\zb))\) on \(\RR^{2}\), which support on the positive real axis. In the Cartesian and radial coordinates, we have
As a periodic function of \(\theta\), we can use the identity
to obtain
In this way, the distributional behavior of \(\theta(z)\delta(i(z-\zb))\) can be understood as a divergent sum of powers of \((z,\zb)\).
Relation between \(\delta\)- formal and tempered distributions
The Dirac \(\delta\)-function satisfies
We show that the two are equivalent. Without loss of generality for \(w>0,r\in(0,w)\), the disk \(|z|\leq w+r\) can be map to the upper half plane \(\Re x\geq 0\) by the Mobius transformation \(z(x)=(w+r) \frac{(r+2 w) (x-y)-i r \epsilon }{(r+2 w) (x-y)+i r \epsilon }\) such that \(z(y+i\epsilon)=w\), and similarly the disk \(|z|\geq w-r\) can be mapped to the lower half plane \(\Re x\leq 0\) by \(z(x)=(w-r)\frac{(r-2 w) (x-y)+i r \epsilon }{(r-2 w) (x-y)-i r \epsilon }\) with \(z(y-i\epsilon)=w\). Hence we have
where the error term