Thursday 23 May 2019

Deriving electromagnetism from relativity and electrostatics.

Part 1: Liénard–Wiechert potential

The Liénard–Wiechert potential is given by-
$$\varphi(\mathbf{r}, t) = \frac{1}{4 \pi \epsilon_0} \left(\frac{q}{(1 - \mathbf{n} \cdot \mathbf{v_s})|\mathbf{r} - \mathbf{r}_s|} \right)_{t_r}$$
Here, $q$ is the source charge, $n$ is unit vector pointing towards the charge from location $\mathbf{r}$, $v_s$ is the source velocity at a retarded time, and $\mathbf{r}$ is the source location at a retarded time.

There is a similar potential given in Feynman Lecture Vol. 2 which is only an approximation and misses the $\frac{1}{1 - \mathbf{n} \cdot \mathbf{v_s}}$ factor. In spite of the extra factor in the LW potential, it is simpler because the local time component $\varphi$ of four vector field potential is invariant under Lorentz transformation. We will prove this now. The advantage of this invariance is that once we know the local field potential, we do not need to worry about change in the far away charge distribution after Lorentz transformation. The local four vector field potential contains the complete information about the far-away charges. The same is not true for the approximate potential given in the Feynman Lecture.

The equation given in Feynman lecture is different because it is inside an integral. When doing this integral, the required factor will turn up, due to apparent change in the size of a body approaching or receding from us, due to the fact that speed of light is finite for all observers. If speed of light was infinite, approaching bodies will not appear larger, and so would charge.

Let there be a source charge stationary at point $(x_0,y_0,0)$ with respect to an observer O at origin. Let there be another observer $O'$ at origin, moving with velocity $\mathbf{v}=-v \hat{i}$. Then locally, in the $O'$ frame, one would expect that $$\varphi '=\gamma \varphi$$, where $\gamma=\frac{1}{\sqrt{1-v^2}}$.

Let us see whether the same result follows from LW potential in the new frame.

In the frame O,
$$\varphi(0, t) = \frac{1}{4 \pi \epsilon_0} \left(\frac{q}{|\mathbf{r}|}\right)$$
In the new frame,
$$\varphi \prime(0, t) = \frac{1}{4 \pi \epsilon_0} \left(\frac{q}{(1 - \mathbf{n'} \cdot \mathbf{v})|\mathbf{r'}|}\right)$$

Note that in the new frame, $x_0 '=\gamma (x_0 + vr)$ where $r=\sqrt {x_0^2+y_0^2}$. Also, $y_0\prime=y_0$.
Also, Note that $$(1 - \mathbf{n'} \cdot \mathbf{v})=1-\frac{v x_0 '}{r'}$$, where $r'=\sqrt {x_0 '^2+y_0^2}$
$$=\frac{r\prime -v x_0\prime}{r\prime}$$
$$\implies \varphi \prime(0, t) = \frac{1}{4 \pi \epsilon_0} \left(\frac{q}{r\prime -v x_0\prime }\right)$$
Now, $r\prime -x_0\prime v$ turns out to be equal to $\frac {r}{\gamma}$, as can be checked. Hence, we obtain the expected equation.

Part 2: The actual Proof

Let there be a source charge stationary at point $(x_0,y_0,0)$ with respect to an observer O at origin. Let there be a charge $q$ at the origin moving with the velocity $\mathbf{u'}=u_x \hat{i}+u_y \hat{j}$ w.r.t O. Let there be another observer $O'$ at origin, moving with velocity $\mathbf{v}=v \hat{i}$, such that in the O' frame, the source appears to move with velocity $-v \hat{i}$.

What we are going to prove is, if we assume that for a stationary source, electrostatic force is as given by coulomb's law, then a Lorentz transformation of this force from frame O to O' is equivalent to the force arising from electromagnetic fields generated when solving the same result using the LW potential.

According to electromagnetic theory:-
$$\mathbf{E}=-\nabla \varphi - \frac{\partial \mathbf{A}}{\partial t}$$
$$\mathbf{B}=\nabla \times \mathbf{A}$$
$$ \mathbf{F}\propto \left( \mathbf{E} + \mathbf{B} \times \mathbf{v}\right)$$
Now, $\nabla \varphi = \varphi _x +\varphi _y + \varphi _z$. Also, 
$$\varphi_x= \frac{-x_0}{r^3}$$
$$\varphi_y= \frac{-y_0}{r^3}$$
$$\varphi_z=0$$
Then, in the O' frame,
$$\varphi '=\gamma \varphi$$
$$\mathbf{A}=-\gamma v \varphi$$
$$-\varphi _x' - \frac{\partial \mathbf{A'}}{\partial t} = \frac {\varphi \prime _x}{\gamma} = \varphi _x$$, because $\frac {\partial \mathbf {A'}}{\partial t}$ cancels the time component of $\varphi \prime _x'$ as seen in the O frame.
$$\implies \mathbf{E'}=\varphi _x \hat{i} + \gamma \varphi _y \hat{j}$$
Also, $$\mathbf{B'}=-v \gamma \varphi _y \hat{k}$$
$$\mathbf {F'}=q \left( \mathbf{E'} + \mathbf {B'}(u_y \prime \hat{i} - u_x \prime \hat{j})  \right)$$
Note that $u_y \prime$ and $u_x \prime$ are the velocity components of the second charge placed at origin, as seen in the O' frame.
Now in the frame O, $F=\varphi _x \hat{i} + \varphi _y \hat{j}= F_x \hat{i}+F_y \hat{j}$.
Let $ F'=F'_{x'} \hat{i}+ F'_y \hat{j}$.

Then $$F'_{x'}=F_x - \frac {F_y v u_y}{1- v u_x}$$.
Also, $$F'_y= \gamma F_y + v \gamma F_y \left( \frac {u_x -v}{1- v u_x} \right)$$
$$\implies F'_y= \frac {F_y}{\gamma (1-v u_x)}$$

This is the force that you would get when you apply Lorentz transformation from frame O to O'.

Concluding points

  1. The LW potentials are completely part of classical electromagnetism. But it can be noted that classical electromagnetism is not consistent under Galilean transformation. The correct transformation that preserves invariance of experimental results between inertial frames is that given by Lorentz transformation, which was discovered before Einstein's result.
  2. Hence magnetism is nothing more than relativistic effects on Force acting on a moving body as viewed from a frame of reference other than the one in which the object is stationary. It should be noted that this should apply for other forces of nature as well. For example gravito-magnetism is as real an effect.
  3. What I did in this post was what was historically done in the reverse order. Relativity was deduced to make electromagnetism consistent. I, instead, assumed SR and derived EM as a consequence. It should be noted though that SR is more fundamental than EM.

Saturday 18 May 2019

Axioms of Set theory

A very good video explaining the axioms of Set theory. https://youtu.be/zcvsyL7GtH4

Following are the 9 axioms of Set Theory.

  1. Axiom of extensionality: Two sets are equal (are the same set) if they have the same elements.
  2. Axiom of regularity (also called the Axiom of foundation): If element of a set X is another set Y, then X and Y are disjoint. This means Y cannot contain any elements that X contains, including Y itself. Hence, this prevents Y from containing itself.
  3. Axiom schema of specification (also called the axiom schema of separation or of restricted comprehension): In order to avoid paradoxes, if you are building a set X by describing some property of its elements, then you need to specify another set Y from where you need to pick the elements of X in question. (Note: Wikipedia says that this restriction is necessary to avoid Russell's paradox, but I do not see how, since the Paradox can be solved simply by applying the axiom of Regularity- Set that contains itself does not exist in the first place. Russell's paradox questions the existence of set of all sets that do not contain themselves.)
  4. Axiom of pairing: If x and y are sets, then there exists a set which contains x and y as elements.
  5. Axiom of union: The union over the elements of a set exists
  6. Axiom schema of replacement: When a function f acts on a set X, the image or the range of X is also a set.
  7. Axiom of infinity: There exist a set of infinite size.
  8. Axiom of power set: Power set exists for every set.
  9. Axiom of choice: Given a set X that contains other non empty sets $Y_1, Y_2, Y_3$ etc. within it. Then there exists a function f from X to $Y_1 \cup Y_2 \cup Y_3 \cup...$ such that $f(Y_n) \in Y_n$.

Proof of Godel's incompleteness Theorem

 This proof borrows from the book 'Godel's proof' by Nagel and Newman.

First Incompleteness theorem:


The first incompleteness theorem states that 'Any consistent formal system F within which a certain amount of elementary arithmetic can be carried out is incomplete; i.e., there are statements of the language of F which can neither be proved nor disproved in F.'
  1. Uniquely number every theorem in a system of mathematics. We can do this by assigning 10 symbols like $0$,$1$,$\subset$ (short for ‘if . . . then . . .’) etc a number between 1 to 10. Also, we can assign numerical variables, sentential variables and predicate variables to first, second and third powers of prime numbers greater or equal to 11, respectively. Note that the use of prime number ensures that all these variables get assigned a unique number, because otherwise $11^2$ is nothing but 121. Another way of doing it could have been assigning 11, 12 and 13 with variable of each type sequentially, and so on.
  2. Now that each letter of the language has a unique numerical representation, we assign a number to a formula by raising the nth prime to the power of the numerical value of the nth symbol, and taking product of all such numbers. Strings of formulas separated by comma is also a formula.
  3. The fact that a formula with Godel number x is the proof of a Godel number z hints towards a mathematical property that these two numbers share in the general case. This mathematical property depends upon the syntax as well as the rule of inference of the proof system. Let us denote the fact that the numbers x and z share this specific property by the formula $ Dem(x,z)$. This formula might not be true for a given value of x and z. Note that here, $Dem$ is a predicate variable.
  4. Next, let $sub(m,13,m)$ denote the Godel number of the formula that you get when you replace in the formula with Godel number m, each occurrence of the variable with Godel number 13 (here y) the number m.
  5. Now, note the meaning of $\sim \exists x Dem(x,sub(n,13,n))$ where n is Godel number of the statement $\sim \exists x Dem(x,sub(y,13,y))$. Note that $sub(n,13,n)$ is just the Godel number of the first formula. Hence, the first formula states that it cannot be proven within the system. (Although seen at mathematical level, the first formula just claims that a number does not have a given mathematical property. It is only when interpreted at metamathematical level that we infer a second layer of meaning to this statement- that it is claiming that it cannot be proven within the system). Now, if this statement is false, then it can be proven. A system of mathematics that can prove even false results is inconsistent. Hence, it must be true, and hence cannot be proven.
  6. Note that $\sim \exists x Dem(x,sub(n,13,n))$ expresses a property that no number possesses. Hence, it represents a new axiom of mathematics.

Notes:-
  1. If there are finitely many rules of inference, then Godel's theorem holds. If there are infinitely many of them, then we get another Godel like situation, where instead of infinite axioms, there infinite rules of inference. But the question is, is there an equivalent formulation of logic where there are only axioms but no rule of inference? I don't think so. Rules of inference are necessary to get from one axiom to another. They can be encompassed by a property that Godel numbers possess, as given by $Dem(x,z)$. Hence the rules of inference link numbers based on some property. Godel statement of a system of axioms is a statement that numbers don't possess a certain property.
  2. How does the choice of syntax affect the numerical relations as in $Dem(x,z)$? There should be some invariance regarding some general properties of these relations, independent of the choice of syntax.

Second Incompleteness theorem


The second incompleteness theorem states that 'Assume F is a consistent formalized system which contains elementary arithmetic. Then the consistency of F is not provable within F.'

The proof of the second incompleteness theorem follows from the first one. Even though the Godel statement G of a system F states that it is not provable, the reason why we cannot prove it within the system is that we are assuming that our system is consistent, and hence cannot prove a statement which says that it is not provable. Hence the consistency of F implies its incompleteness. The last sentence itself can be formalized as a theorem in F, and can also be proven within F. The proof follows because if we know that our system is consistent, then there is no more barrier in proving that G must be true. It is the lack of proof of consistency that prevents a proof a G. But if G is proven, then we will get an absurd result. Hence if F is consistent, its consistency itself is necessarily not provable to establish the non-provability of G.