Appendix A: Definitions, Derivations and Tricks

Paleomagnetism is famous for its use of a large number of incomprehensible acronyms. Here we have them gathered together along with definitions and the Section numbers where they are explained in more detail. You will find here a table of physical constants and paleomagnetic parameters used in the text as well as a table listing common statistics used in paleomagnetism. After the tables, there are a few sections with useful mathematical tricks.

A.1Definitions¶

Table A.1:Acronyms in paleomagnetism.

Acronym	Definition: Section #
AMS	Anisotropy of magnetic susceptibility: Section 13.1
APWP	Apparent polar wander path: Section 16.2
AF	Alternating field demagnetization: Section 9.1.4
ARM	Anhysteretic remanent magnetization: Section 7.10
ChRM	Characteristic remanent magnetization: Section 9.1.5
CNS	Cretaceous Normal Superchron: Section 15.1
CRM	Chemical remanent magnetization: Section 7.5
DGRF	Definitive geomagnetic reference field: Section 2.2
DRM	Detrital remanent magnetization: Section 7.6
E/I	Elongation/inclination correction method: Section 16.4
FC	Field cooled: Section 8.8.4
GAD	Geocentric axial dipole: Section 2.3
GHA	Greenwich hour angle: Section A.3.8
GPTS	Geomagnetic polarity time scale: Chapter 15
GRM	Gyroremanent magnetization: Section 7.10
IGRF	International geomagnetic reference field: Section 2.2
IZZI	Infield-zero field/ zero field-infield paleointensity protocol: Section 10.1.1.1
IRM	Isothermal remanent magnetization: Section 5.2.1 and Section 7.7
MD	Multidomain: Chapter 4
MDF	Median destructive field: Section 8.2
MDT	Median destructive temperature: Section 8.2
NRM	Natural remanent magnetization: Chapter 7
pARM	Partial anhysteretic remanence: Section 7.10
pDRM	Post-depositional detrital remanent magnetization: Section 7.6
PSD	Pseudo-single domain: Chapter 4
PSV	Paleosecular variation of the geomagnetic field: Section 14.1
pTRM	Partial thermal remanence: Section 7.4
sIRM	Saturation IRM: See $M_r$
SD	Single domain: Chapter 4
SP	Superparamagnetic: Section 4.3
SV	Secular variation: Section 14.1
TRM	Thermal remanent magnetization: Section 7.4
VADM	Virtual axial dipole moment: Section 2.4.3
VDM	Virtual dipole moment: Section 2.4.3; Equation 2.21
VDS	Vector difference sum: Section 9.1.6
VGP	Virtual geomagnetic pole: Section 2.4.2
VRM	Viscous remanent magnetization: Section 7.3
SQUID	Superconducting quantum interference device: Section 9.1.2
UT	Universal time (Greenwich mean time): Section A.3.8
ZFC	Zero-field cooled: Section 8.8.4

Table A.2:Physical Parameters and Constants.

Symbol	Definition: Section #
$\chi$	Magnetic susceptibility: The slope relating induced magnetization to an applied field: Section 1.5
$\chi_{ARM}$	ARM susceptibility: Section 8.6
$\chi_b$	Bulk magnetic susceptibility: Section 1.5; Equation 1.4
$\chi_d$	Diamagnetic susceptibility: Section 3.2.1
$\chi_f$	Ferromagnetic susceptibility: Section 3.3
$\chi_{fd}$	Frequency dependent: Section 8.3.3
$\chi_h$	High-frequency susceptibility: Section 8.8.2
$\chi_{hf}$	High-field susceptibility: Section 5.2.2
$\chi_{i}$	Initial susceptibility: Section 5.2.2
$\chi_l$	Low-frequency susceptibility: Section 8.8.2
$\chi_p$	Paramagnetic susceptibility: Section 3.2.2
$\delta_{FC}$	Verwey transition temperature jump while cooling in a field: Section 8.8.4
$\delta_{ZFC}$	Verwey transition temperature jump while cooling in zero field: Section 8.8.4
$\Delta M$ curve	Curve defined by subtracting the ascending from the descending curves in a hysteresis loop: Section 5.2.1
$\lambda,\phi$	Latitude, Longitude
$\mu_o$	Permeability of free space: (4 $\pi$ x 10 $^{-7}$ Hm $^{-1}$ ): Section 1.6
$\tau$	Relaxation time: Section 4.3; Equation 4.18
$\theta_m$	Magnetic co-latitude: Section 2.4; Equation 2.13
$\theta$	Co-latitude: Section 1.8
$a_{ij}$	Direction cosines: Section A.3.5.1
$[a_m]$	Magnetic activity: Section 10.1.2
$a$	The radius of the Earth (6.371 x 10 $^6$ m): Section 2.2
$\mathbf{B}$	Magnetic induction: Section 1.3
$C$	Frequency factor (10 $^{10}$ s $^{-1}$ ): Section 4.3
$D$	Declination: Section 2.1; Equation 2.4
$E$	Elongation: Table 8.1
$I$	Inclination: Section 2.1; Equation 2.4
$g_m^l, h_m^l$	Gauss coefficients: Section 2.2
$\mathbf{H}$	Magnetic field: Section 1.1
$H_{cr}$	Coercivity of remanence; field required to reduce saturation IRM to zero: Section 5.1
$H_c$	Coercivity; the magnetic field required to change the magnetic moment of a particle from one easy axis to another: Section 5.1
$k_B$	Boltzmann’s constant (1.381 x 10 $^{-23}$ JK $^{-1}$ ): Section 3.2.2
$K_i$	AMS measurement: Section D.1
$K_u$	Constant of uniaxial anisotropy energy: Paragraph and Section 4.1.5
$\mathbf{m}$	Magnetic moment: Section 1.2
$\mu_b$	Bohr magneton (9.27 x 10 $^{-24}$ Am $^2$ ): Section 3.1
$\mathbf{M}$	Magnetization: Section 1.5
$M_{eq}$	Equilibrium magnetization: Section 7.3
$M_r$	Saturation remanence (also sIRM): Section 5.2.1
$M_s$	Saturation magnetization; the magnetization measured in the presence of a saturating field: Section 5.2.1
$P_l^m$	Schmidt polynomials: Section 2.2
$\mathbf{s}$	Six elements of $\chi_{ij}$ ; $s_1=\chi_{11}, s_2=\chi_{22}, s_3=\chi_{33}, s_4=\chi_{12}, s_5=\chi_{23}, s_6=\chi_{13}$ : Section 13.1; Equation 13.3
$R_x$	IRM cross-over value: Section 8.4.1
$T$	Absolute temperature (in kelvin)
$T_b$	Blocking temperature: Section 7.4
$T_c$	Curie (Néel) temperature: Section 3.3, Section 8.2
$T_h$	Hopkinson Effect: Section 8.2
$T_m$	Morin transition: Section 8.2
$T_o$	Absolute zero: Section 3.3
$T_p$	Pyrrhotite transition: Section 8.2
$T_v$	Verwey temperature: Paragraph, Section 8.2
$v$	Volume
$v_b$	Blocking volume: Section 7.5

Table A.3:Common statistics in paleomagnetism.

Statistic	Definition: Section #
$\alpha_{95}$	Radius of circle (cone) of 95% confidence (Fisher): Section 11.2.1, Equation 11.17
$\delta$	Residual errors for AMS measurements: Section 13.1, Equation 13.11
$\epsilon_{ij}$	Semi-angles of Hext uncertainty ellipses: Section 13.1, Equation 13.17
$\kappa$	Fisher precision parameter: Section 11.2.1, Equation 11.9
$\eta_{95}, \zeta_{95}$	Semi-angles of directional 95% uncertainty ellipses: Section C.2.4, Equation C.17
$\boldsymbol{\tau}, \mathbf{V}$	Eigenvalues and eigenvectors of tensors: Section A.3.5.4, Equation A.48
$k$	Estimate of $\kappa$ : Section 11.2.1, Equation 11.16
CSD	Circular standard deviation (Fisher): Section 11.2.1, Equation 11.20
$dm$	Uncertainty in the meridian (longitude) of a paleomagnetic pole: Section 11.2.1, Equation 11.22
$dp$	Uncertainty in the parallel (latitude) of a paleomagnetic pole: Section 11.2.1, Equation 11.22
$F, F_{12}, F_{23}$	Significance tests for anisotropy (Hext): Section 13.2.1, Equation 13.19
MAD	Maximum angular deviation of principal eigenvector (Kirschvink): Section 9.1.7, Equation 9.1
MAD $_{plane}$	MAD of the pole to a best-fit plane (Kirschvink): Section 9.1.7, Equation 9.2
$M_u, M_e$	Significance tests for uniform and exponential distributions: Section 11.5, Section B.1.5, Equation B.6, and Equation B.7
$N$	Number of samples, specimens or sites
$n_f$	Number of degrees of freedom: Section 13.2, Equation 13.12
$R$	Resultant vector length of unit vectors: Section 11.2.1, Equation 11.14
$R_o$	Critical value of $R$ for non-random distribution (Watson): Section 11.3.1, Equation 11.23
$S_f$	Scatter of VGPs - corrected for within site scatter: Equation 14.3
$S_p$	Scatter of VGPs: Equation 14.2
$S_o$	Residual sum of squares of errors (Hext): Section 13.2, Equation 13.12
$\mathbf{T}$	Orientation tensor: Section A.3.5.4, Equation A.47

A.2Derivations¶

A.2.1Langevin function for a paramagnetic substance¶

Here we derive the Langevin function for a paramagnetic substance with magnetic moments $m$ in an applied field $H$ at temperature $T$ . If we make the assumption that there is no preferred alignment within the substance, we can assume that the number of moments ( $n(\alpha)$ ) between angles $\alpha$ and $\alpha + d\alpha$ with respect to $\mathbf{H}$ is proportional to the solid angle $\sin\alpha d\alpha$ and the probability density function, i.e.

n(\alpha) d\alpha \propto \exp \bigl({ -E_m\over {k_BT}} \bigr) \sin \alpha d\alpha,

(A.1)

where $E_m$ is the magnetic energy. When we measure the induced magnetization, we really measure only the component of the moment parallel to the applied field, or $n(\alpha) m \cos\alpha$ . The net induced magnetization $M_I$ of a population of particles with volume $v$ is therefore:

M_I = {m\over v} \int_0^{\pi} n(\alpha)\cos \alpha d \alpha.

(A.2)

By definition, $n(\alpha)$ integrates to $N$ , the total number of moments, or

N = \int_0^{\pi} n(\alpha)d\alpha.

(A.3)

The total saturation moment of a given population of $N$ individual magnetic moments $m$ is $Nm$ . The saturation value of magnetization $M_s$ is thus $Nm$ normalized by the volume $v$ . Therefore, the magnetization expressed as the fraction of saturation is:

{M\over {M_s}} = { {\int_0^{\pi} n(\alpha ) \cos \alpha d\alpha}\over {\int_0^{\pi} n(\alpha )d\alpha }}

(A.4)

= { {\int_0^{\pi} e^{(m\mu_o H \cos \alpha )/k_BT}\cos \alpha \sin \alpha d\alpha}\over { \int_0^{\pi} e^{(m\mu_o H\cos \alpha )/k_BT}\sin \alpha d\alpha}}.

(A.5)

By substituting $a=m\mu_oH/k_BT$ and $\cos \alpha =x$ , we write

{M\over {M_s}} = N { {\int_{-1}^{1} e^{a x}xdx} \over {\int_{-1}^1 e^{a x}dx} } = \bigl( { {e^{a} + e^{-a}} \over {e^{a} - e^{-a}} } - {1\over{a} } \bigr),

(A.6)

and finally

{M\over {M_s}} = [\text{coth } a - {1\over{a}}]=\mathcal{L} (a).

(A.7)

A.2.2Superparamagnetism¶

The derivation of superparamagnetism follows closely that of paramagnetism whereby the probability of finding a magnetization vector an angle $\alpha$ away from the direction of the applied field is given by:

n(\alpha )d\alpha = 2\pi n_o e^{({{M_sBv\cos \alpha}\over {k_BT}})}\sin \alpha d\alpha.

(A.8)

The total magnetization contributed by the $N$ moments is:

{M\over {M_s}} = \int_0^{\pi} \cos \alpha n(\alpha )d\alpha.

(A.9)

Combining Equation A.8 and Equation A.9 we get:

{M\over {M_s}} = N { {\int_0^{\pi} n(\alpha ) \cos \alpha d\alpha}\over {\int_0^{\pi} n(\alpha )d\alpha }}

(A.10)

= N { {\int_0^{\pi} e^{(M_sBv\cos \alpha )/k_BT}\cos \alpha \sin \alpha d\alpha}\over { \int_0^{\pi} e^{(M_sBv\cos \alpha )/k_BT}\sin \alpha d\alpha}}.

(A.11)

By substituting $a= M_sBv/k_BT$ and $\cos \alpha =x$ , and remembering Equation A.7, we can write:

{M\over {M_s}} = N { {\int_1^{-1} e^{a x}xdx} \over {\int_1^{-1} e^{a x}dx} } = N\mathcal{L} (a).

(A.12)

So finally

{M\over {M_s}} = N\mathcal{L} (a).

(A.13)

A.3Useful tricks¶

In this section, we have assembled assorted mathematical and plotting techniques that come in handy throughout this book.

A.3.1Spherical trigonometry¶

Spherical trigonometry has widespread applications throughout the book. It is used in the transformations of observed directions to virtual poles (Chapter 2) and transformation of coordinate systems, to name a few. Here we summarize the two most useful relationships: the Law of Sines and the Law of Cosines.

Spherical triangle with vertices A, B, C on a globe, connected by great circle arcs a, b, c, with inset showing subtended angles on a unit sphere. — Figure A.1:Rules of spherical trigonometry. $a,b,c$ are all great circle tracks on a sphere which form a triangle with apices $A,B,C$ . The lengths of $a,b,c$ on a unit sphere are equal to the angles subtended by radii that intersect the globe at the apices, as shown in the inset. $\alpha,\beta,\gamma$ are the angles between the great circles.

In Figure A.1, $\alpha, \beta$ and $\gamma$ are the angles between the great circles labelled $a$ , $b$ , and $c$ . On a unit sphere, $a,b$ and $c$ are also the angles subtended by radii that intersect the globe at the apices A, B, and C (see inset on Figure A.1). Two formulae from spherical trigonometry come in handy in paleomagnetism, the Law of Sines:

{\sin \alpha \over \sin a}={\sin \beta \over \sin b}={\sin \gamma \over \sin c},

(A.14)

and the Law of Cosines:

\cos a = \cos b \cos c + \sin b \sin c \cos \alpha.

(A.15)

A.3.2Vector addition¶

Two vectors A and B in an X-Y coordinate system with their x and y components shown, angles to the X axis labeled, and resultant vector C from their addition. — Figure A.2:Vectors $\mathbf{A}$ and $\mathbf{B}$ , their components A $_{x,y}$ , B $_{x,y}$ and the angles between them and the $X$ axis, $\alpha$ and $\beta$ . The angle between the two vectors is $\alpha-\beta = \Delta$ . Unit vectors in the directions of the axes are $\hat x$ and $\hat y$ respectively.

To add the two vectors (see Figure A.2) $\mathbf{A}$ and $\mathbf{B}$ , we break each vector into components $A_{x,y}$ and $B_{x,y}$ . For example, $A_x=|A|\cos{\alpha}, A_y=|A|\sin{\alpha}$ where $|A|$ is the length of the vector $\mathbf{A}$ . The components of the resultant vector $\mathbf{C}$ are: $C_x = A_x +B_x, C_y=A_y+B_y$ . These can be converted back to polar coordinates of magnitude and angles if desired, whereby:

|C| = \sqrt {C_x^2+C_y^2} \text{ and } \gamma= \cos^{-1} { {C_x}\over{ {|C|}}}.

(A.16)

A.3.3Vector subtraction¶

To subtract two vectors, compute the components as in addition, but the components of the vector difference $\mathbf{C}$ are: $C_x = A_x -B_x, C_y=A_y-B_y$ .

A.3.4Vector multiplication¶

There are two ways to multiply vectors. The first is the dot product whereby $\mathbf{A} \cdot \mathbf{B}= A_xB_x + A_yB_y$ . This is a scalar and is actually the cosine of the angle between the two vectors if the $\mathbf{A}$ and $\mathbf{B}$ are taken as unit vectors (assume a magnitude of unity in the component calculation).

Three-dimensional diagram with vectors A and B in a plane separated by angle theta, and their cross product vector C pointing perpendicular to the plane. — Figure A.3:Illustration of cross product of vectors $A$ and $B$ separated by angle $\theta$ to get the orthogonal vector $C$ .

The other way to perform vector multiplication is the cross product (see Figure A.3), which produces a vector orthogonal to both $\mathbf{A}$ and $\mathbf{B}$ and whose components are given by:

C = \det \begin{vmatrix} \hat x & \hat y & \hat z \\ A_x & A_y & A_z \\ B_x & B_y & B_z \end{vmatrix}.

(A.17)

To calculate the determinant, we follow these rules:

C_x=A_yB_z - A_zB_y, \quad C_y=A_zB_x - A_xB_z, \quad C_z=A_xB_y - A_yB_x.

(A.18)

C_i = A_jB_k-A_kB_j \quad i\neq j \neq k.

(A.19)

A.3.5Tricks with tensors¶

Vectors belong to a more general concept called tensors. While a vector describes a magnitude of something in a given direction, tensors allow calculation of magnitudes as a function of orientation. Velocity is a vector relating speed to direction, but speed may change depending on direction, so we might need a tensor to calculate speed as a function of direction. Many properties in Earth science require tensors, like the indicatrix in mineralogy which relates the speed of light to crystallographic direction, or the relationship between stress and strain. Tensors in paleomagnetism are used, for example, to transform coordinate systems and to characterize the anisotropy of magnetic properties such as susceptibility. We will cover transformation of coordinate systems in the following.

A.3.5.1Direction cosines¶

We use direction cosines in paleomagnetism in a variety of applications, from mineralogy to transformation from specimen to geographic or stratigraphic coordinate systems. Direction cosines are the cosines of the angles between different axes in given coordinate systems, here $X$ and $X'$ respectively (see, e.g., Figure A.4a). The direction cosine $a_{12}$ is the cosine of the angle between the $X_1$ and the $X'_2$ , $\alpha_{12}$ axes. We can define four of these direction cosines to fully describe the relationship between the two coordinate systems:

a_{11} = \cos \alpha_{11}, \quad a_{21} = \cos \alpha_{21},

(A.20)

a_{12} = \cos \alpha_{12}, \quad a_{22} = \cos \alpha_{22}.

(A.21)

The first subscript always refers to the $X$ system and the second refers to the $X'$ .

Two-panel diagram. a) Vector R in the X1-X2 coordinate system at angle alpha. b) Two coordinate systems X and X-prime with direction cosine angles alpha-11, alpha-12, alpha-21, alpha-22 between their axes. — Figure A.4:Definition of direction cosines in two dimensions. a) Definition of vector in one set of coordinates, $x_1, x_2$ . b) Definition of angles relating $X$ axes to $X'$ .

A.3.5.2Changing coordinate systems¶

One application of using direction cosines is the transformation of coordinates systems from one set ( $X$ ) to a new set $X'$ . To find new coordinates $x'_1, x'_2,..$ from the old ( $x_1, x_2,...$ ), we have:

x_1' = a_{11} x_1 + a_{12} x_2, \quad x_2' = a_{21} x_2 + a_{22} x_2.

(A.22)

In three dimensions we have:

x_1' = a_{11} x_1 + a_{12} x_2 + a_{13} x_3,

(A.23)

x_2' = a_{21} x_1 + a_{22} x_2 + a_{23} x_3,

(A.24)

x_3' = a_{31} x_1+ a_{32} x_2 + a_{33} x_3,

(A.25)

which can also be written as:

\begin{pmatrix}x'_1\\ x'_2\\ x'_3\end{pmatrix} = \begin{pmatrix} a_{11}&a_{12}&a_{13}\\ a_{21}&a_{22}&a_{23}\\ a_{31}&a_{32}&a_{33} \end{pmatrix} \begin{pmatrix}x_1\\ x_2\\ x_3\end{pmatrix},

(A.26)

with a short cut notation as: $x'_i = a_{ij} x_j$ . However we write this, it means that for each axis $i$ , just sum through the $j$ ’s for all the dimensions. The matrix $a_{ij}$ is an example of a 3 x 3 tensor and equations of the form $A_i = B_{ij} C_j$ relating two vectors with a tensor will be used throughout the book. A more common notation is with bold-faced variables which indicate vectors or tensors, e.g., $\mathbf{A} = \mathbf{B} \cdot \mathbf{C}$ .

Two-panel diagram. a) Sample cube with right-hand coordinate axes X1, X2, X3. b) Sphere showing X and X-prime coordinate systems with angle alpha between corresponding axes and spherical triangle relating the two systems. — Figure A.5:a) Sample coordinate system. b) Trigonometric relations between two cartesian coordinate systems, $\mathbf{X}_i$ and $\mathbf{X}'_i$ . $\lambda,\phi,\psi$ are all known and the angles between the various axes can be calculated using spherical trigonometry. For example, the angle $\alpha$ between $\mathbf{X}_1$ and $\mathbf{X}_1'$ forms one side of the triangle shown by dash-dot lines. Thus, $\cos \alpha = \cos \lambda \cos \phi + \sin \lambda \sin \phi \cos \psi$ . [Figure from Tauxe (1998).]

Now we would like to apply this to changing coordinate systems for a paleomagnetic specimen in the most general case. The specimen coordinate system is defined by a right-hand rule where the thumb ( $\mathbf{X}_1$ ) is directed parallel to an arrow marked on the sample, the index finger ( $\mathbf{X}_2$ ) is in the same plane but at right angles and clockwise to $\mathbf{X}_1$ and the middle finger ( $\mathbf{X}_3$ ) is perpendicular to the other two (Figure A.5a). The transformation of coordinates ( $x_i$ ) from the $\mathbf{X}_i$ axes to the coordinates in the desired $\mathbf{X}'$ coordinate system requires the determination of the direction cosines as described in Section A.3.5.1. The various $a_{ij}$ can be calculated using spherical trigonometry as in Section A.3.1. For example, $a_{11}$ for the general case depicted in Figure A.5 is $\cos \alpha$ , which is given by the Law of Cosines (see Section A.3.1) by using appropriate values, or:

\cos \alpha = \cos \lambda \cos \phi + \sin \lambda \sin \phi \cos \psi.

(A.27)

The other $a_{ij}$ can be calculated in a similar manner. In the case of most coordinate system rotations used in paleomagnetism, $X_2$ is in the same plane as $X'_1$ and $X'_2$ (and is horizontal) so $\psi$ = 90°. This problem is much simpler. The directions cosines for the case where $\psi = 90^{\circ}$ are:

a=\begin{pmatrix} \cos \lambda \cos \phi & - \sin \phi & - \sin \lambda \cos \phi\\ \cos \lambda \sin \phi & \cos \phi & - \sin \lambda \sin \phi \\ \sin \lambda & 0& \cos \lambda \end{pmatrix}.

(A.28)

The new coordinates can be obtained from Equation A.26, as follows:

x'_1 = a_{11}x_1 + a_{12}x_2 + a_{13}x_3

(A.29)

x'_2 = a_{21}x_1 + a_{22}x_2 + a_{23}x_3

(A.30)

x'_3 = a_{31}x_1 + a_{32}x_2 + a_{33}x_3.

(A.31)

The declination and inclination can be calculated by inserting these values in the equations in Chapter 2.

A.3.5.3Method for rotating points on a globe using finite rotation poles¶

Given the coordinates of the point on the globe $P_p$ with latitude $\lambda_p$ , longitude $\phi_p$ the finite rotation pole $P_f$ with latitude $\lambda_f$ , longitude $\phi_f$ , the way to transform coordinates is as follows (you should also review Section A.3.5.2).

Convert the latitudes and longitudes to cartesian coordinates by:

P_1=\cos\phi \cos \lambda, \quad P_2 = \sin \phi \cos \lambda, \quad P_3 = \sin \lambda

(A.32)

where $P$ is the point of interest.

Set up the rotation matrix $R$ as:

R_{11} = P_{f1}P_{f1}(1-\cos \Omega) + \cos \Omega

(A.33)

R_{12} = P_{f1}P_{f2}(1-\cos \Omega) - P_{f3} \sin \Omega

(A.34)

R_{13} = P_{f1}P_{f3}(1-\cos \Omega) + P_{f2}\sin \Omega

(A.35)

R_{21} = P_{f2}P_{f1}(1-\cos \Omega) + P_{f3}\sin \Omega

(A.36)

R_{22} = P_{f2}P_{f2}(1-\cos \Omega) + \cos \Omega

(A.37)

R_{23} = P_{f2}P_{f3}(1-\cos \Omega) - P_{f1}\sin \Omega

(A.38)

R_{31} = P_{f3}P_{f1}(1-\cos \Omega) - P_{f2} \sin \Omega

(A.39)

R_{32} = P_{f3}P_{f2}(1-\cos \Omega) + P_{f1}\sin \Omega

(A.40)

R_{33} = P_{f3}P_{f3}(1-\cos \Omega) + \cos \Omega

(A.41)

The coordinates of the transformed pole ( $P_t$ ) are:

P_{t1} = R_{11} P_{p1} + R_{12}P_{p2}+R_{13}P_{p3}

(A.42)

P_{t2} = R_{21} P_{p1} + R_{22}P_{p2}+R_{23}P_{p3}

(A.43)

P_{t3} = R_{31} P_{p1} + R_{32}P_{p2}+R_{33}P_{p3}

(A.44)

which can be converted back into latitude and longitude in the usual way (see Chapter 2).

A.3.5.4The orientation tensor and eigenvectors¶

The orientation tensor $\mathbf{T}$ Scheidegger, 1965 (also known as the matrix of sums of squares and products), is extremely useful in paleomagnetism. This is found as follows:

Convert the $D$ , $I$ , and $M$ for a set of data points (e.g., a sequence of demagnetization data, or a set of geomagnetic vectors or unit vectors where $M=1$ ) to corresponding $x_i$ values (see Chapter 2).
Calculate the coordinates of the “center of mass” ( $\bar x$ ) of the data points:

\bar x_1 = {1\over N} (\sum_{1}^{N} x_{1i}); \quad \bar x_2 = {1\over N} (\sum_{1}^{N} x_{2i}); \quad \bar x_3 = {1\over N} (\sum_{1}^{N} x_{3i}),

(A.45)

where $N$ is the number of data points involved. Note that for unit vectors, the center of mass is the same as the Fisher mean (Chapter 11).

Transform the origin of the data cluster to the center of mass:

x_{1i}'=x_{1i}-\bar x_1; \quad x_{2i}'=x_{2i}-\bar x_2; \quad x_{3i}'=x_{3i}-\bar x_3,

(A.46)

where $x'_i$ are the transformed coordinates.

The orientation matrix is defined as:

\mathbf{T}=\begin{pmatrix}\sum x'_{1i}x'_{1i}&\sum x'_{1i}x'_{2i}&\sum x'_{1i}x'_{3i}\\ \sum x'_{2i}x'_{1i}&\sum x'_{2i}x'_{2i}&\sum x'_{2i}x'_{3i}\\ \sum x'_{3i}x'_{1i}&\sum x'_{3i}x'_{2i}&\sum x'_{3i}x'_{3i}\end{pmatrix}.

(A.47)

$\mathbf{T}$ is a 3 x 3 matrix, where only six of the nine elements are independent. It is constructed in some coordinate system, such as the geographic or sample coordinate system. Usually, none of the six independent elements are zero. There exists, however, a coordinate system along which the “off-axis” terms are zero and the axes of this coordinate system are called the eigenvectors of the matrix. The three elements of $\mathbf{T}$ in the eigenvector coordinate system are called eigenvalues. In terms of linear algebra, this idea can be expressed as:

\mathbf{T} \mathbf{V} = \boldsymbol{\tau} \mathbf{V},

(A.48)

where $\mathbf{V}$ is the matrix containing three eigenvectors and $\boldsymbol{\tau}$ is the diagonal matrix containing three eigenvalues. Equation A.48 is only true if:

\text{det} | \mathbf{T} - \boldsymbol{\tau} | = 0.

(A.49)

If we expand Equation A.49, we have a third degree polynomial whose roots ( $\tau$ ) are the eigenvalues:

(T_{11}-\tau)[(T_{22}-\tau)(T_{33}-\tau) - T_{23}^2] -

(A.50)

T_{12}[T_{12}(T_{33}-\tau) - T_{13}T_{23}] + T_{13}[T_{13}T_{23}-T_{13}(T_{22}-\tau)] = 0.

(A.51)

The three possible values of $\tau$ ( $\tau_1, \tau_2, \tau_3$ ) can be found with iteration and determination. In practice, there are many programs for calculating $\boldsymbol{\tau}$ . My personal favorite is the Numpy Module for Python (see many free websites, especially Scientific Python (SciPy) for hints). Please note that the conventions adopted here are to scale the $\tau$ ’s such that they sum to one; the largest eigenvalue is termed $\tau_1$ and corresponds to the eigenvector $\mathbf{V}_1$ .

Inserting the values for the transformed components calculated in Equation A.46 into $\mathbf{T}$ gives the covariance matrix for the demagnetization data. The direction of the axis associated with the greatest scatter in the data (the principal eigenvector $\mathbf{V}_1$ ) corresponds to a best-fit line through the data. This is usually taken to be the direction of the component in question. This direction also corresponds to the axis around which the “moment of inertia” is least. The eigenvalues of $\mathbf{T}$ are the variances associated with each eigenvector. Thus the standard deviations are $\sigma_i=\sqrt{\tau_i}$ .

A.3.6Upside down triangles, $\nabla$ ¶

A.3.6.1Gradient¶

We often wish to differentiate a function along three orthogonal axes. For example, imagine we know the topography of a ski area (see Figure A.6). For every location (in say, $X$ and $Y$ coordinates), we know the height above sea level. This is a scalar function. Now imagine we want to build a ski resort, so we need to know the direction of steepest descent and the slope (red arrows in Figure A.6).

Photograph of a snow-covered ski slope with red arrows indicating the direction and magnitude of steepest descent on the terrain. — Figure A.6:Illustration of the relationship between a vector field (direction and magnitude of steepest slope at every point, e.g., red arrows) and the scalar field (height) of a ski slope.

To convert the scalar field (height versus position) to a vector field (direction and magnitude of greatest slope) mathematically, we would simply differentiate the topography function. Let’s say we had a very weird two dimensional, sinusoidal topography such that $z=f(x)=\sin x$ with $z$ the height and $x$ is the distance from some marker. The slope in the $x$ direction ( $\hat x$ ), then would be $\hat x {d \over {d x}}{ f(x) }$ . If $f(x,y,z)$ were a three dimensional topography then the gradient of the topography function would be:

(\hat x {{\partial} \over {\partial x}} f + \hat y {{\partial} \over {\partial y}} f + \hat z {{\partial} \over {\partial z}} f) .

(A.52)

For short hand, we define a “vector differential operator” to be a vector whose components are

\nabla = (\hat x {\partial \over {\partial x}}, \hat y {\partial \over {\partial y}}, \hat z {\partial \over {\partial z}}).

(A.53)

This can also be written in polar coordinates:

\nabla = {\partial \over {\partial r}}, {\partial \over {r\partial \theta}}, {\partial \over {r\sin \theta \partial \phi}}.

(A.54)

Arrows radiating outward from a central point with increasing magnitude, enclosed by a dashed box, illustrating a vector field with non-zero divergence. — Figure A.7:Example of a vector field with a non-zero divergence.

A.3.6.2Divergence¶

The divergence of a vector function (e.g. $\mathbf{H}$ ) is written as:

\nabla \cdot \mathbf{H}.

(A.55)

The trick here is to treat $\nabla$ as a vector and use the rules for dot products described in Section A.3.2. In cartesian coordinates, this is:

\nabla \cdot \mathbf{H} = \hat x {{\partial H_x} \over {\partial x}} + \hat y {{\partial H_y}\over {\partial y}} + \hat z {{\partial H_z}\over {\partial z}}.

(A.56)

Like all dot products, the divergence of a vector function is a scalar.

Uniform parallel arrows all pointing upward with equal magnitude, enclosed by a dashed box, illustrating a vector field with zero divergence. — Figure A.8:Example of a vector field with zero divergence.

Arrows circulating counterclockwise around a central point in the x-y plane along an elliptical path, with x, y, and z axes shown, illustrating a vector field with non-zero curl. — Figure A.9:Example of a vector field with non-zero curl.

The name divergence is well chosen because $\nabla \cdot \mathbf{H}$ is a measure of how much the vector field “spreads out” (diverges) from the point in question. In fact, what divergence quantifies is the balance between vectors coming in to a particular region versus those that go out. The example in Figure A.7 depicts a vector function whereby the magnitude of the vector increases linearly with distance away from the central point. An example of such a function would be $v(r)=r$ . The divergence of this function is:

\nabla \cdot v = {\partial \over {\partial r}} r = 1.

(A.57)

(a scalar). There are no arrows returning in to the dashed box, only vectors going out and the non-zero divergence quantifies this net flux out of the box.

Now consider Figure A.8, which depicts a vector function that is constant over space, i.e. $v(r) = k$ . The divergence of this function is:

\nabla \cdot v = {\partial \over {\partial r}} k = 0.

(A.58)

The zero divergence means that for every vector leaving the box, there is an equal and opposite vector coming in. Put another way, no net flux results in a zero divergence. The fact that the divergence of the magnetic field is zero means that there are no point sources (monopoles), as opposed to electrical fields that have divergence related to the presence of electrons or protons.

A.3.6.3Curl¶

The curl of the vector function $\mathbf{B}$ is defined as $\nabla \times \mathbf{B}$ . In cartesian coordinates we have

\nabla \times \mathbf{B} = \hat x ({\partial \over {\partial y}} B_z - {\partial \over {\partial z}} B_y) + \hat y ({\partial \over {\partial z}} B_x - {\partial \over {\partial x}} B_z) + \hat z ({\partial \over {\partial x}} B_y - {\partial \over {\partial y}} B_x).

(A.59)

Curl is a measure of how much the vector function “curls” around a given point. The function describing the velocity of water in a whirlpool has a significant curl, while that of a smoothly flowing stream does not.

Consider Figure A.9 which depicts a vector function $v=-y\hat x + x\hat y$ . The curl of this function is:

\nabla \times v = \det \begin{vmatrix} \hat x & \hat y & \hat z \\ {\partial \over {\partial x}} & {\partial \over {\partial y}} & {\partial \over {\partial z}}\\ -y & x & 0 \end{vmatrix},

(A.60)

\hat x ( {\partial \over {\partial y}} 0 - {\partial \over {\partial z}} x ) + \hat y ( {\partial \over {\partial x}} 0 - {\partial \over {\partial z}} (-y) ) + \hat z ( {\partial \over {\partial x}} x - {\partial \over {\partial y}} (-y) ).

(A.61)

= 0 \hat x + 0 \hat y + 2\hat z

(A.62)

So there is a positive curl in this function and the curl is a vector in the $\hat z$ direction.

The magnetic field has a non-zero curl in the presence of currents or changing electric fields. In free space, away from currents (lightning!!), the magnetic field has zero curl.

A.3.7The statistical bootstrap¶

Sometimes things just are not normal. Statistically that is. When you can not assume that your data follow some known distribution, like the normal distribution, or the Fisher distribution, what do you do? In this section, we outline a technique called the bootstrap, which allows us to make statistical inferences when parametric assumptions fail. The reader should also refer to Efron & Tibshirani (1993) for a more complete discussion.

Three-panel figure. a) Histogram of 500 data points from a Gaussian distribution. b) Q-Q plot showing data versus normal quantiles. c) Histogram of 10,000 bootstrapped means with 95% confidence bounds marked. — Figure A.10:Bootstrapping applied to a normal distribution. a) 500 data points are drawn from a Gaussian distribution with mean of 10 and a standard deviation of 2. b) Q-Q plot of data in a). The 95% confidence interval for the mean is given by Gauss statistics as ± 0.17. 10,000 new (para) data sets are generated by randomly drawing $N$ data points from the original data set shown in a). c) A histogram of the means from all the para-data sets. 95% of the means fall within the interval 10.06 $^{+0.16}_{-0.16}$ , hence the bootstrap confidence interval is similar to that calculated with Gaussian statistics. [Figure from Tauxe (1998).]

In Figure A.10, we illustrate the essentials of the statistical bootstrap. We will develop the technique using data drawn from a normal distribution. First, we generate a synthetic data set by drawing 500 data points from a normal distribution with a mean $\bar x$ of 10 and a standard deviation $\sigma$ of 2. The synthetic data are plotted as a histogram in Figure A.10a. In Figure A.10b we plot the data as a Q-Q plot (see Section B.1.5) against the $z_i$ expected for a normal distribution.

The data in Figure A.10a plot in a line on the Q-Q plot (Figure A.10b). The value for $D$ is 0.0306. Because $N=500$ , the critical value of $D$ , $D_c$ at the 95% confidence level is 0.0396. Happily, our normal distribution simulation program has produced a set of 500 numbers for which the null hypothesis of a normal distribution has not been rejected. The mean of the synthetic dataset is about 10 and the standard deviation is 1.9. The usual Gaussian statistics allow us to estimate a 95% confidence interval for the mean as $\pm 1.96 \sigma / \sqrt{N}$ or $\pm 0.17$ .

Sphere with North Pole at top showing site location L and sub-solar point S connected by spherical triangle with angles H, beta, beta-prime, theta, and solar declination delta. — Figure A.11:Calculation of the azimuth of the shadow direction ( $\beta'$ ) relative to true North, using a sun compass. L is the site location (at $\lambda_L,\phi_L$ ), S is the position on the Earth where the sun is directly overhead ( $\lambda_S,\phi_S$ ). [Figure from Tauxe (1998).]

In order to estimate a confidence interval for the mean using the bootstrap, we first randomly draw a list of $N$ data by selecting data points from the original data set. This list is called a pseudo-sample of the data. Some data points will be used more than once and others will not be used at all. We then calculate the mean of the pseudo-sample. We repeat the procedure of drawing pseudo-samples and calculating the mean many times (say 10,000 times). A histogram of the “bootstrapped” means is plotted in Figure A.10c. If these are sorted such that the first mean is the lowest and the last mean is the highest, the 95% of the means are between the 250 $^{th}$ and the 9,750 $^{th}$ mean. These therefore are the 95% confidence bounds because we are approximately 95% confident that the true mean lies between these limits. The 95% confidence interval calculated for the data in Figure A.10 by bootstrap is about ± 0.16 which is nearly the same as that calculated the Gaussian way. However, the bootstrap required orders of magnitude more calculations than the Gaussian method, hence it is ill-advised to perform a bootstrap calculation when a parametric one will do. Nonetheless, if the data are not Gaussian, the bootstrap provides a means of calculating confidence intervals when there is no quick and easy way. Furthermore, with a modern computer, the time required to calculate the bootstrap illustrated in Figure A.10 was virtually imperceptible.

A.3.8Directions using a sun compass¶

In a sun compass problem, we have the direction of the sun’s shadow and an angle between that and the desired direction ( $\alpha$ ). The declination of the shadow itself is 180° from the direction toward the sun. In Figure A.11, the problem of calculating declination from sun compass information is set up as a spherical trigonometry problem, similar to those introduced in Chapter 2 and Section A.3.1. The declination of the shadow direction $\beta'$ , is given by 180 - $\beta$ . We also know the latitude of the sampling location L ( $\lambda_L$ ). We need to calculate the latitude of S (the point on the Earth’s surface where the sun is directly overhead), and the local hour angle $H$ .

Knowing the time of observation (in Universal Time), the position of S ( $\lambda_s = \delta,\phi_s$ in Figure A.11) can be calculated with reasonable precision (to within 0.01°) for the period of time between 1950 and 2050 using the procedure recommended in the 1996 Astronomical Almanac:

First, calculate the Julian Day $J$ . Then, calculate the fraction of the day in Universal Time $U$ . Finally, calculate the parameter $d$ which is the number of days from J2000 by:

d= J - 2451545 + U.

(A.63)

The mean longitude of the sun ( $\phi_s$ ), corrected for aberration, can be estimated in degrees by:

\phi_s=280.461 + 0.9856474 d.

(A.64)

The mean anomaly $g=357.528 + 0.9856003 d$ (in degrees).
Put $\phi_s$ and $g$ in the range 0 → 360°.
The longitude of the ecliptic is given by $\phi_E=\phi_s + 1.915 \sin g + 0.020 \sin 2g$ (in degrees).
The obliquity of the ecliptic is given by $\epsilon = 23.439 - 0.0000004 d$ .
Calculate the right ascension ( $A$ ) by:

A = \phi_E - ft \sin 2\phi_E + (f/2) t^2 \sin 4 \phi_E,

(A.65)

where $f=180/\pi$ and $t=$ tan $^2\epsilon/2$ .

The so-called “declination” of the sun ( $\delta$ in Figure A.11 which should not be confused with the magnetic declination $D$ ), which we will use as the latitude $\lambda_s$ , is given by:

\delta = \sin^{-1}(\sin \epsilon \sin \phi_e).

(A.66)

Finally, the equation of time in degrees is given by $E= 4(\phi_s-A)$ .

We can now calculate the Greenwich Hour Angle $GHA$ from the Universal Time $U$ (in minutes) by $GHA = (U + E)/4 + 180$ . The local hour angle ( $H$ in Figure A.11) is $GHA + \phi_L$ . We calculate $\beta$ using the laws of spherical trigonometry (see Section A.3.1). First we calculate $\theta$ by the Law of Cosines (remembering that the cosine of the colatitude equals the sine of the latitude):

\cos \theta = \sin \lambda_L \sin \lambda_s + \cos \lambda_L \cos \lambda_s \cos H

(A.67)

and finally using the Law of Sines:

\sin \beta = (\cos \lambda_s \sin H)/\sin \theta.

(A.68)

If $\lambda_s<\lambda_L$ , then the required angle is the shadow direction $\beta'$ , given by: $\beta' =180-\beta$ . The azimuth of the desired direction is $\beta'$ plus the measured shadow angle $\alpha$ .

References¶

Tauxe, L. (1998). Paleomagnetic Principles and Practice. Kluwer Academic Publishers.
Scheidegger, A. E. (1965). On the statistics of the orientation of bedding planes, grain axes, and similar sedimentological data. U.S. Geological Survey Professional Paper, 525–C, 164–167.
Efron, B., & Tibshirani, R. J. (1993). An Introduction to the Bootstrap (Vol. 57). Chapman. 10.1201/9780429246593