Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Appendix A: Definitions, Derivations and Tricks

Scripps Institution of Oceanography, UC San Diego

Paleomagnetism is famous for its use of a large number of incomprehensible acronyms. Here we have them gathered together along with definitions and the Section numbers where they are explained in more detail. You will find here a table of physical constants and paleomagnetic parameters used in the text as well as a table listing common statistics used in paleomagnetism. After the tables, there are a few sections with useful mathematical tricks.

A.1Definitions

Table A.1:Acronyms in paleomagnetism.

AcronymDefinition: Section #
AMSAnisotropy of magnetic susceptibility: Section 13.1
APWPApparent polar wander path: Section 16.2
AFAlternating field demagnetization: Section 9.1.4
ARMAnhysteretic remanent magnetization: Section 7.10
ChRMCharacteristic remanent magnetization: Section 9.1.5
CNSCretaceous Normal Superchron: Section 15.1
CRMChemical remanent magnetization: Section 7.5
DGRFDefinitive geomagnetic reference field: Section 2.2
DRMDetrital remanent magnetization: Section 7.6
E/IElongation/inclination correction method: Section 16.4
FCField cooled: Section 8.8.4
GADGeocentric axial dipole: Section 2.3
GHAGreenwich hour angle: Section A.3.8
GPTSGeomagnetic polarity time scale: Chapter 15
GRMGyroremanent magnetization: Section 7.10
IGRFInternational geomagnetic reference field: Section 2.2
IZZIInfield-zero field/ zero field-infield paleointensity protocol: Section 10.1.1.1
IRMIsothermal remanent magnetization: Section 5.2.1 and Section 7.7
MDMultidomain: Chapter 4
MDFMedian destructive field: Section 8.2
MDTMedian destructive temperature: Section 8.2
NRMNatural remanent magnetization: Chapter 7
pARMPartial anhysteretic remanence: Section 7.10
pDRMPost-depositional detrital remanent magnetization: Section 7.6
PSDPseudo-single domain: Chapter 4
PSVPaleosecular variation of the geomagnetic field: Section 14.1
pTRMPartial thermal remanence: Section 7.4
sIRMSaturation IRM: See MrM_r
SDSingle domain: Chapter 4
SPSuperparamagnetic: Section 4.3
SVSecular variation: Section 14.1
TRMThermal remanent magnetization: Section 7.4
VADMVirtual axial dipole moment: Section 2.4.3
VDMVirtual dipole moment: Section 2.4.3; Equation 2.21
VDSVector difference sum: Section 9.1.6
VGPVirtual geomagnetic pole: Section 2.4.2
VRMViscous remanent magnetization: Section 7.3
SQUIDSuperconducting quantum interference device: Section 9.1.2
UTUniversal time (Greenwich mean time): Section A.3.8
ZFCZero-field cooled: Section 8.8.4

Table A.2:Physical Parameters and Constants.

SymbolDefinition: Section #
χ\chiMagnetic susceptibility: The slope relating induced magnetization to an applied field: Section 1.5
χARM\chi_{ARM}ARM susceptibility: Section 8.6
χb\chi_bBulk magnetic susceptibility: Section 1.5; Equation 1.4
χd\chi_dDiamagnetic susceptibility: Section 3.2.1
χf\chi_fFerromagnetic susceptibility: Section 3.3
χfd\chi_{fd}Frequency dependent: Section 8.3.3
χh\chi_hHigh-frequency susceptibility: Section 8.8.2
χhf\chi_{hf}High-field susceptibility: Section 5.2.2
χi\chi_{i}Initial susceptibility: Section 5.2.2
χl\chi_lLow-frequency susceptibility: Section 8.8.2
χp\chi_pParamagnetic susceptibility: Section 3.2.2
δFC\delta_{FC}Verwey transition temperature jump while cooling in a field: Section 8.8.4
δZFC\delta_{ZFC}Verwey transition temperature jump while cooling in zero field: Section 8.8.4
ΔM\Delta M curveCurve defined by subtracting the ascending from the descending curves in a hysteresis loop: Section 5.2.1
λ,ϕ\lambda,\phiLatitude, Longitude
μo\mu_oPermeability of free space: (4π\pi x 107^{-7} Hm1^{-1}): Section 1.6
τ\tauRelaxation time: Section 4.3; Equation 4.18
θm\theta_mMagnetic co-latitude: Section 2.4; Equation 2.13
θ\thetaCo-latitude: Section 1.8
aija_{ij}Direction cosines: Section A.3.5.1
[am][a_m]Magnetic activity: Section 10.1.2
aaThe radius of the Earth (6.371 x 106^6 m): Section 2.2
B\mathbf{B}Magnetic induction: Section 1.3
CCFrequency factor (1010^{10} s1^{-1}): Section 4.3
DDDeclination: Section 2.1; Equation 2.4
EEElongation: Table 8.1
IIInclination: Section 2.1; Equation 2.4
gml,hmlg_m^l, h_m^lGauss coefficients: Section 2.2
H\mathbf{H}Magnetic field: Section 1.1
HcrH_{cr}Coercivity of remanence; field required to reduce saturation IRM to zero: Section 5.1
HcH_cCoercivity; the magnetic field required to change the magnetic moment of a particle from one easy axis to another: Section 5.1
kBk_BBoltzmann’s constant (1.381 x 1023^{-23} JK1^{-1}): Section 3.2.2
KiK_iAMS measurement: Section D.1
KuK_uConstant of uniaxial anisotropy energy: Paragraph and Section 4.1.5
m\mathbf{m}Magnetic moment: Section 1.2
μb\mu_bBohr magneton (9.27 x 1024^{-24} Am2^2): Section 3.1
M\mathbf{M}Magnetization: Section 1.5
MeqM_{eq}Equilibrium magnetization: Section 7.3
MrM_rSaturation remanence (also sIRM): Section 5.2.1
MsM_sSaturation magnetization; the magnetization measured in the presence of a saturating field: Section 5.2.1
PlmP_l^mSchmidt polynomials: Section 2.2
s\mathbf{s}Six elements of χij\chi_{ij}; s1=χ11,s2=χ22,s3=χ33,s4=χ12,s5=χ23,s6=χ13s_1=\chi_{11}, s_2=\chi_{22}, s_3=\chi_{33}, s_4=\chi_{12}, s_5=\chi_{23}, s_6=\chi_{13}: Section 13.1; Equation 13.3
RxR_xIRM cross-over value: Section 8.4.1
TTAbsolute temperature (in kelvin)
TbT_bBlocking temperature: Section 7.4
TcT_cCurie (Néel) temperature: Section 3.3, Section 8.2
ThT_hHopkinson Effect: Section 8.2
TmT_mMorin transition: Section 8.2
ToT_oAbsolute zero: Section 3.3
TpT_pPyrrhotite transition: Section 8.2
TvT_vVerwey temperature: Paragraph, Section 8.2
vvVolume
vbv_bBlocking volume: Section 7.5

Table A.3:Common statistics in paleomagnetism.

StatisticDefinition: Section #
α95\alpha_{95}Radius of circle (cone) of 95% confidence (Fisher): Section 11.1.2.1, Equation 11.17
δ\deltaResidual errors for AMS measurements: Section 13.1, Equation 13.11
ϵij\epsilon_{ij}Semi-angles of Hext uncertainty ellipses: Section 13.1, Equation 13.17
κ\kappaFisher precision parameter: Section 11.1.2.1, Equation 11.9
η95,ζ95\eta_{95}, \zeta_{95}Semi-angles of directional 95% uncertainty ellipses: Section C.2.4, Equation C.17
τ,V\boldsymbol{\tau}, \mathbf{V}Eigenvalues and eigenvectors of tensors: Section A.3.5.4, Equation A.48
kkEstimate of κ\kappa: Section 11.1.2.1, Equation 11.16
CSDCircular standard deviation (Fisher): Section 11.1.2.1, Equation 11.20
dmdmUncertainty in the meridian (longitude) of a paleomagnetic pole: Section 11.1.2.1, Equation 11.22
dpdpUncertainty in the parallel (latitude) of a paleomagnetic pole: Section 11.1.2.1, Equation 11.22
F,F12,F23F, F_{12}, F_{23}Significance tests for anisotropy (Hext): Section 13.2.1, Equation 13.19
MADMaximum angular deviation of principal eigenvector (Kirschvink): Section 9.1.7, Equation 9.1
MADplane_{plane}MAD of the pole to a best-fit plane (Kirschvink): Section 9.1.7, Equation 9.2
Mu,MeM_u, M_eSignificance tests for uniform and exponential distributions: Section 11.1.5, Section B.1.5, Equation B.6, and Equation B.7
NNNumber of samples, specimens or sites
nfn_fNumber of degrees of freedom: Section 13.2, Equation 13.12
RRResultant vector length of unit vectors: Section 11.1.2.1, Equation 11.14
RoR_oCritical value of RR for non-random distribution (Watson): Section 11.1.3.1, Equation 11.23
SfS_fScatter of VGPs - corrected for within site scatter: Equation 14.3
SpS_pScatter of VGPs: Equation 14.2
SoS_oResidual sum of squares of errors (Hext): Section 13.2, Equation 13.12
T\mathbf{T}Orientation tensor: Section A.3.5.4, Equation A.47

A.2Derivations

A.2.1Langevin function for a paramagnetic substance

Here we derive the Langevin function for a paramagnetic substance with magnetic moments mm in an applied field HH at temperature TT. If we make the assumption that there is no preferred alignment within the substance, we can assume that the number of moments (n(α)n(\alpha)) between angles α\alpha and α+dα\alpha + d\alpha with respect to H\mathbf{H} is proportional to the solid angle sinαdα\sin\alpha d\alpha and the probability density function, i.e.

n(α)dαexp(EmkBT)sinαdα,n(\alpha) d\alpha \propto \exp \bigl({ -E_m\over {k_BT}} \bigr) \sin \alpha d\alpha,

where EmE_m is the magnetic energy. When we measure the induced magnetization, we really measure only the component of the moment parallel to the applied field, or n(α)mcosαn(\alpha) m \cos\alpha. The net induced magnetization MIM_I of a population of particles with volume vv is therefore:

MI=mv0πn(α)cosαdα.M_I = {m\over v} \int_0^{\pi} n(\alpha)\cos \alpha d \alpha.

By definition, n(α)n(\alpha) integrates to NN, the total number of moments, or

N=0πn(α)dα.N = \int_0^{\pi} n(\alpha)d\alpha.

The total saturation moment of a given population of NN individual magnetic moments mm is NmNm. The saturation value of magnetization MsM_s is thus NmNm normalized by the volume vv. Therefore, the magnetization expressed as the fraction of saturation is:

MMs=0πn(α)cosαdα0πn(α)dα{M\over {M_s}} = { {\int_0^{\pi} n(\alpha ) \cos \alpha d\alpha}\over {\int_0^{\pi} n(\alpha )d\alpha }}
=0πe(mμoHcosα)/kBTcosαsinαdα0πe(mμoHcosα)/kBTsinαdα.= { {\int_0^{\pi} e^{(m\mu_o H \cos \alpha )/k_BT}\cos \alpha \sin \alpha d\alpha}\over { \int_0^{\pi} e^{(m\mu_o H\cos \alpha )/k_BT}\sin \alpha d\alpha}}.

By substituting a=mμoH/kBTa=m\mu_oH/k_BT and cosα=x\cos \alpha =x, we write

MMs=N11eaxxdx11eaxdx=(ea+eaeaea1a),{M\over {M_s}} = N { {\int_{-1}^{1} e^{a x}xdx} \over {\int_{-1}^1 e^{a x}dx} } = \bigl( { {e^{a} + e^{-a}} \over {e^{a} - e^{-a}} } - {1\over{a} } \bigr),

and finally

MMs=[coth a1a]=L(a).{M\over {M_s}} = [\text{coth } a - {1\over{a}}]=\mathcal{L} (a).

A.2.2Superparamagnetism

The derivation of superparamagnetism follows closely that of paramagnetism whereby the probability of finding a magnetization vector an angle α\alpha away from the direction of the applied field is given by:

n(α)dα=2πnoe(MsBvcosαkBT)sinαdα.n(\alpha )d\alpha = 2\pi n_o e^{({{M_sBv\cos \alpha}\over {k_BT}})}\sin \alpha d\alpha.

The total magnetization contributed by the NN moments is:

MMs=0πcosαn(α)dα.{M\over {M_s}} = \int_0^{\pi} \cos \alpha n(\alpha )d\alpha.

Combining Equation A.8 and Equation A.9 we get:

MMs=N0πn(α)cosαdα0πn(α)dα{M\over {M_s}} = N { {\int_0^{\pi} n(\alpha ) \cos \alpha d\alpha}\over {\int_0^{\pi} n(\alpha )d\alpha }}
=N0πe(MsBvcosα)/kBTcosαsinαdα0πe(MsBvcosα)/kBTsinαdα.= N { {\int_0^{\pi} e^{(M_sBv\cos \alpha )/k_BT}\cos \alpha \sin \alpha d\alpha}\over { \int_0^{\pi} e^{(M_sBv\cos \alpha )/k_BT}\sin \alpha d\alpha}}.

By substituting a=MsBv/kBTa= M_sBv/k_BT and cosα=x\cos \alpha =x, and remembering Equation A.7, we can write:

MMs=N11eaxxdx11eaxdx=NL(a).{M\over {M_s}} = N { {\int_1^{-1} e^{a x}xdx} \over {\int_1^{-1} e^{a x}dx} } = N\mathcal{L} (a).

So finally

MMs=NL(a).{M\over {M_s}} = N\mathcal{L} (a).

A.3Useful tricks

In this section, we have assembled assorted mathematical and plotting techniques that come in handy throughout this book.

A.3.1Spherical trigonometry

Spherical trigonometry has widespread applications throughout the book. It is used in the transformations of observed directions to virtual poles (Chapter 2) and transformation of coordinate systems, to name a few. Here we summarize the two most useful relationships: the Law of Sines and the Law of Cosines.

Spherical triangle with vertices A, B, C on a globe, connected by great circle arcs a, b, c, with inset showing subtended angles on a unit sphere.

Figure A.1:Rules of spherical trigonometry. a,b,ca,b,c are all great circle tracks on a sphere which form a triangle with apices A,B,CA,B,C. The lengths of a,b,ca,b,c on a unit sphere are equal to the angles subtended by radii that intersect the globe at the apices, as shown in the inset. α,β,γ\alpha,\beta,\gamma are the angles between the great circles.

In Figure A.1, α,β\alpha, \beta and γ\gamma are the angles between the great circles labelled aa, bb, and cc. On a unit sphere, a,ba,b and cc are also the angles subtended by radii that intersect the globe at the apices A, B, and C (see inset on Figure A.1). Two formulae from spherical trigonometry come in handy in paleomagnetism, the Law of Sines:

sinαsina=sinβsinb=sinγsinc,{\sin \alpha \over \sin a}={\sin \beta \over \sin b}={\sin \gamma \over \sin c},

and the Law of Cosines:

cosa=cosbcosc+sinbsinccosα.\cos a = \cos b \cos c + \sin b \sin c \cos \alpha.

A.3.2Vector addition

Two vectors A and B in an X-Y coordinate system with their x and y components shown, angles to the X axis labeled, and resultant vector C from their addition.

Figure A.2:Vectors A\mathbf{A} and B\mathbf{B}, their components Ax,y_{x,y}, Bx,y_{x,y} and the angles between them and the XX axis, α\alpha and β\beta. The angle between the two vectors is αβ=Δ\alpha-\beta = \Delta. Unit vectors in the directions of the axes are x^\hat x and y^\hat y respectively.

To add the two vectors (see Figure A.2) A\mathbf{A} and B\mathbf{B}, we break each vector into components Ax,yA_{x,y} and Bx,yB_{x,y}. For example, Ax=Acosα,Ay=AsinαA_x=|A|\cos{\alpha}, A_y=|A|\sin{\alpha} where A|A| is the length of the vector A\mathbf{A}. The components of the resultant vector C\mathbf{C} are: Cx=Ax+Bx,Cy=Ay+ByC_x = A_x +B_x, C_y=A_y+B_y. These can be converted back to polar coordinates of magnitude and angles if desired, whereby:

C=Cx2+Cy2 and γ=cos1CxC.|C| = \sqrt {C_x^2+C_y^2} \text{ and } \gamma= \cos^{-1} { {C_x}\over{ {|C|}}}.

A.3.3Vector subtraction

To subtract two vectors, compute the components as in addition, but the components of the vector difference C\mathbf{C} are: Cx=AxBx,Cy=AyByC_x = A_x -B_x, C_y=A_y-B_y.

A.3.4Vector multiplication

There are two ways to multiply vectors. The first is the dot product whereby AB=AxBx+AyBy\mathbf{A} \cdot \mathbf{B}= A_xB_x + A_yB_y. This is a scalar and is actually the cosine of the angle between the two vectors if the A\mathbf{A} and B\mathbf{B} are taken as unit vectors (assume a magnitude of unity in the component calculation).

Three-dimensional diagram with vectors A and B in a plane separated by angle theta, and their cross product vector C pointing perpendicular to the plane.

Figure A.3:Illustration of cross product of vectors AA and BB separated by angle θ\theta to get the orthogonal vector CC.

The other way to perform vector multiplication is the cross product (see Figure A.3), which produces a vector orthogonal to both A\mathbf{A} and B\mathbf{B} and whose components are given by:

C=detx^y^z^AxAyAzBxByBz.C = \det \begin{vmatrix} \hat x & \hat y & \hat z \\ A_x & A_y & A_z \\ B_x & B_y & B_z \end{vmatrix}.

To calculate the determinant, we follow these rules:

Cx=AyBzAzBy,Cy=AzBxAxBz,Cz=AxByAyBx.C_x=A_yB_z - A_zB_y, \quad C_y=A_zB_x - A_xB_z, \quad C_z=A_xB_y - A_yB_x.

or

Ci=AjBkAkBjijk.C_i = A_jB_k-A_kB_j \quad i\neq j \neq k.

A.3.5Tricks with tensors

Vectors belong to a more general concept called tensors. While a vector describes a magnitude of something in a given direction, tensors allow calculation of magnitudes as a function of orientation. Velocity is a vector relating speed to direction, but speed may change depending on direction, so we might need a tensor to calculate speed as a function of direction. Many properties in Earth science require tensors, like the indicatrix in mineralogy which relates the speed of light to crystallographic direction, or the relationship between stress and strain. Tensors in paleomagnetism are used, for example, to transform coordinate systems and to characterize the anisotropy of magnetic properties such as susceptibility. We will cover transformation of coordinate systems in the following.

A.3.5.1Direction cosines

We use direction cosines in paleomagnetism in a variety of applications, from mineralogy to transformation from specimen to geographic or stratigraphic coordinate systems. Direction cosines are the cosines of the angles between different axes in given coordinate systems, here XX and XX' respectively (see, e.g., Figure A.4a). The direction cosine a12a_{12} is the cosine of the angle between the X1X_1 and the X2X'_2, α12\alpha_{12} axes. We can define four of these direction cosines to fully describe the relationship between the two coordinate systems:

a11=cosα11,a21=cosα21,a_{11} = \cos \alpha_{11}, \quad a_{21} = \cos \alpha_{21},
a12=cosα12,a22=cosα22.a_{12} = \cos \alpha_{12}, \quad a_{22} = \cos \alpha_{22}.

The first subscript always refers to the XX system and the second refers to the XX'.

Two-panel diagram. a) Vector R in the X1-X2 coordinate system at angle alpha. b) Two coordinate systems X and X-prime with direction cosine angles alpha-11, alpha-12, alpha-21, alpha-22 between their axes.

Figure A.4:Definition of direction cosines in two dimensions. a) Definition of vector in one set of coordinates, x1,x2x_1, x_2. b) Definition of angles relating XX axes to XX'.

A.3.5.2Changing coordinate systems

One application of using direction cosines is the transformation of coordinates systems from one set (XX) to a new set XX'. To find new coordinates x1,x2,..x'_1, x'_2,.. from the old (x1,x2,...x_1, x_2,...), we have:

x1=a11x1+a12x2,x2=a21x2+a22x2.x_1' = a_{11} x_1 + a_{12} x_2, \quad x_2' = a_{21} x_2 + a_{22} x_2.

In three dimensions we have:

x1=a11x1+a12x2+a13x3,x_1' = a_{11} x_1 + a_{12} x_2 + a_{13} x_3,
x2=a21x1+a22x2+a23x3,x_2' = a_{21} x_1 + a_{22} x_2 + a_{23} x_3,
x3=a31x1+a32x2+a33x3,x_3' = a_{31} x_1+ a_{32} x_2 + a_{33} x_3,

which can also be written as:

(x1x2x3)=(a11a12a13a21a22a23a31a32a33)(x1x2x3),\begin{pmatrix}x'_1\\ x'_2\\ x'_3\end{pmatrix} = \begin{pmatrix} a_{11}&a_{12}&a_{13}\\ a_{21}&a_{22}&a_{23}\\ a_{31}&a_{32}&a_{33} \end{pmatrix} \begin{pmatrix}x_1\\ x_2\\ x_3\end{pmatrix},

with a short cut notation as: xi=aijxjx'_i = a_{ij} x_j. However we write this, it means that for each axis ii, just sum through the jj’s for all the dimensions. The matrix aija_{ij} is an example of a 3 x 3 tensor and equations of the form Ai=BijCjA_i = B_{ij} C_j relating two vectors with a tensor will be used throughout the book. A more common notation is with bold-faced variables which indicate vectors or tensors, e.g., A=BC\mathbf{A} = \mathbf{B} \cdot \mathbf{C}.

Two-panel diagram. a) Sample cube with right-hand coordinate axes X1, X2, X3. b) Sphere showing X and X-prime coordinate systems with angle alpha between corresponding axes and spherical triangle relating the two systems.

Figure A.5:a) Sample coordinate system. b) Trigonometric relations between two cartesian coordinate systems, Xi\mathbf{X}_i and Xi\mathbf{X}'_i. λ,ϕ,ψ\lambda,\phi,\psi are all known and the angles between the various axes can be calculated using spherical trigonometry. For example, the angle α\alpha between X1\mathbf{X}_1 and X1\mathbf{X}_1' forms one side of the triangle shown by dash-dot lines. Thus, cosα=cosλcosϕ+sinλsinϕcosψ\cos \alpha = \cos \lambda \cos \phi + \sin \lambda \sin \phi \cos \psi. [Figure from Tauxe (1998).]

Now we would like to apply this to changing coordinate systems for a paleomagnetic specimen in the most general case. The specimen coordinate system is defined by a right-hand rule where the thumb (X1\mathbf{X}_1) is directed parallel to an arrow marked on the sample, the index finger (X2\mathbf{X}_2) is in the same plane but at right angles and clockwise to X1\mathbf{X}_1 and the middle finger (X3\mathbf{X}_3) is perpendicular to the other two (Figure A.5a). The transformation of coordinates (xix_i) from the Xi\mathbf{X}_i axes to the coordinates in the desired X\mathbf{X}' coordinate system requires the determination of the direction cosines as described in Section A.3.5.1. The various aija_{ij} can be calculated using spherical trigonometry as in Section A.3.1. For example, a11a_{11} for the general case depicted in Figure A.5 is cosα\cos \alpha, which is given by the Law of Cosines (see Section A.3.1) by using appropriate values, or:

cosα=cosλcosϕ+sinλsinϕcosψ.\cos \alpha = \cos \lambda \cos \phi + \sin \lambda \sin \phi \cos \psi.

The other aija_{ij} can be calculated in a similar manner. In the case of most coordinate system rotations used in paleomagnetism, X2X_2 is in the same plane as X1X'_1 and X2X'_2 (and is horizontal) so ψ\psi = 90°. This problem is much simpler. The directions cosines for the case where ψ=90\psi = 90^{\circ} are:

a=(cosλcosϕsinϕsinλcosϕcosλsinϕcosϕsinλsinϕsinλ0cosλ).a=\begin{pmatrix} \cos \lambda \cos \phi & - \sin \phi & - \sin \lambda \cos \phi\\ \cos \lambda \sin \phi & \cos \phi & - \sin \lambda \sin \phi \\ \sin \lambda & 0& \cos \lambda \end{pmatrix}.

The new coordinates can be obtained from Equation A.26, as follows:

x1=a11x1+a12x2+a13x3x'_1 = a_{11}x_1 + a_{12}x_2 + a_{13}x_3
x2=a21x1+a22x2+a23x3x'_2 = a_{21}x_1 + a_{22}x_2 + a_{23}x_3
x3=a31x1+a32x2+a33x3.x'_3 = a_{31}x_1 + a_{32}x_2 + a_{33}x_3.

The declination and inclination can be calculated by inserting these values in the equations in Chapter 2.

A.3.5.3Method for rotating points on a globe using finite rotation poles

Given the coordinates of the point on the globe PpP_p with latitude λp\lambda_p, longitude ϕp\phi_p the finite rotation pole PfP_f with latitude λf\lambda_f, longitude ϕf\phi_f, the way to transform coordinates is as follows (you should also review Section A.3.5.2).

  1. Convert the latitudes and longitudes to cartesian coordinates by:

P1=cosϕcosλ,P2=sinϕcosλ,P3=sinλP_1=\cos\phi \cos \lambda, \quad P_2 = \sin \phi \cos \lambda, \quad P_3 = \sin \lambda

where PP is the point of interest.

  1. Set up the rotation matrix RR as:

R11=Pf1Pf1(1cosΩ)+cosΩR_{11} = P_{f1}P_{f1}(1-\cos \Omega) + \cos \Omega
R12=Pf1Pf2(1cosΩ)Pf3sinΩR_{12} = P_{f1}P_{f2}(1-\cos \Omega) - P_{f3} \sin \Omega
R13=Pf1Pf3(1cosΩ)+Pf2sinΩR_{13} = P_{f1}P_{f3}(1-\cos \Omega) + P_{f2}\sin \Omega
R21=Pf2Pf1(1cosΩ)+Pf3sinΩR_{21} = P_{f2}P_{f1}(1-\cos \Omega) + P_{f3}\sin \Omega
R22=Pf2Pf2(1cosΩ)+cosΩR_{22} = P_{f2}P_{f2}(1-\cos \Omega) + \cos \Omega
R23=Pf2Pf3(1cosΩ)Pf1sinΩR_{23} = P_{f2}P_{f3}(1-\cos \Omega) - P_{f1}\sin \Omega
R31=Pf3Pf1(1cosΩ)Pf2sinΩR_{31} = P_{f3}P_{f1}(1-\cos \Omega) - P_{f2} \sin \Omega
R32=Pf3Pf2(1cosΩ)+Pf1sinΩR_{32} = P_{f3}P_{f2}(1-\cos \Omega) + P_{f1}\sin \Omega
R33=Pf3Pf3(1cosΩ)+cosΩR_{33} = P_{f3}P_{f3}(1-\cos \Omega) + \cos \Omega
  1. The coordinates of the transformed pole (PtP_t) are:

Pt1=R11Pp1+R12Pp2+R13Pp3P_{t1} = R_{11} P_{p1} + R_{12}P_{p2}+R_{13}P_{p3}
Pt2=R21Pp1+R22Pp2+R23Pp3P_{t2} = R_{21} P_{p1} + R_{22}P_{p2}+R_{23}P_{p3}
Pt3=R31Pp1+R32Pp2+R33Pp3P_{t3} = R_{31} P_{p1} + R_{32}P_{p2}+R_{33}P_{p3}

which can be converted back into latitude and longitude in the usual way (see Chapter 2).

A.3.5.4The orientation tensor and eigenvectors

The orientation tensor T\mathbf{T} Scheidegger, 1965 (also known as the matrix of sums of squares and products), is extremely useful in paleomagnetism. This is found as follows:

  1. Convert the DD, II, and MM for a set of data points (e.g., a sequence of demagnetization data, or a set of geomagnetic vectors or unit vectors where M=1M=1) to corresponding xix_i values (see Chapter 2).

  2. Calculate the coordinates of the “center of mass” (xˉ\bar x) of the data points:

xˉ1=1N(1Nx1i);xˉ2=1N(1Nx2i);xˉ3=1N(1Nx3i),\bar x_1 = {1\over N} (\sum_{1}^{N} x_{1i}); \quad \bar x_2 = {1\over N} (\sum_{1}^{N} x_{2i}); \quad \bar x_3 = {1\over N} (\sum_{1}^{N} x_{3i}),

where NN is the number of data points involved. Note that for unit vectors, the center of mass is the same as the Fisher mean (Chapter 11).

  1. Transform the origin of the data cluster to the center of mass:

x1i=x1ixˉ1;x2i=x2ixˉ2;x3i=x3ixˉ3,x_{1i}'=x_{1i}-\bar x_1; \quad x_{2i}'=x_{2i}-\bar x_2; \quad x_{3i}'=x_{3i}-\bar x_3,

where xix'_i are the transformed coordinates.

  1. The orientation matrix is defined as:

T=(x1ix1ix1ix2ix1ix3ix2ix1ix2ix2ix2ix3ix3ix1ix3ix2ix3ix3i).\mathbf{T}=\begin{pmatrix}\sum x'_{1i}x'_{1i}&\sum x'_{1i}x'_{2i}&\sum x'_{1i}x'_{3i}\\ \sum x'_{2i}x'_{1i}&\sum x'_{2i}x'_{2i}&\sum x'_{2i}x'_{3i}\\ \sum x'_{3i}x'_{1i}&\sum x'_{3i}x'_{2i}&\sum x'_{3i}x'_{3i}\end{pmatrix}.

T\mathbf{T} is a 3 x 3 matrix, where only six of the nine elements are independent. It is constructed in some coordinate system, such as the geographic or sample coordinate system. Usually, none of the six independent elements are zero. There exists, however, a coordinate system along which the “off-axis” terms are zero and the axes of this coordinate system are called the eigenvectors of the matrix. The three elements of T\mathbf{T} in the eigenvector coordinate system are called eigenvalues. In terms of linear algebra, this idea can be expressed as:

TV=τV,\mathbf{T} \mathbf{V} = \boldsymbol{\tau} \mathbf{V},

where V\mathbf{V} is the matrix containing three eigenvectors and τ\boldsymbol{\tau} is the diagonal matrix containing three eigenvalues. Equation A.48 is only true if:

detTτ=0.\text{det} | \mathbf{T} - \boldsymbol{\tau} | = 0.

If we expand Equation A.49, we have a third degree polynomial whose roots (τ\tau) are the eigenvalues:

(T11τ)[(T22τ)(T33τ)T232](T_{11}-\tau)[(T_{22}-\tau)(T_{33}-\tau) - T_{23}^2] -
T12[T12(T33τ)T13T23]+T13[T13T23T13(T22τ)]=0.T_{12}[T_{12}(T_{33}-\tau) - T_{13}T_{23}] + T_{13}[T_{13}T_{23}-T_{13}(T_{22}-\tau)] = 0.

The three possible values of τ\tau (τ1,τ2,τ3\tau_1, \tau_2, \tau_3) can be found with iteration and determination. In practice, there are many programs for calculating τ\boldsymbol{\tau}. My personal favorite is the Numpy Module for Python (see many free websites, especially Scientific Python (SciPy) for hints). Please note that the conventions adopted here are to scale the τ\tau’s such that they sum to one; the largest eigenvalue is termed τ1\tau_1 and corresponds to the eigenvector V1\mathbf{V}_1.

Inserting the values for the transformed components calculated in Equation A.46 into T\mathbf{T} gives the covariance matrix for the demagnetization data. The direction of the axis associated with the greatest scatter in the data (the principal eigenvector V1\mathbf{V}_1) corresponds to a best-fit line through the data. This is usually taken to be the direction of the component in question. This direction also corresponds to the axis around which the “moment of inertia” is least. The eigenvalues of T\mathbf{T} are the variances associated with each eigenvector. Thus the standard deviations are σi=τi\sigma_i=\sqrt{\tau_i}.

A.3.6Upside down triangles, \nabla

A.3.6.1Gradient

We often wish to differentiate a function along three orthogonal axes. For example, imagine we know the topography of a ski area (see Figure A.6). For every location (in say, XX and YY coordinates), we know the height above sea level. This is a scalar function. Now imagine we want to build a ski resort, so we need to know the direction of steepest descent and the slope (red arrows in Figure A.6).

Photograph of a snow-covered ski slope with red arrows indicating the direction and magnitude of steepest descent on the terrain.

Figure A.6:Illustration of the relationship between a vector field (direction and magnitude of steepest slope at every point, e.g., red arrows) and the scalar field (height) of a ski slope.

To convert the scalar field (height versus position) to a vector field (direction and magnitude of greatest slope) mathematically, we would simply differentiate the topography function. Let’s say we had a very weird two dimensional, sinusoidal topography such that z=f(x)=sinxz=f(x)=\sin x with zz the height and xx is the distance from some marker. The slope in the xx direction (x^\hat x), then would be x^ddxf(x)\hat x {d \over {d x}}{ f(x) }. If f(x,y,z)f(x,y,z) were a three dimensional topography then the gradient of the topography function would be:

(x^xf+y^yf+z^zf).(\hat x {{\partial} \over {\partial x}} f + \hat y {{\partial} \over {\partial y}} f + \hat z {{\partial} \over {\partial z}} f) .

For short hand, we define a “vector differential operator” to be a vector whose components are

=(x^x,y^y,z^z).\nabla = (\hat x {\partial \over {\partial x}}, \hat y {\partial \over {\partial y}}, \hat z {\partial \over {\partial z}}).

This can also be written in polar coordinates:

=r,rθ,rsinθϕ.\nabla = {\partial \over {\partial r}}, {\partial \over {r\partial \theta}}, {\partial \over {r\sin \theta \partial \phi}}.
Arrows radiating outward from a central point with increasing magnitude, enclosed by a dashed box, illustrating a vector field with non-zero divergence.

Figure A.7:Example of a vector field with a non-zero divergence.

A.3.6.2Divergence

The divergence of a vector function (e.g. H\mathbf{H}) is written as:

H.\nabla \cdot \mathbf{H}.

The trick here is to treat \nabla as a vector and use the rules for dot products described in Section A.3.2. In cartesian coordinates, this is:

H=x^Hxx+y^Hyy+z^Hzz.\nabla \cdot \mathbf{H} = \hat x {{\partial H_x} \over {\partial x}} + \hat y {{\partial H_y}\over {\partial y}} + \hat z {{\partial H_z}\over {\partial z}}.

Like all dot products, the divergence of a vector function is a scalar.

Uniform parallel arrows all pointing upward with equal magnitude, enclosed by a dashed box, illustrating a vector field with zero divergence.

Figure A.8:Example of a vector field with zero divergence.

Arrows circulating counterclockwise around a central point in the x-y plane along an elliptical path, with x, y, and z axes shown, illustrating a vector field with non-zero curl.

Figure A.9:Example of a vector field with non-zero curl.

The name divergence is well chosen because H\nabla \cdot \mathbf{H} is a measure of how much the vector field “spreads out” (diverges) from the point in question. In fact, what divergence quantifies is the balance between vectors coming in to a particular region versus those that go out. The example in Figure A.7 depicts a vector function whereby the magnitude of the vector increases linearly with distance away from the central point. An example of such a function would be v(r)=rv(r)=r. The divergence of this function is:

v=rr=1.\nabla \cdot v = {\partial \over {\partial r}} r = 1.

(a scalar). There are no arrows returning in to the dashed box, only vectors going out and the non-zero divergence quantifies this net flux out of the box.

Now consider Figure A.8, which depicts a vector function that is constant over space, i.e. v(r)=kv(r) = k. The divergence of this function is:

v=rk=0.\nabla \cdot v = {\partial \over {\partial r}} k = 0.

The zero divergence means that for every vector leaving the box, there is an equal and opposite vector coming in. Put another way, no net flux results in a zero divergence. The fact that the divergence of the magnetic field is zero means that there are no point sources (monopoles), as opposed to electrical fields that have divergence related to the presence of electrons or protons.

A.3.6.3Curl

The curl of the vector function B\mathbf{B} is defined as ×B\nabla \times \mathbf{B}. In cartesian coordinates we have

×B=x^(yBzzBy)+y^(zBxxBz)+z^(xByyBx).\nabla \times \mathbf{B} = \hat x ({\partial \over {\partial y}} B_z - {\partial \over {\partial z}} B_y) + \hat y ({\partial \over {\partial z}} B_x - {\partial \over {\partial x}} B_z) + \hat z ({\partial \over {\partial x}} B_y - {\partial \over {\partial y}} B_x).

Curl is a measure of how much the vector function “curls” around a given point. The function describing the velocity of water in a whirlpool has a significant curl, while that of a smoothly flowing stream does not.

Consider Figure A.9 which depicts a vector function v=yx^+xy^v=-y\hat x + x\hat y. The curl of this function is:

×v=detx^y^z^xyzyx0,\nabla \times v = \det \begin{vmatrix} \hat x & \hat y & \hat z \\ {\partial \over {\partial x}} & {\partial \over {\partial y}} & {\partial \over {\partial z}}\\ -y & x & 0 \end{vmatrix},

or

x^(y0zx)+y^(x0z(y))+z^(xxy(y)).\hat x ( {\partial \over {\partial y}} 0 - {\partial \over {\partial z}} x ) + \hat y ( {\partial \over {\partial x}} 0 - {\partial \over {\partial z}} (-y) ) + \hat z ( {\partial \over {\partial x}} x - {\partial \over {\partial y}} (-y) ).
=0x^+0y^+2z^= 0 \hat x + 0 \hat y + 2\hat z

So there is a positive curl in this function and the curl is a vector in the z^\hat z direction.

The magnetic field has a non-zero curl in the presence of currents or changing electric fields. In free space, away from currents (lightning!!), the magnetic field has zero curl.

A.3.7The statistical bootstrap

Sometimes things just are not normal. Statistically that is. When you can not assume that your data follow some known distribution, like the normal distribution, or the Fisher distribution, what do you do? In this section, we outline a technique called the bootstrap, which allows us to make statistical inferences when parametric assumptions fail. The reader should also refer to Efron & Tibshirani (1993) for a more complete discussion.

Three-panel figure. a) Histogram of 500 data points from a Gaussian distribution. b) Q-Q plot showing data versus normal quantiles. c) Histogram of 10,000 bootstrapped means with 95% confidence bounds marked.

Figure A.10:Bootstrapping applied to a normal distribution. a) 500 data points are drawn from a Gaussian distribution with mean of 10 and a standard deviation of 2. b) Q-Q plot of data in a). The 95% confidence interval for the mean is given by Gauss statistics as ± 0.17. 10,000 new (para) data sets are generated by randomly drawing NN data points from the original data set shown in a). c) A histogram of the means from all the para-data sets. 95% of the means fall within the interval 10.060.16+0.16^{+0.16}_{-0.16}, hence the bootstrap confidence interval is similar to that calculated with Gaussian statistics. [Figure from Tauxe (1998).]

In Figure A.10, we illustrate the essentials of the statistical bootstrap. We will develop the technique using data drawn from a normal distribution. First, we generate a synthetic data set by drawing 500 data points from a normal distribution with a mean xˉ\bar x of 10 and a standard deviation σ\sigma of 2. The synthetic data are plotted as a histogram in Figure A.10a. In Figure A.10b we plot the data as a Q-Q plot (see Section B.1.5) against the ziz_i expected for a normal distribution.

The data in Figure A.10a plot in a line on the Q-Q plot (Figure A.10b). The value for DD is 0.0306. Because N=500N=500, the critical value of DD, DcD_c at the 95% confidence level is 0.0396. Happily, our normal distribution simulation program has produced a set of 500 numbers for which the null hypothesis of a normal distribution has not been rejected. The mean of the synthetic dataset is about 10 and the standard deviation is 1.9. The usual Gaussian statistics allow us to estimate a 95% confidence interval for the mean as ±1.96σ/N\pm 1.96 \sigma / \sqrt{N} or ±0.17\pm 0.17.

Sphere with North Pole at top showing site location L and sub-solar point S connected by spherical triangle with angles H, beta, beta-prime, theta, and solar declination delta.

Figure A.11:Calculation of the azimuth of the shadow direction (β\beta') relative to true North, using a sun compass. L is the site location (at λL,ϕL\lambda_L,\phi_L), S is the position on the Earth where the sun is directly overhead (λS,ϕS\lambda_S,\phi_S). [Figure from Tauxe (1998).]

In order to estimate a confidence interval for the mean using the bootstrap, we first randomly draw a list of NN data by selecting data points from the original data set. This list is called a pseudo-sample of the data. Some data points will be used more than once and others will not be used at all. We then calculate the mean of the pseudo-sample. We repeat the procedure of drawing pseudo-samples and calculating the mean many times (say 10,000 times). A histogram of the “bootstrapped” means is plotted in Figure A.10c. If these are sorted such that the first mean is the lowest and the last mean is the highest, the 95% of the means are between the 250th^{th} and the 9,750th^{th} mean. These therefore are the 95% confidence bounds because we are approximately 95% confident that the true mean lies between these limits. The 95% confidence interval calculated for the data in Figure A.10 by bootstrap is about ± 0.16 which is nearly the same as that calculated the Gaussian way. However, the bootstrap required orders of magnitude more calculations than the Gaussian method, hence it is ill-advised to perform a bootstrap calculation when a parametric one will do. Nonetheless, if the data are not Gaussian, the bootstrap provides a means of calculating confidence intervals when there is no quick and easy way. Furthermore, with a modern computer, the time required to calculate the bootstrap illustrated in Figure A.10 was virtually imperceptible.

A.3.8Directions using a sun compass

In a sun compass problem, we have the direction of the sun’s shadow and an angle between that and the desired direction (α\alpha). The declination of the shadow itself is 180° from the direction toward the sun. In Figure A.11, the problem of calculating declination from sun compass information is set up as a spherical trigonometry problem, similar to those introduced in Chapter 2 and Section A.3.1. The declination of the shadow direction β\beta', is given by 180 - β\beta. We also know the latitude of the sampling location L (λL\lambda_L). We need to calculate the latitude of S (the point on the Earth’s surface where the sun is directly overhead), and the local hour angle HH.

Knowing the time of observation (in Universal Time), the position of S (λs=δ,ϕs\lambda_s = \delta,\phi_s in Figure A.11) can be calculated with reasonable precision (to within 0.01°) for the period of time between 1950 and 2050 using the procedure recommended in the 1996 Astronomical Almanac:

  1. First, calculate the Julian Day JJ. Then, calculate the fraction of the day in Universal Time UU. Finally, calculate the parameter dd which is the number of days from J2000 by:

d=J2451545+U.d= J - 2451545 + U.
  1. The mean longitude of the sun (ϕs\phi_s), corrected for aberration, can be estimated in degrees by:

ϕs=280.461+0.9856474d.\phi_s=280.461 + 0.9856474 d.
  1. The mean anomaly g=357.528+0.9856003dg=357.528 + 0.9856003 d (in degrees).

  2. Put ϕs\phi_s and gg in the range 0 → 360°.

  3. The longitude of the ecliptic is given by ϕE=ϕs+1.915sing+0.020sin2g\phi_E=\phi_s + 1.915 \sin g + 0.020 \sin 2g (in degrees).

  4. The obliquity of the ecliptic is given by ϵ=23.4390.0000004d\epsilon = 23.439 - 0.0000004 d.

  5. Calculate the right ascension (AA) by:

A=ϕEftsin2ϕE+(f/2)t2sin4ϕE,A = \phi_E - ft \sin 2\phi_E + (f/2) t^2 \sin 4 \phi_E,

where f=180/πf=180/\pi and t=t=tan2ϵ/2^2\epsilon/2.

  1. The so-called “declination” of the sun (δ\delta in Figure A.11 which should not be confused with the magnetic declination DD), which we will use as the latitude λs\lambda_s, is given by:

δ=sin1(sinϵsinϕe).\delta = \sin^{-1}(\sin \epsilon \sin \phi_e).
  1. Finally, the equation of time in degrees is given by E=4(ϕsA)E= 4(\phi_s-A).

We can now calculate the Greenwich Hour Angle GHAGHA from the Universal Time UU (in minutes) by GHA=(U+E)/4+180GHA = (U + E)/4 + 180. The local hour angle (HH in Figure A.11) is GHA+ϕLGHA + \phi_L. We calculate β\beta using the laws of spherical trigonometry (see Section A.3.1). First we calculate θ\theta by the Law of Cosines (remembering that the cosine of the colatitude equals the sine of the latitude):

cosθ=sinλLsinλs+cosλLcosλscosH\cos \theta = \sin \lambda_L \sin \lambda_s + \cos \lambda_L \cos \lambda_s \cos H

and finally using the Law of Sines:

sinβ=(cosλssinH)/sinθ.\sin \beta = (\cos \lambda_s \sin H)/\sin \theta.

If λs<λL\lambda_s<\lambda_L, then the required angle is the shadow direction β\beta', given by: β=180β\beta' =180-\beta. The azimuth of the desired direction is β\beta' plus the measured shadow angle α\alpha.

References
  1. Tauxe, L. (1998). Paleomagnetic Principles and Practice. Kluwer Academic Publishers.
  2. Scheidegger, A. E. (1965). On the statistics of the orientation of bedding planes, grain axes, and similar sedimentological data. U.S. Geological Survey Professional Paper, 525–C, 164–167.
  3. Efron, B., & Tibshirani, R. J. (1993). An Introduction to the Bootstrap (Vol. 57). Chapman. 10.1201/9780429246593