Short Communication: The Wasserstein distance as a dissimilarity metric for comparing detrital age spectra, and other geological distributions

This is a Preprint and has not been peer reviewed. The published version of this Preprint is available: https://doi.org/10.5194/gchron-5-263-2023. This is version 5 of this Preprint.

Authors

Alex George Lipp, Pieter Vermeesch

Abstract

Distributional data such as detrital age populations or grain size distributions are common in the geological sciences. As analytical techniques become more sophisticated, increasingly large amounts of distributional data are being gathered. These advances require quantitative and objective methods, such as multidimensional scaling (MDS), to analyse large numbers of samples. Crucial to such methods is choosing a sensible measure of dissimilarity between samples. At present, the Kolmogorov-Smirnov (KS) statistic is the most widely used of these dissimilarity measures. However, the KS statistic has some limitations such as high sensitivity to differences between the modes of two distributions, and insensitivity to their tails. Here we propose the Wasserstein-2 distance (W2) as an additional and alternative metric for use in geochronology. Whereas the KS-distance is defined as the maximum vertical distance between two empirical cumulative distribution functions, the W2-distance is a function of the horizontal distances (i.e., age differences) between observations. Using a variety of synthetic and real datasets we explore scenarios where W2 may provide greater geological insight than the KS statistic. We find that in cases where absolute time differences are not relevant (e.g., mixing of known, discrete age peaks), the KS statistic can be more intuitive. However, in scenarios where absolute age differences are important (e.g., temporally/spatially evolving sources, thermochronology, and overcoming laboratory biases) W2 is preferable. The W2-distance has been added to the R package IsoplotR, for immediate use in detrital geochronology and other applications. The W2 distance can be generalised to multiple dimensions, which opens opportunities beyond distributional data.
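
The contrast drawn in the abstract between the vertical KS distance and the horizontal W2 distance can be illustrated with a minimal sketch. It is not the IsoplotR implementation. It assumes NumPy, one-dimensional samples of ages, and the standard quantile-function form of the Wasserstein-2 distance for empirical distributions. The function names `ks_distance` and `w2_distance` and the example ages are hypothetical and chosen only for illustration.

```python
import numpy as np


def ks_distance(x, y):
    """Maximum vertical distance between two empirical CDFs."""
    x = np.sort(np.asarray(x, dtype=float))
    y = np.sort(np.asarray(y, dtype=float))
    # The ECDF difference only changes at observed values, so it is
    # enough to evaluate both step functions at every observation.
    grid = np.concatenate([x, y])
    cdf_x = np.searchsorted(x, grid, side="right") / x.size
    cdf_y = np.searchsorted(y, grid, side="right") / y.size
    return float(np.max(np.abs(cdf_x - cdf_y)))


def w2_distance(x, y):
    """Wasserstein-2 distance between two 1-D empirical distributions.

    Uses W2^2 = integral over u in [0, 1] of (Qx(u) - Qy(u))^2, where Qx
    and Qy are the empirical quantile (step) functions. Sample sizes may
    differ.
    """
    x = np.sort(np.asarray(x, dtype=float))
    y = np.sort(np.asarray(y, dtype=float))
    # Both quantile functions are constant between these breakpoints.
    u = np.unique(np.concatenate([
        np.arange(x.size + 1) / x.size,
        np.arange(y.size + 1) / y.size,
    ]))
    mid = (u[:-1] + u[1:]) / 2  # one probe point inside each interval
    qx = x[np.minimum((mid * x.size).astype(int), x.size - 1)]
    qy = y[np.minimum((mid * y.size).astype(int), y.size - 1)]
    return float(np.sqrt(np.sum(np.diff(u) * (qx - qy) ** 2)))


if __name__ == "__main__":
    # Hypothetical ages in Myr, not data from the paper.
    ages = np.array([100.0, 200.0, 300.0])
    for shift in (50.0, 1000.0, 2000.0):
        print(shift, ks_distance(ages, ages + shift),
              w2_distance(ages, ages + shift))
```

In this toy case the KS distance reaches its maximum of 1 as soon as the two samples stop overlapping and stays there, whereas W2 keeps growing with the size of the age offset. This is the behaviour the abstract describes as relevant when absolute age differences matter.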

DOI

https://doi.org/10.31223/X5TM02

Subjects

Earth Sciences, Environmental Sciences, Geochemistry, Geology, Geomorphology, Stratigraphy, Tectonics and Structure

Keywords

Distributional data, Wasserstein distance, Kolmogorov-Smirnov distance, Detrital mineral ages, Zircon U-Pb dating, Multi-dimensional scaling

Dates

Published: 2022-10-26 20:39

Last Updated: 2023-05-17 20:31

License

CC BY Attribution 4.0 International

Additional Metadata

Data Availability: Link provided in manuscript