Orange and Orange-Volcanoes: a New Open and Collaborative Platform to Perform Data-Driven Investigations and Machine Learning Analyses in Petrology and Volcanology

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Alessandro Musu, Valerio Parodi, Marko Toplak, Alessandro Carfì, Mónica Ágreda López, Fulvio Mastrogiovanni, J. Zhangzhou, Diego Perugini, Donato Belmonte, Blaž Zupan, Maurizio Petrelli 

Abstract

Orange-Volcanoes is an extension of the open-source Orange data mining platform specifically tailored for geochemical, petrological, and volcanological investigations. Orange-Volcanoes enhances the original platform by incorporating specialized tools to enable interactive data-driven investigations in geochemistry, such as performing Compositional Data Analysis (CoDA). Applying CoDA transformations enables the use of many standard and multivariate statistical methods like principal component analysis, discriminant analysis, and hierarchical clustering on compositional data. In this way, Orange-Volcanoes allows for the application of a wide range of data mining and statistical methods implemented in Orange using geochemical data. Moreover, Orange allows the use of advanced methods in the field of explainable artificial intelligence, such as feature importance and Shapley additive explanations. Also, within Orange-Volcanoes, we demonstrate the flexibility of the Orange platform by developing visual tools that allow for conducting mineral-liquid equilibrium tests and calculating thermo-barometric estimates. The Orange-Volcanoes supports collaborative efforts and reproducibility by offering a visual programming interface that requires no coding experience, making it accessible to a wide range of users, including scientists, educators, and students. We provide a series of case studies, including interactive petrological data exploration and clustering in tephra studies to highlight Orange-Volcanoes' potential and versatility in volcanological applications. Orange-Volcanoes can be downloaded using pip, and its documentation is available at https://bit.ly/orange3-volcanoes-doc

DOI

https://doi.org/10.31223/X5FT60

Subjects

Earth Sciences, Geochemistry, Multivariate Analysis, Volcanology

Keywords

Orange data mining, Data-driven investigations, Igneous Petrology, volcanology, Compositional data analysis (CoDA), machine learning

Dates

Published: 2025-03-06 08:20

Last Updated: 2025-03-06 16:20

License

CC-By Attribution-NonCommercial-NoDerivatives 4.0 International

Additional Metadata

Conflict of interest statement:
None