18–20 de noviembre de 2024
UIS
America/Bogota zona horaria

Machine Learning Pipeline for Particle Classification for the LAGO Water Cherenkov Detectors

20 nov 2024, 9:20
20m
Aula 402 - Laboratorio de Livianos (UIS)

Aula 402 - Laboratorio de Livianos

UIS

Charla corta Charlas cortas

Ponente

Ticiano Jorge Torres Peralta (TSWC/FACET-UNT/CONICET)

Descripción

One of LAGO’s primary objectives is to measure the secondary particle flux generated by Galactic Cosmic Rays (GCRs) as they impinge the Earth’s atmosphere, using its extensive network of Water Cherenkov Detectors (WCDs). We know that this flux predominantly includes electromagnetic, muonic, and hadronic components. While the resulting charge histograms provide valuable insights into the flux of primary cosmic rays and their key characteristics, they fall short in effectively differentiating the contributions from the various secondary particles.

In this work, we present a Machine Learning (ML) pipeline designed to process WCD data—both real and simulated—and classify these secondary particles, in a High Performance Computing (HPC) environment. This pipeline involves preprocessing, feature engineering and selection, a machine learning stage, and a final stage to aggregate the results. The primary algorithm used is an unsupervised hierarchical density-based clustering method called OPTICS (Ordering Points to Identify the Clustering Structure), which finds similarity patterns within the data to group and effectively classify the secondary particles. Here, OPTICS is run many times in parallel in the HPC environment to produce a set of results that are aggregated in the final stage to show the algorithm's robustness in terms of accuracy and precision.

Our results demonstrate that this enhanced methodology accurately identifies originating particles with a high degree of confidence on a single-pulse basis. These promising outcomes suggest the potential for implementing ML-based models across LAGO’s distributed detection network and in other astroparticle observatories, enabling semi-automated, onboard, and real-time data analysis.

Charla presencial o virtual Virtual

Autores primarios

Ticiano Jorge Torres Peralta (TSWC/FACET-UNT/CONICET) Dr. María Graciela Molina (TSWC/FACET-UNT/CONICET/INGV) Hernán Asorey (CIEMAT) IVÁN SIDELNIK (Departamento de Física de Neutrones, Centro Atómico Bariloche, Comisión Nacional de Energía Atómica (CNEA), Bariloche R8402, Argentina.) Dr. Antonio Juan Rubio Montero (CIEMAT) Sergio Dasso (UBA FCEN DCAO Lamp Group, IAFE(CONICET-UBA), UBA FCEN DF) Dr. Rafael Mayo Garcia (CIEMAT) Dr. Alvaro Taboada (ITEDA)

Materiales de la presentación