This research explores the application of Probabilistic Distance Clustering (PDC) to compositional data, specifically addressing the challenge of handling a bounded sample space in distance computations. It first reviews existing log-ratio transformations that map compositions into Euclidean space for clustering. Then, it adapts PDC to handle compositional data by using the additive ratio combined with the Box-Cox transformation. A simulation study will evaluate this method compared to the well-established isometric log-ratio transformation.

Probabilistic Distance Clustering for Compositional Data

Simonacci, Violetta
;
Palumbo, Francesco;Gallo, Michele
2025-01-01

Abstract

This research explores the application of Probabilistic Distance Clustering (PDC) to compositional data, specifically addressing the challenge of handling a bounded sample space in distance computations. It first reviews existing log-ratio transformations that map compositions into Euclidean space for clustering. Then, it adapts PDC to handle compositional data by using the additive ratio combined with the Box-Cox transformation. A simulation study will evaluate this method compared to the well-established isometric log-ratio transformation.
2025
9783031960321
9783031960338
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11574/251002
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact