DDCAL: Evenly Distributing Data into Low Variance Clusters Based on Iterative Feature Scaling

Lux, Marian; Rinderle-Ma, Stefanie

doi:10.1007/s00357-022-09428-6

lux_ddcal_2023


Title:: DDCAL: Evenly Distributing Data into Low Variance Clusters Based on Iterative Feature Scaling
Document type:: Journal article
Author(s):: Lux, Marian; Rinderle-Ma, Stefanie
Abstract:: This work studies the problem of clustering one-dimensional data points such that they are evenly distributed over a given number of low variance clusters. One application is the visualization of data on choropleth maps or on business process models, but without over-emphasizing outliers. This enables the detection and differentiation of smaller clusters. The problem is tackled based on a heuristic algorithm called DDCAL (1d distribution cluster algorithm) that is based on iterative feature scaling which generates stable results of clusters. The effectiveness of the DDCAL algorithm is shown based on 5 artificial data sets with different distributions and 4 real-world data sets reflecting different use cases. Moreover, the results from DDCAL, by using these data sets, are compared to 11 existing clustering algorithms. The application of the DDCAL algorithm is illustrated through the visualization of pandemic and population data on choropleth maps as well as process mining results on process models.
Journal title:: Journal of Classification
Year:: 2023
Month:: January
Language:: en
Full text / DOI:: doi:10.1007/s00357-022-09428-6
WWW:: https://doi.org/10.1007/s00357-022-09428-6
Print-ISSN:: 1432-1343
