Lehr- und Forschungseinheit für Datenbanksysteme

Accepted paper at BTW 2021

Cluster Flow — an Advanced Concept for Ensemble-Enabling, Interactive Clustering

21.01.2021

Authors

Sandra Obermeier, Anna Beer, Florian Wahl, Thomas Seidl

19. Fachtagung für Datenbanksysteme für Business, Technologie und Web,
13-17 September 2021, Dresden


Abstract

Even though most clustering algorithms serve knowledge discovery in fields other than computer science, most of them still require users to be familiar with programming or data mining to some extent. As that often prevents efficient research, we developed an easy-to-use, highly explainable clustering method accompanied by an interactive tool for clustering. It is based on intuitively understandable kNN graphs and the subsequent application of adaptable filters, which prune unnecessary or misleading edges and can be combined iteratively and in an ensemble-like manner. For a first overview of the data, fully automatic predefined filter cascades deliver robust results. A selection of simple filters and combination methods that can be chosen interactively yields very good results on benchmark datasets compared to various algorithms.
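
The abstract does not spell out the individual filters, so the following is only a minimal sketch of the general idea (build a kNN graph, prune edges with filters, combine the filters, read clusters off the remaining graph). It is not the authors' implementation. The two filters (`length_filter`, `mutual_filter`), the intersection-based combination, the parameter `factor`, and the toy data are all assumptions made for illustration.

```python
import math


def knn_lists(points, k):
    """For each point index, return the indices of its k nearest neighbours."""
    knn = {}
    for i, p in enumerate(points):
        others = [j for j in range(len(points)) if j != i]
        others.sort(key=lambda j: math.dist(p, points[j]))
        knn[i] = others[:k]
    return knn


def knn_edges(knn):
    """Turn neighbour lists into a set of undirected edges (i, j) with i < j."""
    return {(min(i, j), max(i, j)) for i, nbrs in knn.items() for j in nbrs}


def length_filter(points, edges, factor=2.0):
    """Hypothetical filter: keep edges no longer than factor * median edge length."""
    lengths = sorted(math.dist(points[i], points[j]) for i, j in edges)
    median = lengths[len(lengths) // 2]
    return {(i, j) for i, j in edges
            if math.dist(points[i], points[j]) <= factor * median}


def mutual_filter(knn, edges):
    """Hypothetical filter: keep edges whose endpoints list each other as neighbours."""
    return {(i, j) for i, j in edges if j in knn[i] and i in knn[j]}


def combine(*edge_sets):
    """Assumed ensemble rule: keep only the edges that every filter accepts."""
    return set.intersection(*edge_sets)


def connected_components(n, edges):
    """Label each of the n points with the root of its connected component."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, j in edges:
        parent[find(i)] = find(j)
    return [find(i) for i in range(n)]


# Toy data: two small, well-separated groups of four points each.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11)]
knn = knn_lists(points, k=4)
edges = knn_edges(knn)

pruned = combine(length_filter(points, edges), mutual_filter(knn, edges))
labels = connected_components(len(points), pruned)
print("clusters before filtering:", len(set(connected_components(len(points), edges))))
print("clusters after filtering: ", len(set(labels)))
```

On this toy data, k = 4 forces every point to link to the other group, so the unfiltered graph is a single component. The mutual-neighbour filter alone still leaves one bridging edge, while the combination with the length filter removes all bridges and separates the two groups. This illustrates why combining simple filters can be more robust than relying on any single one.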