Soft clustering algorithms : theoretical and practical improvements / Kathrin Bujna ; [accepted at the recommendation of Prof. Dr. Johannes Blömer (Paderborn University) and Prof. Dr. Eyke Hüllermeier (Paderborn University)]. Paderborn, 2017
Abstract
Zusammenfassung
Contents
Cheat Sheet
1 Preface
1.1 Outline
1.2 Publications & Credits
I Soft Clusterings
2 Basics
2.1 Notation: Indices, Vectors, Data Sets
2.2 Clusterings
2.2.1 Soft Clustering
2.2.2 Hard Clustering
2.2.3 Clustering Problems
2.3 Descriptive Statistics
2.3.1 Cluster Statistics
2.3.2 Data Set Statistics
2.3.3 Lemmata
2.3.4 Scaling Weights and Copying Data Points
3 From Soft Clusters to Hard Clusters
3.1 Related Work
3.2 Contribution
3.3 Imitating Softness by Randomness
3.3.1 Probabilistic Memberships
3.3.2 Algorithm
3.4 Concentration Bounds
3.4.1 Elementary Inequalities
3.4.2 Chernoff Inequalities
3.5 Analysis
3.5.1 Preliminaries
3.5.2 Weight
3.5.3 Mean Vector
3.5.4 Covariance Matrix
3.5.5 Cost and Variance
3.6 Conclusions
3.6.1 Existence of Similar Hard Clusters
3.6.2 Quality of an Imitation
3.6.3 Remarks
II Fuzzy K-Means Problems
4 Introduction
4.1 The Fuzzy K-Means Problem
4.1.1 Problem Definition
4.1.2 Fuzzy K-Means Algorithm
4.1.3 No Guarantees
4.2 A Comparison with the K-Means Problem
4.2.1 Similarities
4.2.2 Differences
4.2.3 Statistical Assumptions
4.3 Related Work
4.3.1 The Fuzzy K-Means Algorithm
4.3.2 Fuzzifier
4.3.3 Extensions
4.4 More Related Work (The K-Means Problem)
4.4.1 The Bad News First
4.4.2 (Few Practical) Approximation Algorithms
4.4.3 Clustering is Difficult – Except when It Is Not
4.4.4 Constraints and Side Information
4.5 Overview
5 Basics
5.1 Problem Definition
5.1.1 Cost and Clusters
5.1.2 Induced Solutions
5.1.3 Approximation
5.2 Fuzzifier Functions
5.2.1 Definition
5.2.2 Basic Properties
5.2.3 Bounded Contribution
5.2.4 Bounded Increase
5.2.5 Reducing Probabilities
5.2.6 Induced r-Fuzzy Clusterings
5.3 Special Cases
5.3.1 Identity – K-Means
5.3.2 Power Function – Classical Fuzzy K-Means
5.3.3 Quadratic-Linear – Between K-Means and Fuzzy K-Means
5.3.4 Exponential Fuzzifier
6 Two Key Properties
6.1 Relation to the K-Means Cost Function
6.2 Negligible Clusters
7 Baselines
7.1 Contribution
7.2 2-Approximation Algorithm
7.3 (1+ε)-Approximation Algorithm
7.4 (const/minimumContribution)-Approximation Algorithm
8 Superset Sampling for Fuzzy Clusters
8.1 Related Work
8.2 Contribution
8.3 From Fuzzy Clusters to Hard Clusters
8.4 Applying Superset Sampling
8.5 Combining the Results
8.5.1 Approximation Factor
8.5.2 Removing the Restriction to Rational Weights
8.5.3 Removing the Restriction to Clusters with a Minimum Weight
8.6 Algorithms
8.6.1 A Deterministic Approximation Algorithm (Algorithm 8)
8.6.2 A Randomized Algorithm (Algorithm 9)
9 A Discretization
9.1 Contribution
9.2 Preliminaries
9.3 Basic Construction
9.4 Distances and Costs
9.4.1 Outside the Search Space
9.4.2 Rings
9.4.3 A Point and Its Representative
9.4.4 Replace Means by Their Representatives (K-Means)
9.4.5 Replace Means by Their Representatives (r-Fuzzy K-Means)
9.5 A Discrete Search Space
10 An ε-Approximate Mean Set
10.1 Related Work
10.2 Contribution
10.3 Main Result
10.4 Application
10.5 Analysis
11 Dimension Reduction
11.1 The Johnson-Lindenstrauss Lemma
11.1.1 Related Work
11.1.2 Main Result
11.1.3 Application
11.2 Principal Component Analysis
12 Coresets
12.1 Related Work
12.2 Contribution
12.3 Main Result
12.4 Application
12.5 Analysis
12.5.1 The Key Ideas
12.5.2 Outline of the Analysis
12.5.3 Preliminaries
12.5.4 Weaker Coreset for a Fixed Number of Arbitrary Solutions
12.5.5 Weak Coreset
12.5.6 Size of S and Runtime
12.5.7 These Weak Coresets Are Not Weak
13 Summary & Conclusion
13.1 Review
13.2 Overview of Our Algorithms
13.3 Discussion
13.4 Future Work
III Clustering with Gaussian Mixture Models
14 Introduction
14.1 Gaussian Mixture Models (GMMs)
14.1.1 Density Function
14.1.2 Generating Observations
14.1.3 Remarks
14.2 Likelihood Approach
14.2.1 Likelihood
14.2.2 Likelihood Ratio
14.2.3 Scale Invariance of the Likelihood Ratio
14.2.4 Maximum Likelihood Estimator for K>1
14.2.5 Maximum Likelihood Estimator for K=1
14.2.6 Constrained Maximum Likelihood Estimation
14.2.7 Remarks
14.3 Expectation-Maximization (EM)
14.3.1 General Framework
14.3.2 EM Algorithm for GMMs
14.4 Overview
15 A Non-Asymptotic Comparison of EM and SEM Algorithms
15.1 Introduction
15.2 Scope of Our Comparison
15.3 Related Work
15.4 Contribution
15.5 Theoretical Comparison
15.5.1 A Non-Asymptotic Bound
15.5.2 Special Case: Gaussian Mixture Models (GMMs)
15.6 Some Concrete Examples
15.7 Discussion
16 Adaptive Seeding for Gaussian Mixture Models
16.1 Related Work
16.2 Our Contribution
16.3 Baseline Algorithms
16.4 Adaptive Seeding for GMMs
16.4.1 Choosing the Next Point
16.4.2 Construction of a K-GMM
16.4.3 Post-Processing of the K-GMM
16.4.4 Summary and Comparison
16.5 Evaluation
16.5.1 Preliminaries
16.5.2 Artificial Data Sets
16.5.3 Results: Real-World Data Sets
16.6 Conclusion and Future Work
17 On the Soft K-Means Problem
17.1 Related Work
17.2 Contribution
17.3 The Weighted Soft K-Means Problem
17.3.1 Preliminaries
17.3.2 Problem Statement
17.3.3 Approximation
17.4 A Clustering-Centric Variant
17.4.1 Motivation
17.4.2 A First Clustering-Centric Variant
17.4.3 A Relaxation
17.4.4 A Relaxed Clustering-Centric Approximation Problem
17.5 Towards an Analysis
17.5.1 Applying Our Soft-to-Hard-Cluster Technique
17.5.2 Applying an Algorithm for the Constrained K-Means Problem
17.5.3 Determining the Soft Clustering
17.6 Conclusions
IV Appendix
A Three Handy Lemmata