Dimensionality-reduced subspace clustering

Heckel, Reinhard; Tschannen, Michael; Bölcskei, Helmut

In: Information and Inference: A Journal of the IMA, 2017, vol. 6, no. 3, p. 246-283

Title

Dimensionality-reduced subspace clustering

Authors

Heckel, Reinhard. Department of Information Technology and Electrical Engineering, ETH Zurich, Sternwartstr. 7, CH-8092 Zürich, Switzerland
Tschannen, Michael. Department of Information Technology and Electrical Engineering, ETH Zurich, Sternwartstr. 7, CH-8092 Zürich, Switzerland
Bölcskei, Helmut. Department of Information Technology and Electrical Engineering, ETH Zurich, Sternwartstr. 7, CH-8092 Zürich, Switzerland

Document type

Postprint

Language

English

Published in

Information and Inference: A Journal of the IMA, 2017, vol. 6, no. 3, p. 246-283. Oxford University Press

Other electronic version

Publisher's version: https://doi.org/10.1093/imaiai/iaw021

Keywords

Articles ; subspace clustering ; dimensionality reduction ; random projection ; sparse signal recovery

OAI-PMH identifier

oai:doc.rero.ch:331472

Summary

Subspace clustering refers to the problem of clustering unlabeled high-dimensional data points into a union of low-dimensional linear subspaces, whose number, orientations and dimensions are all unknown. In practice, one may have access to dimensionality-reduced observations of the data only, resulting, e.g., from undersampling due to complexity and speed constraints on the acquisition device or mechanism. More pertinently, even if the high-dimensional data set is available, it is often desirable to first project the data points into a lower-dimensional space and to perform clustering there; this reduces storage requirements and computational cost. The purpose of this article is to quantify the impact of dimensionality reduction through random projection on the performance of three subspace clustering algorithms, all of which are based on principles from sparse signal recovery. Specifically, we analyze the thresholding-based subspace clustering (TSC) algorithm, the sparse subspace clustering (SSC) algorithm and an orthogonal matching pursuit variant thereof (SSC-OMP). We find, for all three algorithms, that dimensionality reduction down to the order of the subspace dimensions is possible without incurring significant performance degradation. Moreover, these results are order-wise optimal in the sense that reducing the dimensionality further leads to a fundamentally ill-posed clustering problem. Our findings carry over to the noisy case as illustrated through analytical results for TSC and simulations for SSC and SSC-OMP. Extensive experiments on synthetic and real data complement our theoretical findings.
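
The setting described in the summary can be illustrated with a minimal sketch. It is not taken from the article and does not implement TSC, SSC or SSC-OMP; it only shows the projection step. A Gaussian projection matrix is assumed here as one common choice of random projection, and all dimensions (`m`, `d`, `p`, `n_points`) and the helper name `gaussian_projection_matrix` are hypothetical, chosen for illustration. The sketch draws points from two low-dimensional subspaces, projects them to a dimension on the order of the subspace dimensions, and checks that each projected cluster still spans a `d`-dimensional subspace.

```python
import numpy as np


def gaussian_projection_matrix(p, m, rng):
    """Return a p x m matrix with i.i.d. Gaussian entries (illustrative choice)."""
    return rng.standard_normal((p, m)) / np.sqrt(p)


rng = np.random.default_rng(0)

# Illustrative sizes (assumptions): ambient dimension m, subspace dimension d,
# reduced dimension p, and number of points per subspace.
m, d, p, n_points = 200, 5, 12, 50

# Orthonormal bases of two random d-dimensional subspaces of R^m.
basis_a, _ = np.linalg.qr(rng.standard_normal((m, d)))
basis_b, _ = np.linalg.qr(rng.standard_normal((m, d)))

# Data points are the columns; each lies in one of the two subspaces.
points_a = basis_a @ rng.standard_normal((d, n_points))
points_b = basis_b @ rng.standard_normal((d, n_points))

# Apply the same random projection to all points.
phi = gaussian_projection_matrix(p, m, rng)
proj_a = phi @ points_a
proj_b = phi @ points_b

# Each projected cluster still lies in a d-dimensional subspace of R^p,
# and the two projected subspaces remain distinct.
rank_a = np.linalg.matrix_rank(proj_a)
rank_b = np.linalg.matrix_rank(proj_b)
rank_union = np.linalg.matrix_rank(np.hstack([proj_a, proj_b]))
print(rank_a, rank_b, rank_union)
```

Any subspace clustering routine would then be run on the columns of `proj_a` and `proj_b` instead of the original `m`-dimensional points, which is where the savings in storage and computation come from.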
