Timezone: »

On Integrated Clustering and Outlier Detection
Lionel Ott · Linsey Pang · Fabio Ramos · Sanjay Chawla

Mon Dec 08 04:00 PM -- 08:59 PM (PST) @ Level 2, room 210D

We model the joint clustering and outlier detection problem using an extension of the facility location formulation. The advantages of combining clustering and outlier selection include: (i) the resulting clusters tend to be compact and semantically coherent (ii) the clusters are more robust against data perturbations and (iii) the outliers are contextualised by the clusters and more interpretable. We provide a practical subgradient-based algorithm for the problem and also study the theoretical properties of algorithm in terms of approximation and convergence. Extensive evaluation on synthetic and real data sets attest to both the quality and scalability of our proposed method.

Author Information

Lionel Ott (University of Sydney)
Linsey Pang (University of Sydney)
Fabio Ramos (University of Sydney, NVIDIA)
Sanjay Chawla (QCRI)

More from the Same Authors