Improving Categorical Data Clustering Algorithm by Weighting Uncommon Attribute Value Matches


Zengyou He, Xiaofei Xu, Shenchun Deng




This paper presents an improved Squeezer algorithm for categorical data clustering by giving greater weight to uncommon attribute value matches in similarity computations. Experimental results on real life datasets show that, the modified algorithm is superior to the original Squeezer algorithm and other clustering algorithm with respect to clustering accuracy.