To cluster or not to cluster: that is the question
- From: Stats Wolf <stats.wolf@xxxxxxxxx>
- Date: Fri, 11 Apr 2008 11:12:57 -0700 (PDT)
Hi all,
I am using k-means algorithm, and to detect the optimum number of
clusters, Calinski and Harabasz's C(g) coefficient. Unfortunately,
their approach does not enable me to determine the C(g) for one
cluster (so, for the situation when I actually don't cluster the
sample). If the optimum number of clusters is higher than 2, that's
fine, but when it is actually 2, I can't know whether 2 is indeed the
optimum number of clusters or maybe I should not cluster the sample
whatsoever. What do you think about this?
Thanks in advance,
Wolf
.
- Follow-Ups:
- Re: To cluster or not to cluster: that is the question
- From: Ray Koopman
- Re: To cluster or not to cluster: that is the question
- From: Art Kendall
- Re: To cluster or not to cluster: that is the question
- Prev by Date: Re: Specific comparisons in binomial GLM
- Next by Date: Bayesian Regression - goodness of fit
- Previous by thread: Q: dichotomous variables
- Next by thread: Re: To cluster or not to cluster: that is the question
- Index(es):
Relevant Pages
|