I am running a hierarchical clustering process in R, using daisy to compute a dissimilarity matrix and agnes for hierarchical clustering, as described in Clustering of mixed type data with R.
With my 8 GB of RAM, I constantly run into this error:
Error: cannot allocate vector of size 1.8 Gb
I have 21836 rows with only 2 variables. I'd like to use more variables, but I am already running out of memory with just 2.
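
As a rough check on where the memory goes, here is a back-of-the-envelope estimate. It assumes the dissimilarity object holds one 8-byte double per pair of rows (the lower triangle of the matrix); the variable names are just for illustration.

```r
# Rough size estimate of the pairwise dissimilarity object.
n <- 21836                    # number of rows in my data set
n_pairs <- n * (n - 1) / 2    # one dissimilarity per pair of rows
bytes <- n_pairs * 8          # assumption: 8 bytes per double
gib <- bytes / 1024^3         # convert bytes to GiB
round(gib, 1)                 # 1.8
```

If that estimate is right, the allocation that fails is a single copy of the dissimilarities, and its size grows with the square of the number of rows rather than with the number of variables.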
Are there any alternative algorithms for a mixed data set of continuous and categorical variables?
Are there any alternative tools (I am currently using R) which would require less memory?