Clustering in high-dimensional spaces presents unique challenges arising from the so-called “curse of dimensionality”, where the volume of the feature space grows exponentially and distances between ...
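As a rough illustration of how distances behave as dimensionality grows, the sketch below measures the relative contrast between the nearest and farthest of a set of random points, as seen from one query point. It is a minimal sketch, not taken from the article above: the function name `relative_contrast`, the uniform random data, the point count, and the chosen dimensions are all assumptions made for the demonstration.

```python
import math
import random

def relative_contrast(dim, n_points=200, seed=0):
    """Return (max - min) / min over distances from one query point
    to n_points uniform random points in the unit hypercube.
    Hypothetical helper for illustration only."""
    rng = random.Random(seed)
    query = [rng.random() for _ in range(dim)]
    dists = []
    for _ in range(n_points):
        point = [rng.random() for _ in range(dim)]
        dists.append(math.dist(query, point))  # Euclidean distance
    return (max(dists) - min(dists)) / min(dists)

# Dimensions chosen arbitrarily for the demonstration.
contrasts = {d: relative_contrast(d) for d in (2, 10, 100, 1000)}
for d, c in contrasts.items():
    print(f"dim={d:5d}  relative contrast={c:.3f}")
```

On uniform random data like this, the contrast should shrink as the dimension grows, meaning the nearest and farthest points end up at nearly the same distance. Real datasets with cluster structure may behave less severely.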
Dr. James McCaffrey of Microsoft Research presents a full-code, step-by-step tutorial on a technique for visualizing and clustering data. A self-organizing map (SOM) is a data structure that can be used ...
Clustering non-numeric -- or categorical -- data is surprisingly difficult, but it's explained here by resident data scientist Dr. James McCaffrey of Microsoft Research, who provides all the code you ...