Skip to main content
Search type Search syntax
Tags [tag]
Exact "words here"
Author user:1234
user:me (yours)
Score score:3 (3+)
score:0 (none)
Answers answers:3 (3+)
answers:0 (none)
isaccepted:yes
hasaccepted:no
inquestion:1234
Views views:250
Code code:"if (foo != bar)"
Sections title:apples
body:"apples oranges"
URL url:"*.example.com"
Saves in:saves
Status closed:yes
duplicate:no
migrated:no
wiki:no
Types is:question
is:answer
Exclude -[tag]
-apples
For more details on advanced search visit our help page
Results tagged with
Search options answers only not deleted user 116440

Hierarchical cluster analysis is a method of cluster analysis which builds, by steps, a hierarchy of clusters, a dendrogram. Most popular is agglomerative hierarchical clustering (HAC) which starts from individual objects and collects them into bigger and bigger clusters.

1 vote
Accepted

Is there any effective way to tell difference between clusters

You can think of the cluster assigned to each point as a class label. This will let you address things the same way you would for a classification problem: "what are the features that distinguish thes …
user20160's user avatar
  • 33.9k
1 vote

Agglomerative clustering with must-link constraints

In the usual, uncontrained case, hierarchical agglomerative clustering starts with every point assigned to its own cluster. On each iteration, we select two clusters to merge, according to some chosen …
user20160's user avatar
  • 33.9k
1 vote
Accepted

What are some clustering algorithms in which I can define no of clusters I require?

A common approach with k-means is to run multiple times with different initializations, then pick the best. Some other options that let you pick the number of clusters: Gaussian mixture models, agglom …
user20160's user avatar
  • 33.9k
2 votes

Clustering sets of vectors

Hierarchical agglomerative clustering might work for you. It typically starts with each data point in its own cluster, then iteratively merges pairs of clusters to form larger and larger clusters. Sin …
user20160's user avatar
  • 33.9k
3 votes

Difference between languages (spoken)?

The simplest option might be to ignore distances between languages themselves. You could construct a vector for each neighborhood, where each element corresponds to a language, and the its value corre …
user20160's user avatar
  • 33.9k
1 vote
Accepted

Does it make sense to cluster based on Euclidean distances between rows of a cosine matrix?

It sounds like you're doing the following: Given $n$ data points $\{X_1, ..., X_n\}$, calculate a pairwise cosine distance matrix $C$, where $C_{ij}$ is the cosine distance between points $i$ and $j …
user20160's user avatar
  • 33.9k