I know of the Jaccard index and the Sørensen-Dice coefficient for computing set similarity, but have been unable to find any other algorithms related to set similarity. This site contains quite a few resources for vector similarity, but that's not what I want.

What other set-similarity measures exist?

kjetil b halvorsen
blacksite
  • You can treat being in a set as a variable (or being in the first set as a classifier and being in the second as the concept). Using this reduction, you can apply all the supervised learning metrics: accuracy, mutual information, etc. – DaL Jun 15 '17 at 07:14
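
A minimal sketch of the reduction described in this comment, assuming a finite universe of candidate elements is known (the comment does not say how the universe is chosen, and the function name `membership_accuracy` is hypothetical). Membership in each set becomes a binary label per element, and accuracy is the fraction of elements on which the two labels agree:

```python
def membership_accuracy(a, b, universe):
    """Fraction of elements in `universe` on which membership in a and b agrees.

    Assumption: `universe` is a finite, non-empty collection containing
    every element that could belong to either set.
    """
    a, b, universe = set(a), set(b), set(universe)
    if not universe:
        raise ValueError("universe must be non-empty")
    # An element counts as agreement if it is in both sets or in neither.
    agree = sum((x in a) == (x in b) for x in universe)
    return agree / len(universe)


A = {1, 2, 3, 4}
B = {3, 4, 5}
U = range(1, 11)  # assumed universe: the integers 1..10

print(membership_accuracy(A, B, U))  # 0.7 (the sets disagree on 1, 2 and 5)
```

Unlike the Jaccard index, this score counts elements absent from both sets as agreement, so it depends on the choice of universe.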

1 Answer

Other measures are:

  • Overlap coefficient: $\frac{|A \cap B|}{\min(|A|,|B|)}$
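  • Tversky index: $\frac{|A \cap B|}{|A \cap B| + \alpha|A \setminus B| + \beta|B \setminus A|}$, where $\alpha$ and $\beta$ are non-negative weights. Setting $\alpha = \beta = 1$ gives the Jaccard index, and $\alpha = \beta = 0.5$ gives the Sørensen-Dice coefficient.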
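
A minimal sketch of both measures in plain Python, using the standard ratio form of the Tversky index, $|A \cap B|$ divided by $|A \cap B| + \alpha|A \setminus B| + \beta|B \setminus A|$. The function names and the convention of returning 0.0 when a denominator is zero are assumptions made for illustration:

```python
def overlap_coefficient(a, b):
    """|A ∩ B| / min(|A|, |B|).

    Assumption: returns 0.0 if either set is empty (the ratio is undefined).
    """
    a, b = set(a), set(b)
    if not a or not b:
        return 0.0
    return len(a & b) / min(len(a), len(b))


def tversky_index(a, b, alpha=1.0, beta=1.0):
    """|A ∩ B| / (|A ∩ B| + alpha * |A \\ B| + beta * |B \\ A|).

    Assumption: returns 0.0 if the denominator is zero.
    """
    a, b = set(a), set(b)
    common = len(a & b)
    denom = common + alpha * len(a - b) + beta * len(b - a)
    if denom == 0:
        return 0.0
    return common / denom


A = {1, 2, 3, 4}
B = {3, 4, 5}

print(overlap_coefficient(A, B))      # 2 / 3
print(tversky_index(A, B, 1.0, 1.0))  # equals the Jaccard index, 2 / 5
print(tversky_index(A, B, 0.5, 0.5))  # equals the Dice coefficient, 4 / 7
```

With unequal weights the Tversky index is asymmetric, which can be useful when one set acts as a reference and the other as a candidate.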
Borhan Kazimipour