Jaccard coefficient similarity measure for asymmetric binary variables

jaccard similarity in statistics

How to calculate the similarity of an asymmetric binary variable using Jaccard coefficient?

There are many methods to calculate the similarity of data. Jaccard coefficient is one of them.

Jaccard coefficient is used to calculate the similarity among asymmetric binary attributes. 

Contingency table for binary data:

Object 2
Object 1 1 / True / Positive0 / False / NegativeSum
1 / True / PositiveABA + B
0 / False / NegativeCDC + D
SumA + CB + D 

In table 1 we can consider the following facts.

A represents that object 1 is True and object 2 is also True.

B represents that object 1 is True and object 2 is  False.

C represents that object 1 is False and object 2 is  True.

D represents that object 1 is False and object 2 is also False.

NameFeverCoughTest 1Test 2Test 3Test 4
AsadNegativeYes Negative Positive Negative Negative
BilalNegative Yes NegativePositive Positive Negative
TahirPositiveYes Negative Negative Negative Negative

In table 2, Asad, Bilal and Tahir are objects. Negative values represents False and Positive represents Negative.

Consider 1 for positive/True  and 0 for negative/False.

jaccard similarity in statistics

Similarly, we can calculate the similarity of one object with each other object.

Video Lecture

Next Similar Tutorials

  1. Proximity Measure for Nominal Attributes – Click Here
  2. Distance measure for asymmetric binary attributes – Click Here
  3. Distance measure for symmetric binary variables – Click Here
  4. Euclidean distance in data mining – Click Here Euclidean distance Excel file – Click Here
  5. Jaccard coefficient similarity measure for asymmetric binary variables – Click Here
By:Prof. Fazal Rehman Shamil
CEO @ T4Tutorials
Last Modified: May 8, 2020

Leave a Reply

Your email address will not be published. Required fields are marked *