Oops! It appears that you have disabled your Javascript. In order for you to see this page as it is meant to appear, we ask that you please re-enable your Javascript!

Distance measure for asymmetric binary attributes in data mining

Last modified on December 9th, 2018 at 9:17 pm

How to calculate proximity measure for asymmetric binary attributes?

In this tutorial, we will learn about the proximity measure for asymmetric binary attributes

Contingency table for binary data

Here in this example, consider 1 for positive/True  and 0 for negative/False.

Object 2
Object 1 1 / True / Positive0 / False / NegativeSum
1 / True / PositiveABA + B
0 / False / NegativeCDC + D
SumA + CB + D 

In table 1 we can consider the following facts.

A represents that object 1 is True and object 2 is also True.

B represents that object 1 is True and object 2 is also False.

C represents that object 1 is False and object 2 is also True.

D represents that object 1 is False and object 2 is also False.

NameFeverCoughTest 1Test 2Test 3Test 4
AsadNegativeYes Negative Positive Negative Negative
BilalNegative Yes NegativePositive Positive Negative
TahirPositiveYes Negative Negative Negative Negative

In table 2, Asad, Bilal and Tahir are objects. Negative values represents False and Positive represents Negative.

dissimilarity of binary variables

In the results, we can see the following facts;

The distance between object 1 and 2 is 0.67. Asad is object 1 and Tahir is in object 2 and the distance between both is 0.67.

Less distance is between Asad and Bilal. It means that Asad and Bilal are more similar to each other as compared to other objects.

Prof. Fazal Rehman Shamil
Researcher, Publisher of International Journal Of Software Technology & Science ISSN: 2616-5325
Instructor, SEO Expert, Web Programmer and poet.
Feel free to contact.