
Computing Information-Gain for Continuous-Valued Attributes in data mining

Last modified on December 9th, 2018 at 9:16 pm

In this tutorial, we will learn how to compute information gain for continuous-valued attributes.

Calculating a split point is not difficult. As an example, consider the data below, where Income is a continuous-valued attribute and Class is the label.

How can we calculate the split point?

Income  Class
18      YES
45      NO
18      NO
25      YES
28      YES
28      NO
34      NO

Solution: calculating the split point

Step 1:

First, sort the data in ascending order of Income. The sorted data is shown in the table below.

Income  Class
18      YES
18      NO
25      YES
28      YES
28      NO
34      NO
45      NO
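
The sorting step can be sketched in Python as follows. This is a minimal illustration, assuming the records are held as (income, class) pairs; the names `records` and `sorted_records` are hypothetical and not part of the original tutorial.

```python
# The example records as (income, class) pairs, in their original order.
records = [
    (18, "YES"), (45, "NO"), (18, "NO"), (25, "YES"),
    (28, "YES"), (28, "NO"), (34, "NO"),
]

# Sort by income only. Python's sort is stable, so records with the
# same income keep their original relative order.
sorted_records = sorted(records, key=lambda r: r[0])

for income, label in sorted_records:
    print(income, label)
```

Sorting on the income alone, rather than on the whole pair, keeps tied rows such as the two records with income 18 in the order they appeared in the input.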

Step 2:

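Find the midpoint of the first two distinct values (18 and 25) and calculate the information gain for that split.

Split point = (18 + 25) / 2 = 21.5

Two of the seven records have Income at or below 21.5 (1 YES, 1 NO) and five have Income above it (2 YES, 3 NO), so the expected information for this split is:

$$\mathrm{Info}_{Income \le 21.5}(D) = \frac{2}{7}\, I(1,1) + \frac{5}{7}\, I(2,3)$$

$$= \frac{2}{7}\left(-\frac{1}{2}\log_2\frac{1}{2} - \frac{1}{2}\log_2\frac{1}{2}\right) + \frac{5}{7}\left(-\frac{2}{5}\log_2\frac{2}{5} - \frac{3}{5}\log_2\frac{3}{5}\right)$$

$$= 0.979 \approx 0.98$$

The information gain is the entropy of the whole data set (3 YES, 4 NO) minus this value:

$$\mathrm{Gain} = I(3,4) - \mathrm{Info}_{Income \le 21.5}(D) \approx 0.985 - 0.979 = 0.006$$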
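
The calculation in Step 2 can be checked with a short Python sketch. It assumes the records are (income, class) pairs and that a record goes to the left partition when its income is at or below the split point; the helper names `entropy` and `class_counts` are hypothetical, introduced only for this illustration.

```python
from math import log2

def entropy(counts):
    """I(c1, c2, ...): entropy in bits of a list of class counts."""
    total = sum(counts)
    return -sum(c / total * log2(c / total) for c in counts if c > 0)

def class_counts(rows):
    """Return [number of YES, number of NO] for a list of records."""
    yes = sum(1 for _, label in rows if label == "YES")
    return [yes, len(rows) - yes]

records = [
    (18, "YES"), (18, "NO"), (25, "YES"),
    (28, "YES"), (28, "NO"), (34, "NO"), (45, "NO"),
]

# Midpoint of the first two distinct income values: (18 + 25) / 2.
values = sorted({income for income, _ in records})
split = (values[0] + values[1]) / 2

left = [r for r in records if r[0] <= split]   # 1 YES, 1 NO
right = [r for r in records if r[0] > split]   # 2 YES, 3 NO

# Expected information after the split (weighted entropy of the parts).
n = len(records)
info_split = (len(left) / n) * entropy(class_counts(left)) \
           + (len(right) / n) * entropy(class_counts(right))

# Information gain: entropy of the whole set minus info_split.
gain = entropy(class_counts(records)) - info_split

print(split)                 # 21.5
print(round(info_split, 2))  # 0.98
print(round(gain, 3))        # 0.006
```

The same steps can be repeated for the midpoint between each remaining pair of adjacent distinct values to compare their gains.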

Prof. Fazal Rehman Shamil
Researcher, Publisher of International Journal Of Software Technology & Science ISSN: 2616-5325
Instructor, SEO Expert, Web Programmer and poet.