which attribute selection measure is best in data mining

Last modified on April 17th, 2020

In this tutorial, we will learn about the followings;

Information Gain.

Gain Ratio.

Gini index.

Attribute Selection Measure

There are different attribute selection measures. Some of them are as follows;

  1. Information Gain
  2. Gain ratio
  3. Gini index
Information gain Gain ratio Gini index
Biased towards the multi-valued attribute. Unbalanced splits.

One partition is much smaller than other partition.

Biased towards the multi-valued attribute.
    Difficult to manage a large number of classes.
    Partitions are equal.

Next Similar Tutorials

  1. Decision tree induction on categorical attributes  – Click Here
  2. Decision Tree Induction and Entropy in data mining – Click Here
  3. Overfitting of decision tree and tree pruning – Click Here
  4. Attribute selection Measures – Click Here
  5. Computing Information-Gain for Continuous-Valued Attributes in data mining – Click Here
  6. Gini index for binary variables – Click Here
  7. Bagging and Bootstrap in Data Mining, Machine Learning – Click Here
  8. Evaluation of a classifier by confusion matrix in data mining – Click Here
  9. Holdout method for evaluating a classifier in data mining – Click Here
  10. RainForest Algorithm / Framework – Click Here
  11. Boosting in data mining – Click Here
  12. Naive Bayes Classifier  – Click Here

 

Subscribe for Friendship

Latest posts by Prof. Fazal Rehman Shamil (see all)