Data discretization and its techniques in data mining

Data discretization converts a large number of data values into a smaller number of intervals or categories, so that data evaluation and data management become easier.


For example, suppose we have an attribute Age with the following values.

Age    10, 11, 13, 14, 17, 19, 30, 31, 32, 38, 40, 42, 70, 72, 73, 75

Table: Before discretization

Age    10, 11, 13, 14, 17, 19    30, 31, 32, 38, 40, 42    70, 72, 73, 75
Label  Young                     Mature                    Old

Table: How to discretize

Age    Young    Mature    Old

Table: After discretization
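
The age example above can be sketched in a few lines of Python. The text does not state the exact boundaries between the groups, so the cut points 20 and 60 below are assumptions chosen only to be consistent with the table. The names CUT_POINTS, LABELS and discretize_age are illustrative.

```python
from bisect import bisect_right

ages = [10, 11, 13, 14, 17, 19, 30, 31, 32, 38, 40, 42, 70, 72, 73, 75]

# Assumed cut points (not given in the text): ages below 20 are "Young",
# ages from 20 to 59 are "Mature", and ages 60 and above are "Old".
CUT_POINTS = [20, 60]
LABELS = ["Young", "Mature", "Old"]


def discretize_age(age):
    """Return the label of the interval that contains `age`."""
    # bisect_right counts how many cut points are <= age,
    # which is exactly the index of the matching label.
    return LABELS[bisect_right(CUT_POINTS, age)]


labels = [discretize_age(a) for a in ages]
for age, label in zip(ages, labels):
    print(age, label)
```

With these assumed cut points, the first six ages are labelled Young, the next six Mature, and the last four Old, which matches the table. Different cut points would give a different discretization of the same data.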

Another example is website visitor data. As seen in the figure below, the data is discretized by country.

Figure: Data discretization techniques

What are some well-known techniques of data discretization?

  1. Histogram analysis
  2. Binning
  3. Correlation analysis
  4. Clustering analysis
  5. Decision tree analysis
  6. Equal width partitioning
  7. Equal depth partitioning
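
Two of the techniques listed above, equal width and equal depth partitioning, can be illustrated on the same age values. The following is a minimal sketch in plain Python, not a reference implementation. The function names and the choice of 4 bins are assumptions made for the example.

```python
ages = [10, 11, 13, 14, 17, 19, 30, 31, 32, 38, 40, 42, 70, 72, 73, 75]


def equal_width_bins(values, k):
    """Split the range [min, max] into k intervals of equal width."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so they all fall into the first bin.
        return [list(values)] + [[] for _ in range(k - 1)]
    width = (hi - lo) / k
    bins = [[] for _ in range(k)]
    for v in values:
        # The maximum value would compute to index k, so clamp it to the last bin.
        idx = min(int((v - lo) / width), k - 1)
        bins[idx].append(v)
    return bins


def equal_depth_bins(values, k):
    """Split the sorted values into k bins holding nearly the same count."""
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[i * n // k:(i + 1) * n // k] for i in range(k)]


width_bins = equal_width_bins(ages, 4)
depth_bins = equal_depth_bins(ages, 4)

print("Equal width:", width_bins)
print("Equal depth:", depth_bins)
```

On this data, equal width partitioning leaves one interval empty because no ages fall between roughly 42 and 59, while equal depth partitioning places four values in every bin. This shows the usual trade-off: equal width keeps interval sizes uniform, and equal depth keeps bin counts uniform.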

By: Prof. Fazal Rehman Shamil
CEO @ T4Tutorials
Last Modified: April 17, 2020

