Data discretization and its techniques in data mining

What is data discretization?

Data discretization maps a large number of data values onto a smaller number of intervals or labels, so that data evaluation and data management become easier.

Example:

We have an attribute age with the following values.

Age  10, 11, 13, 14, 17, 19, 30, 31, 32, 38, 40, 42, 70, 72, 73, 75

Table: Before discretization

 

Age  10, 11, 13, 14, 17, 19    30, 31, 32, 38, 40, 42    70, 72, 73, 75
     Young                     Mature                    Old

Table: How to discretize

 

Age  Young    Mature    Old

Table: After discretization
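
The mapping in the tables above can be sketched in a few lines of Python. The example does not state where one group ends and the next begins, so the cut points 30 and 70 below are assumptions chosen to reproduce the three groups shown; the names CUT_POINTS, LABELS and discretize are hypothetical.

```python
from bisect import bisect_right

ages = [10, 11, 13, 14, 17, 19, 30, 31, 32, 38, 40, 42, 70, 72, 73, 75]

# Assumed cut points (not given in the example):
# below 30 is Young, 30 to 69 is Mature, 70 and above is Old.
CUT_POINTS = [30, 70]
LABELS = ["Young", "Mature", "Old"]


def discretize(age):
    """Return the label of the interval that contains the given age."""
    # bisect_right counts how many cut points are <= age,
    # which is exactly the index of the matching label.
    return LABELS[bisect_right(CUT_POINTS, age)]


labels = [discretize(a) for a in ages]

for age, label in zip(ages, labels):
    print(age, label)
```

After this step the 16 distinct ages are represented by only three labels, which is the "after discretization" state of the table.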

What are some well-known techniques of data discretization?

  1. Histogram analysis
  2. Binning
  3. Correlation analysis
  4. Clustering analysis
  5. Decision tree analysis
  6. Equal width partitioning
  7. Equal depth partitioning
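
To make the last two items concrete, here is a minimal sketch of equal width and equal depth partitioning applied to the age values from the example. It is only an illustration under stated assumptions: the function names are hypothetical, the number of bins k = 4 is an arbitrary choice, and real tools may handle bin edges differently.

```python
ages = [10, 11, 13, 14, 17, 19, 30, 31, 32, 38, 40, 42, 70, 72, 73, 75]


def equal_width_bins(values, k):
    """Split the value range [min, max] into k intervals of the same width."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / k
    bins = [[] for _ in range(k)]
    for v in values:
        if width == 0:
            idx = 0  # all values identical: everything goes in the first bin
        else:
            # The maximum value would fall just past the last interval,
            # so clamp its index into the last bin.
            idx = min(int((v - lo) / width), k - 1)
        bins[idx].append(v)
    return bins


def equal_depth_bins(values, k):
    """Split the sorted values into k bins holding (almost) the same count."""
    ordered = sorted(values)
    n = len(ordered)
    # Integer boundaries i * n // k spread any remainder across the bins.
    return [ordered[i * n // k:(i + 1) * n // k] for i in range(k)]


width_bins = equal_width_bins(ages, 4)
depth_bins = equal_depth_bins(ages, 4)

print("equal width:", width_bins)
print("equal depth:", depth_bins)
```

With k = 4, equal width partitioning uses intervals of width 16.25 and leaves the third interval empty because no age falls between 42.5 and 58.75, while equal depth partitioning puts exactly four values in every bin. This is the usual trade-off between the two: equal width keeps the intervals uniform, equal depth keeps the counts uniform.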