Abstract
Overfitting the training data is a major problem in machine learning, particularly when noise is present. Overfitting increases learning time and reduces both the accuracy and the comprehensibility of the generated rules, making learning from large data sets more difficult. Pruning is a technique widely used to address such problems and consequently forms an essential component of practical learning algorithms. An important class of pruning techniques is based on the minimum description length (MDL) principle. This paper presents three new techniques that use the MDL principle to prune rule sets. An important advantage of these techniques is that all of the training data can be used both for inducing and for evaluating rule sets. The performance of the techniques is evaluated using three criteria: classification accuracy, rule set complexity, and execution time. The evaluation shows that the new techniques, when incorporated into a rule induction algorithm, are more efficient and lead to accurate rule sets that are significantly smaller than the unpruned rule sets.
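To illustrate the general idea behind MDL-based pruning (this is a generic sketch, not the paper's specific techniques): a rule set is preferred when the total cost of encoding the rules plus the cost of encoding its errors on the training data is smaller. The function names and the fixed bits-per-condition cost below are illustrative assumptions.

```python
import math

def description_length(num_conditions, num_examples, num_errors,
                       bits_per_condition=8.0):
    """Total cost in bits: the model cost (rules) plus the cost of
    identifying which training examples the rule set misclassifies."""
    model_cost = num_conditions * bits_per_condition
    # Cost of specifying the error positions: log2 of the number of
    # ways to choose num_errors examples out of num_examples.
    exception_cost = math.log2(math.comb(num_examples, num_errors))
    return model_cost + exception_cost

def prefer_pruned(full, pruned):
    """MDL preference: keep the pruned rule set when its total
    description length is no larger than the full set's."""
    return description_length(*pruned) <= description_length(*full)

# Toy comparison: a 12-condition set making 2 errors on 100 examples
# versus a 5-condition pruned set making 5 errors.
full = (12, 100, 2)     # (conditions, examples, errors)
pruned = (5, 100, 5)
print(prefer_pruned(full, pruned))  # the simpler set wins here
```

Because the same training data supplies both the model cost and the exception cost, no separate pruning set needs to be held out, which is the advantage the abstract highlights.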
