General and Efficient Multisplitting of Numerical Attributes (1999)
Cached
Download Links
- [www.cs.helsinki.fi]
- [www.cs.helsinki.fi]
- [www.cs.helsinki.fi]
- [www.cs.tut.fi]
- DBLP
Other Repositories/Bibliography
| Citations: | 31 - 7 self |
BibTeX
@MISC{Elomaa99generaland,
author = {Tapio Elomaa and Juho Rousu and C. Holte},
title = {General and Efficient Multisplitting of Numerical Attributes},
year = {1999}
}
Years of Citing Articles
OpenURL
Abstract
. Often in supervised learning numerical attributes require special treatment and do not fit the learning scheme as well as one could hope. Nevertheless, they are common in practical tasks and, therefore, need to be taken into account. We characterize the well-behavedness of an evaluation function, a property that guarantees the optimal multi-partition of an arbitrary numerical domain to be defined on boundary points. Well-behavedness reduces the number of candidate cut points that need to be examined in multisplitting numerical attributes. Many commonly used attribute evaluation functions possess this property; we demonstrate that the cumulative functions Information Gain and Training Set Error as well as the non-cumulative functions Gain Ratio and Normalized Distance Measure are all well-behaved. We also devise a method of finding optimal multisplits efficiently by examining the minimum number of boundary point combinations that is required to produce partitions which are optimal wit...







