@MISC{Estivill-Castro_externalsorting, author = {Vladimir Estivill-Castro and Derick Wood}, title = {External Sorting and Nearly Sortedness}, year = {} }
Bookmark
OpenURL
Abstract
The availability of large main memories and the new technologies for disk drives have modified the models for external sorting and have renewed interest in their study. Little is known about the performance of traditional and more recent sorting methods on nearly sorted files although such files are common in practice. ffl We confirm mathematically that the lengths of the runs created by replacement selection during the first phase of external sorting increases as the order in the input file increases. Previous work has concentrated on the expected length of initial runs when all input files are equally likely to occur. It has long been accepted that when an input file has little disorder, the lengths of the generated runs will be long. We establish such results for two measures of disorder, namely, the number of ascending runs and the maximal distance between inversions. ffl We demonstrate that, during the merging phase, the floating-buffers technique not only reduces the sorting ti...