## A Fast Algorithm for Finding the Nearest Neighbor of a Word in a Dictionary (1993)

Venue: In Proc. 2nd Int. Conference on Document Analysis and Recognition ICDAR’93

Citations: 6 (1 self)

### BibTeX

@INPROCEEDINGS{Bunke93afast,

author = {Horst Bunke},

title = {A Fast Algorithm for Finding the Nearest Neighbor of a Word in a Dictionary},

booktitle = {Proc. 2nd Int. Conference on Document Analysis and Recognition ICDAR’93},

year = {1993},

pages = {632--637}

}

### Abstract

In this paper a new algorithm for string edit distance computation is proposed. It is based on the classical approach [11]. However, while in [11] the two strings to be compared may be given online, our algorithm assumes that one of the two strings to be compared is a dictionary entry that is known a priori. This dictionary word is converted, in an off-line phase to be carried out beforehand, into a special type of deterministic finite state automaton. Now, given an input string corresponding to a word with possible OCR errors and the automaton derived from the dictionary word, the computation of the edit distance between the two strings corresponds to a traversal of the states of the automaton. This procedure needs time which is only linear in the length of the OCR word. It is independent of the length of the dictionary word. Given not only one but N different dictionary words, their corresponding automata can be combined into a single deterministic finite state automaton. Thus the co...
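The idea in the abstract can be illustrated with a small sketch. It is not the construction from the paper, whose details are not given here. It assumes unit edit costs and uses a well-known property of the classical dynamic-programming table: adjacent entries in a row differ by -1, 0, or 1, so each row can be encoded as a finite vector of differences and used as an automaton state. The names `edit_distance`, `build_automaton`, `automaton_distance`, and `OTHER` are hypothetical and chosen only for this example.

```python
from collections import deque

OTHER = None  # stands for any character that does not occur in the dictionary word


def edit_distance(text, word):
    """Classical dynamic-programming edit distance with unit costs (baseline)."""
    prev = list(range(len(word) + 1))
    for i, ct in enumerate(text, start=1):
        cur = [i]
        for j, cw in enumerate(word, start=1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ct != cw)))
        prev = cur
    return prev[-1]


def build_automaton(word):
    """Off-line phase: turn one dictionary word into a deterministic automaton.

    A state is the tuple of differences between adjacent entries of a DP row.
    Characters not in `word` all behave alike, so one OTHER symbol covers them.
    """
    m = len(word)
    symbols = sorted(set(word)) + [OTHER]
    start = (1,) * m              # row 0 is [0, 1, ..., m]
    ids = {start: 0}
    transitions = []              # transitions[state_id][symbol] -> next state_id
    offsets = []                  # offsets[state_id] = (last row entry) - (first row entry)
    queue = deque([start])
    while queue:
        diffs = queue.popleft()
        row = [0]                 # rebuild the row relative to its first entry
        for d in diffs:
            row.append(row[-1] + d)
        table = {}
        for sym in symbols:
            cur = [1]             # first entry grows by one per input character
            for j in range(1, m + 1):
                cost = 0 if word[j - 1] == sym else 1
                cur.append(min(row[j] + 1, cur[j - 1] + 1, row[j - 1] + cost))
            nxt = tuple(cur[j] - cur[j - 1] for j in range(1, m + 1))
            if nxt not in ids:
                ids[nxt] = len(ids)
                queue.append(nxt)
            table[sym] = ids[nxt]
        transitions.append(table)
        offsets.append(sum(diffs))
    return transitions, offsets


def automaton_distance(automaton, text):
    """On-line phase: one table lookup per input character."""
    transitions, offsets = automaton
    state = 0
    for ch in text:
        table = transitions[state]
        state = table.get(ch, table[OTHER])
    return len(text) + offsets[state]
```

In this sketch the on-line loop does constant work per input character regardless of the dictionary word's length, which mirrors the linear-time claim in the abstract. The price is paid off-line: the number of states is bounded by 3^m for a word of length m, so the automaton can become large for long words.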
