• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

A formal theory for optimal and information theoretic syntactic pattern recognition (1998)

by B J Oommen, R L Kashyap
Venue:Pattern Recognition
Add To MetaCart

Tools

Sorted by:
Results 1 - 3 of 3

Optimal and Information Theoretic Syntactic Pattern Recognition for Traditional Errors

by B. J. Oommen, R. L. Kashyap - In Advances in Structural and Syntactic Pattern Recognition , 1996
"... In this paper we present a foundational basis for optimal and information theoretic syntactic pattern recognition. We do this by developing a rigorous model, M * , for channels which permit arbitrarily distributed substitution, deletion and insertion syntactic errors. More explicitly, if A is any ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
In this paper we present a foundational basis for optimal and information theoretic syntactic pattern recognition. We do this by developing a rigorous model, M * , for channels which permit arbitrarily distributed substitution, deletion and insertion syntactic errors. More explicitly, if A is any finite alphabet and A * the set of words over A, we specify a stochastically consistent scheme by which a string U A * can be transformed into any Y A * by means of arbitrarily distributed substitution, deletion and insertion operations. The scheme is shown to be Functionally Complete and stochastically consistent. Apart from the synthesis aspects, we also deal with the analysis of such a model and derive a technique by which Pr[Y|U], the probability of receiving Y given that U was transmitted, can be computed in cubic time using dynamic programming. Experimental results which involve dictionaries with strings of lengths between 7 and 14 with an overall average noise of 39.75 % demons...

Generalized Syntactic and Semantic Models of Query Reformulation

by Maria Holmqvist, Massimiliano Ciaramita, Daniel Mahler, Stefan Riezler
"... We present a novel approach to query reformulation which combines syntactic and semantic information by means of generalized Levenshtein distance algorithms where the substitution operation costs are based on probabilistic term rewrite functions. We investigate unsupervised, compact and efficient mo ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
We present a novel approach to query reformulation which combines syntactic and semantic information by means of generalized Levenshtein distance algorithms where the substitution operation costs are based on probabilistic term rewrite functions. We investigate unsupervised, compact and efficient models, and provide empirical evidence of their effectiveness. We further explore a generative model of query reformulation and supervised combination methods providing improved performance at variable computational costs. Among other desirable properties, our similarity measures incorporate information-theoretic interpretations of taxonomic relations such as specification and generalization.

AI-Based Syntactic Pattern Recognition of Sequences

by Inventors Ghada Badr, B. John Oommen
"... This patent concerns the traditional problem encountered in the syntactic Pattern Recognition (PR) of strings or sequences. The primary investigator 1 involved in this work is a Full Professor at Carleton University in Ottawa, Canada, and is a Fellow of the IEEE. The primary problem solved by the in ..."
Abstract - Add to MetaCart
This patent concerns the traditional problem encountered in the syntactic Pattern Recognition (PR) of strings or sequences. The primary investigator 1 involved in this work is a Full Professor at Carleton University in Ottawa, Canada, and is a Fellow of the IEEE. The primary problem solved by the invention involves determining the string or sequence that is most similar to a sequence presented to the system. The search could be initiated by presenting, to the system, a noisy or inexact version of a string contained in memory-for example, at a web-site or in the library or database. The invention will yield the closest string/sequence by searching the dictionary of possible words using a newly invented AIbased strategy. The core of this invention is this search strategy, called the Clustered Beam Search. Experiments have been done to show the benefits of the CBS over the current state-ofthe-art, and the results demonstrate an unbelievably marked improvement (sometimes as high as 90%) for large libraries and databases. The solution provided by the invention would be applicable in numerous areas including: Inexact or proximity searching on the Internet, keyword-based search in libraries and databases, spelling correction, speech and character recognition (including optical character recognition), and the processing of biological sequences, for example, in human genome projects. These applications are briefly described below.
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University