Results 1 -
4 of
4
Formal structure of Sanskrit text: Requirements analysis for a mechanical Sanskrit processor
"... Abstract. We discuss the mathematical structure of various levels of representation of Sanskrit text in order to guide the design of computer aids aiming at useful processing of the digitalised Sanskrit corpus. Two main levels are identified, respectively called the linear and functional level. The ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
Abstract. We discuss the mathematical structure of various levels of representation of Sanskrit text in order to guide the design of computer aids aiming at useful processing of the digitalised Sanskrit corpus. Two main levels are identified, respectively called the linear and functional level. The design space of these two levels is sketched, and the computational implications of the main design choices are discussed. Current solutions to the problems of mechanical segmentation, tagging, and parsing of Sanskrit text are briefly surveyed in this light. An analysis of the requirements of relevant linguistic resources is provided, in view of justifying standards allowing inter-operability of computer tools. This paper does not attempt to provide definitive solutions to the representation of Sanskrit at the various levels. It should rather be considered as a survey of various choices, allowing an open discussion of such issues in a formally precise general framework. 1
Automates, machines, moteurs réactifs
, 2008
"... 1.1 Automate sur un monoïde d’actions Soit S = 〈S, ·, 1 〉 un monoïde de support un ensemble S d’éléments appelés actions, muni d’une opération associative notée · appelée produit et d’un élément 1 neutre à gauche et à droite pour ce produit. On appelle S-automate un tuple 〈Q, I, T, δ 〉 où: – Q est u ..."
Abstract
- Add to MetaCart
1.1 Automate sur un monoïde d’actions Soit S = 〈S, ·, 1 〉 un monoïde de support un ensemble S d’éléments appelés actions, muni d’une opération associative notée · appelée produit et d’un élément 1 neutre à gauche et à droite pour ce produit. On appelle S-automate un tuple 〈Q, I, T, δ 〉 où: – Q est un ensemble fini d’états – I ∈ Q est l’ensemble des états initiaux – T ∈ Q est l’ensemble des états terminaux – δ ∈ Q → ℘(S × Q) est la relation de transition, qui associe à tout état un ensemble fini de paires (a, q) formées d’une action a et d’un état q. On appelle support de l’automate A = 〈Q, I, T, δ 〉 l’ensemble fini d’actions ΦA = {a ∈ S | ∃q, q ′ ∈ Q (a, q ′ ) ∈ δ(q)}. On appelle inverse de l’automate A = 〈Q, I, T, δ 〉 l’automate de même support 〈Q, T, I, δ ′ 〉 tel que δ ′(q′) = {(a, q) | (a, q ′) ∈ δ(q)}, noté Ã. On a bien sûr ˜Ã = A. 1.2 Comportement d’un automate On appelle parcours de l’automate A = 〈Q, I, T, δ 〉 une séquence p = q0 a1 a2 an → q1 →...q2 → qn (n ≥ 0) avec ∀i ≥ 0 qi ∈ Q ∧ (ai+1, qi+1) ∈ δ(qi). On définit l’action associée au parcours p comme act(p) = 1 si n = 0, et act(p) = a1 ·... · an sinon. Le parcours est dit acceptant si q0 ∈ I et qn ∈ T, et on note pa(A) pour l’ensemble des parcours acceptants de A. On appelle comportement de l’automate A l’ensemble |A | = {act(p) | p ∈ pa(A)}. On dit que qn est A-accessible à partir de q0, et que q est A-accessible s’il est A-accessible à partir d’un état initial de A. On dit que q est A-co-accessible s’il est Ā-accessible, et A-utile s’il est A-accessible et A-co-accessible. On dit que A est émondé ssi tous ses états sont utiles. Tout automate peut être réduit en un automate émondé de même comportement. II 1.3 Exemples Le monoïde des actions peut être le monoïde libre Σ ∗ engendré par un alphabet fini Σ. Un Σ ∗-automate est alors la généralisation d’un automate fini non déterministe, où on permet d’étiqueter une transition par un mot arbitraire, et non seulement une lettre. Un cas particulier est celui des automates avec transition spontanée (ɛ-move). On obtient les transducteurs d’un alphabet Σ dans un alphabet Σ ′ en considérant le monoïde (non-libre) produit des monoïdes libres Σ ∗ et Σ ′ ∗. 2
Computing with Relational Machines
, 2008
"... Abstract. We give a quick presentation of the X-machines of Eilenberg, a generalisation of finite state automata suitable for general nondeterministic computation. Such machines complement an automaton, seen as its control component, with a computation component over a data domain specified as an ac ..."
Abstract
- Add to MetaCart
Abstract. We give a quick presentation of the X-machines of Eilenberg, a generalisation of finite state automata suitable for general nondeterministic computation. Such machines complement an automaton, seen as its control component, with a computation component over a data domain specified as an action algebra. Actions are interpreted as binary relations over the data domain, structured by regular expression operations. We show various strategies for the sequential simulation of our relational machines, using variants of the reaction engine. In a particular case of finite machines, we show that bottom-up search yields an efficient complete simulator. Relational machines may be composed in a modular fashion, since atomic actions of one machine may be mapped to the characteristic relation of other relational machines acting as its parameters. The control components of machines is compiled from regular expressions. Several such translations have been proposed in the literature, that we briefly survey. Our view of machines is completely applicative. They may be defined constructively in type theory, where the correctness of their simulation may be formally checked. From formal proofs in the Coq proof assistant, efficient functional programs in the Objective Caml programming language may be mechanically extracted. Most of this material is extracted from the (forthcoming) Ph.D. thesis of Benoît Razet.
Sanskrit Segmentation
"... preprocessing We discuss in this paper the topic of Sanskrit segmentation, that is how to solve by computer software the problem of identifying in a Sanskrit sentence the division of a continuous enunciation into a sequence of discrete word forms. This ..."
Abstract
- Add to MetaCart
preprocessing We discuss in this paper the topic of Sanskrit segmentation, that is how to solve by computer software the problem of identifying in a Sanskrit sentence the division of a continuous enunciation into a sequence of discrete word forms. This

