Results 1  10
of
14
The Net for the Graphs – Towards Webgenre Representation for Corpus Linguistic Studies
 WaCky! Working Papers on the Web as Corpus
, 2006
"... corpus linguistic research (Baroni and Bernardini 2004; Keller and ..."
Abstract

Cited by 7 (3 self)
 Add to MetaCart
(Show Context)
corpus linguistic research (Baroni and Bernardini 2004; Keller and
ENTROPY BOUNDS FOR HIERARCHICAL MOLECULAR NETWORKS
, 2008
"... In this paper we derive entropy bounds for hierarchical networks. More precisely, starting from a recently introduced measure to determine the topological entropy of nonhierarchical networks, we provide bounds for estimating the entropy of hierarchical graphs. Apart from bounds to estimate the en ..."
Abstract

Cited by 4 (3 self)
 Add to MetaCart
(Show Context)
In this paper we derive entropy bounds for hierarchical networks. More precisely, starting from a recently introduced measure to determine the topological entropy of nonhierarchical networks, we provide bounds for estimating the entropy of hierarchical graphs. Apart from bounds to estimate the entropy of a single hierarchical graph, we see that the derived bounds can also be used for characterizing graph classes. Our contribution is an important extension to previous results about the entropy of nonhierarchical networks because for practical applications hierarchical networks are playing an important role in chemistry and biology. In addition to the derivation of the entropy bounds, we provide a numerical analysis for two special graph classes, rooted trees and generalized trees, and demonstrate hereby not only the computational feasibility of our method but also learn about its characteristics and interpretability with respect to data analysis.
Application of a similarity measure for graphs to webbased document structures
 International Conference on Data Analysis ICA 2005, in conjuction with the 7th World Enformatika Conference, Budapest/Hungary
"... Abstract — Due to the tremendous amount of information provided by the World Wide Web (WWW) developing methods for mining the structure of webbased documents is of considerable interest. In this paper we present a similarity measure for graphs representing webbased hypertext structures. Our simila ..."
Abstract

Cited by 3 (2 self)
 Add to MetaCart
(Show Context)
Abstract — Due to the tremendous amount of information provided by the World Wide Web (WWW) developing methods for mining the structure of webbased documents is of considerable interest. In this paper we present a similarity measure for graphs representing webbased hypertext structures. Our similarity measure is mainly based on a novel representation of a graph as linear integer strings, whose components represent structural properties of the graph. The similarity of two graphs is then defined as the optimal alignment of the underlying property strings. In this paper we apply the well known technique of sequence alignments for solving a novel and challenging problem: Measuring the structural similarity of generalized trees. In other words: We first transform our graphs considered as high dimensional objects in linear structures. Then we derive similarity values from the alignments of the property strings in order to measure the structural similarity of generalized trees. Hence, we transform a graph similarity problem to a string similarity problem for developing a efficient graph similarity measure. We demonstrate that our similarity measure captures important structural information by applying it to two different test sets consisting of graphs representing webbased document structures.
Measuring the Structural Similarity of Webbased Documents: A novel Approach
"... Abstract — Most known methods for measuring the structural similarity of document structures are based on, e.g., tag measures, path metrics and tree measures in terms of their DOMTrees. Other methods measures the similarity in the framework of the well known vector space model. In contrast to these ..."
Abstract

Cited by 2 (1 self)
 Add to MetaCart
(Show Context)
Abstract — Most known methods for measuring the structural similarity of document structures are based on, e.g., tag measures, path metrics and tree measures in terms of their DOMTrees. Other methods measures the similarity in the framework of the well known vector space model. In contrast to these we present a new approach to measuring the structural similarity of webbased documents represented by so called generalized trees which are more general than DOMTrees which represent only directed rooted trees. We will design a new similarity measure for graphs representing webbased hypertext structures. Our similarity measure is mainly based on a novel representation of a graph as strings of linear integers, whose components represent structural properties of the graph. The similarity of two graphs is then defined as the optimal alignment of the underlying property strings. In this paper we apply the well known technique of sequence alignments to solve a novel and challenging problem: Measuring the structural similarity of generalized trees. More precisely, we first transform our graphs considered as high dimensional objects in linear structures. Then we derive similarity values from the alignments of the property strings in order to measure the structural similarity of generalized trees. Hence, we transform a graph similarity problem to a string similarity problem. We demonstrate that our similarity measure captures important structural information by applying it to two different test sets consisting of graphs representing webbased documents.
A systems biology approach for the classification of dna microarray data
 in Proceedings of ICANN 2005, Poland/Torun
, 2006
"... ..."
(Show Context)
www.gldv.org Text MiningImpressum LDVForum
"... h�p://www.gldv.org/cms/vorstand.php, h�p://www.gldv.org/cms/topics.php � He�e im Jahr, halbjährlich zum ��. Mai und ��. Oktober. Preprints und redaktionelle Planungen sind über die Website der GLDV einsehbar (h�p://www.gldv.org). Unaufgefordert eingesandte Fachbeiträge werden vor Veröffentlichung vo ..."
Abstract
 Add to MetaCart
h�p://www.gldv.org/cms/vorstand.php, h�p://www.gldv.org/cms/topics.php � He�e im Jahr, halbjährlich zum ��. Mai und ��. Oktober. Preprints und redaktionelle Planungen sind über die Website der GLDV einsehbar (h�p://www.gldv.org). Unaufgefordert eingesandte Fachbeiträge werden vor Veröffentlichung von mindestens zwei ReferentInnen begutachtet. Manuskripte sollten deshalb möglichst frühzeitig eingereicht werden und bei Annahme zur Veröffentlichung in jedem Fall elektronisch und zusätzlich auf Papier übermi�elt werden. Die namentlich gezeichneten Beiträge geben ausschließlich die Meinung der AutorInnen wieder. Einreichungen sind an die Herausgeber zu übermi�eln. Für Mitglieder der GLDV ist der Bezugspreis des LDVForums im Jahresbeitrag mit eingeschlossen. Jahresabonnements können zum Preis von ��, € (inkl. Versand), Einzelexemplare zum Preis von ��, € (zzgl. Versandkosten) bei der Redaktion bestellt werden. Christoph Pfeiffer, Regensburg, mit LaTeX (pdfeTeX / MiKTeX)
HYPERTEXT TYPES AND MARKUP LANGUAGES The Relationship Between HTML and Web Genres
"... It is vital to take a closer look at the role of the Hypertext Markup Language (HTML, Raggett et al., 1999) with regard to text technological applications that aim at processing web documents (for example, automatic summarisation, information extraction, or text classifi ..."
Abstract
 Add to MetaCart
It is vital to take a closer look at the role of the Hypertext Markup Language (HTML, Raggett et al., 1999) with regard to text technological applications that aim at processing web documents (for example, automatic summarisation, information extraction, or text classifi