DFKI Research Report-04-01



Language: English

by Jan-Thies Bär, Peter Dannenmann, Ludger van Elst, Armin Hust, Andreas Lauer, Heiko Maus, Sven Schwarz

EPOS Evolving Personal to Organizational Knowledge Spaces

55 Pages


We present a simple and intuitive unsound corpus-driven approximation method for turning unification-based grammars (UBGs), such as HPSG, CLE, or PATR-II, into context-free grammars (CFGs). The method is unsound in that it does not generate a CFG whose language is a true superset of the language accepted by the original unification-based grammar. It is a corpus-driven method in that it relies on a corpus of parsed sentences and generates broader CFGs when given more input samples. Our open approach can be fine-tuned in different directions, allowing us to monotonically approach the original parse trees by shifting more information onto the context-free symbols. The approach has been fully implemented in Java. This report updates and extends the paper presented at the International Colloquium on Grammatical Inference (ICGI 2004) and presents further measurements.
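The corpus-driven idea can be illustrated with a minimal sketch. It is not the authors' Java implementation: it assumes, purely for illustration, that each node of a parsed sentence carries a flat feature dictionary (real UBG feature structures are nested and typed), and the names `symbol`, `extract_rules`, `approximate`, and the `keep` parameter are hypothetical. A context-free symbol is formed by projecting a node's features onto the kept feature names, and one rule is read off each local tree.

```python
from typing import Any, Dict, List, Set, Tuple

# Assumption: a node is (features, children); a child that is a plain
# string is a word. Real UBG feature structures are far richer.
Node = Tuple[Dict[str, str], List[Any]]
Rule = Tuple[str, Tuple[str, ...]]


def symbol(features: Dict[str, str], keep: Tuple[str, ...]) -> str:
    """Project a feature dict onto the kept features to form a CFG symbol."""
    return "[" + ",".join(f"{k}={features.get(k, '?')}" for k in keep) + "]"


def extract_rules(tree: Node, keep: Tuple[str, ...]) -> Set[Rule]:
    """Read one context-free rule off every local tree in a parse tree."""
    features, children = tree
    rhs: List[str] = []
    rules: Set[Rule] = set()
    for child in children:
        if isinstance(child, str):
            rhs.append(child)  # terminal word
        else:
            rhs.append(symbol(child[0], keep))
            rules |= extract_rules(child, keep)
    rules.add((symbol(features, keep), tuple(rhs)))
    return rules


def approximate(corpus: List[Node], keep: Tuple[str, ...]) -> Set[Rule]:
    """Union of the rules extracted from all parsed sentences."""
    rules: Set[Rule] = set()
    for tree in corpus:
        rules |= extract_rules(tree, keep)
    return rules


def leaf(cat: str, num: str, word: str) -> Node:
    """Helper for building a preterminal node over a single word."""
    return ({"cat": cat, "num": num}, [word])


# Toy corpus of two parsed sentences (invented for this sketch).
corpus: List[Node] = [
    ({"cat": "S", "num": "sg"},
     [leaf("NP", "sg", "she"), leaf("VP", "sg", "sleeps")]),
    ({"cat": "S", "num": "pl"},
     [leaf("NP", "pl", "they"), leaf("VP", "pl", "sleep")]),
]

coarse = approximate(corpus, ("cat",))       # symbols carry the category only
fine = approximate(corpus, ("cat", "num"))   # symbols also carry number
```

In this toy setting, the coarse grammar also licenses "she sleep", while keeping `num` on the symbols rules it out, which mirrors the idea of shifting more information onto the context-free symbols. Adding parsed sentences can only add rules, so the grammar becomes broader with more input samples; sentences whose rules never occur in the corpus are not covered, which is why such an approximation is unsound.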

This document is available as Postscript.


DFKI-Bibliothek (bib@dfki.uni-kl.de)
