Documents are a plentiful source of information available in any application domain
EXPLOITING THESAURUS GENERATION FOR KNOWLEDGE ACQUISITION
Example from FAKT project:
similar terms to ‘backup’
tapemountdevice not readyrestoredata safety
Example from ‘Die WELT’ Articles on the German spelling reform
similar terms to ‘Rechtschreibung’
ReformKultusministerkonferenzDudenRegeln (112 Regeln)Orthographie
In an Organizational Memory it is important to handle large amounts of knowledge
First results confirm our expectations that thesaurus generation methods may be profitably exploited for knowledge acquisition
Even a rough analysis of word frequencies and correlations ...
... identifies core topics in a new domain
... offers guidance for subsequent knowledge acquisition
An analysis of term similarities points out interesting relationships and dependencies
More sophisticated analyses based on additional knowledge are neededto separate meaningful from spurious results