(b) The TCW Tool for learning text classification has been integrated to automatically create meta information for text documents in the OM
The Text Classification Workbench TCW is trained with manually categorized example documents
The system learns characteristic complex text patterns
After training, new documents can be categorized automatically
This can be applied to all text documents added to the OM
TCW originated from the READ and Virtual Office document analysis projects
Automatic creation of more detailed formal descriptions will be investigated using information extraction techniques.