- The thesaurus generation tool TRex was extended and enhanced in the following areas:
  - term and phrase generation
  - construction of the term-context matrix
  - computation of term similarities
  - analysis of the term-context matrix
- TRex can be easily adapted to domain-specific document collections
  - specification of document schemata, stopwords and non-text
  - adjustment of term and phrase generation parameters
- TRex offers a variety of techniques for computing term similarities
  - different term contexts (document, window)
  - various weighting schemes
  - singular value decomposition of the term-context matrix
  - numerous similarity scores
- TRex can exploit lists of important terms (extracted from an ontology)
  - for focussing term and phrase generation
  - for weighting of term similarities
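
The steps listed above (term-context matrix, weighting, singular value decomposition, similarity scores) can be illustrated with a minimal NumPy sketch. This is not TRex's actual code or API: every function name here is hypothetical, the smoothed inverse-context-frequency weighting and the cosine score are just one choice among the "various weighting schemes" and "numerous similarity scores" mentioned, and the toy documents are made up for illustration.

```python
import numpy as np

def term_context_matrix(docs, window=None):
    """Count term/context co-occurrences (hypothetical helper, not TRex's API).

    window=None: each document is one context (term-document matrix).
    window=k:    each context is a term within k positions (term-term matrix).
    """
    vocab = sorted({t for d in docs for t in d})
    index = {t: i for i, t in enumerate(vocab)}
    n_ctx = len(docs) if window is None else len(vocab)
    m = np.zeros((len(vocab), n_ctx))
    for j, doc in enumerate(docs):
        for pos, term in enumerate(doc):
            if window is None:
                m[index[term], j] += 1
            else:
                lo, hi = max(0, pos - window), min(len(doc), pos + window + 1)
                for k in range(lo, hi):
                    if k != pos:
                        m[index[term], index[doc[k]]] += 1
    return vocab, m

def weight(m):
    """Assumed weighting: counts times a smoothed inverse context frequency."""
    n_ctx = m.shape[1]
    cf = np.count_nonzero(m, axis=1)  # number of contexts each term occurs in
    icf = np.log((1 + n_ctx) / (1 + cf)) + 1
    return m * icf[:, None]

def reduce_rank(m, k):
    """Project terms onto the k strongest singular directions."""
    u, s, _ = np.linalg.svd(m, full_matrices=False)
    return u[:, :k] * s[:k]

def cosine_similarities(m):
    """Pairwise cosine similarity between term rows."""
    norms = np.linalg.norm(m, axis=1, keepdims=True)
    unit = m / np.where(norms == 0, 1, norms)  # leave all-zero rows at zero
    return unit @ unit.T

# Toy, already tokenised documents (illustrative only).
docs = [
    ["thesaurus", "term", "similarity"],
    ["term", "context", "matrix"],
    ["thesaurus", "term", "matrix"],
]
vocab, m = term_context_matrix(docs)             # document context
raw_sims = cosine_similarities(m)                # no weighting, no SVD
sims = cosine_similarities(reduce_rank(weight(m), k=2))
```

Passing `window=1` instead of the default switches from document contexts to window contexts, so the same downstream functions can be reused for either context type.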