mblem - (dutch) lemmatizer

NAME

mblem -t test-file

mblem will tokenize, tag, and lemmatize word tokens in Dutch text files.

-c <configfile>

set the configuration using ’file’ The default is to use the Frog config file.

-d <level>

set debug level.

--notagger

Don’t use a tagger to disambiguate. All variants are shown. The default is to use the Dutch CGN tagger.

--notokenizer

Don’t use a tokenizer. Assume al input to be tokenized. The default is to use the Ucto Tokenizer for Dutch.

-h

give some help

-t <file>

process ’file’

likely

Antal van den Bosch [email protected]