Skip to content

[Docker] Initialization fails for nld-vnn and dum #99

@tcbrouwer

Description

@tcbrouwer

I run Frog from https://hub.docker.com/r/proycon/frog:latest . Currently running

docker run proycon/frog --language=dum

results in

frog-:config read from: /usr/local/share/frog/dum/frog.cfg
frog-:Missing [[mbma]] section in config file.
frog-:Disabled the Morhological analyzer.
frog-:Missing [[IOB]] section in config file.
frog-:Disabled the IOB Chunker.
frog-:Missing [[NER]] section in config file.
frog-:Disabled the NER.
frog-:Missing [[mwu]] section in config file.
frog-:Disabled the Multi Word Unit.
frog-:Also disabled the parser.
frog-mblem-:Initiating lemmatizer...
ucto: textcat configured from: /usr/local/share/ucto/textcat.cfg
frog-tok-:Language List =[dum]
ucto: No useful settingsfile(s) could be found (initiating from language list: [dum])
frog-tagger-tagger-:reading subsets from /usr/local/share/frog/dum//crmsub.cgn
frog-tagger-tagger-:reading constraints from /usr/local/share/frog/dum//crmconstraints.cgn
frog-:Initialization failed for: [tokenizer] 
frog-:fatal error: Frog init failed

Similar for nld-vnn (after mitigation LanguageMachines/frogdata#7 )

With my previous build from 6 months ago, I did not encounter this initialization problem.

My current mitigation is to use

docker run proycon/frog --language=dum,nld

instead. With this the initialization succeeds, and if I understand correctly from

WARNING: you used the --language=dum,nld option with more then one language 
	 specified. These values will be handled to the tokenizer, but Frog
	 will only handle the first language: dum for further processing!

still only dum is used for Frog, which is the core of our use case.

As a sidenote, in general the dockerized deployment of Frog is very helpful to me.

Metadata

Metadata

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions