我正试图用nltk-trainer(https://github.com/japerk/nltk-trainer)来解决问题 . 我设法用命令(直接在Anaconda控制台中)训练荷兰语标签和chunkers:

python train_tagger.py conll2002 --fileids ned.train --classifier IIS --filename ~/nltk_data/taggers/conll2002_ned_IIS.pickle
python train_chunker.py conll2002 --fileids ned.train --classifier NaiveBayes --filename ~/nltk_data/chunkers/conll2002_ned_NaiveBayes.pickle

然后我运行一个小脚本来测试tagger和chunker:

import nltk
from nltk.corpus import conll2002

# Loading training pickles
tokenizer = nltk.data.load('tokenizers/punkt/dutch.pickle')
tagger = nltk.data.load('taggers/conll2002_ned_IIS.pickle')
chunker = nltk.data.load('chunkers/conll2002_ned_NaiveBayes.pickle')

# Testing
test_sents = conll2002.tagged_sents(fileids="ned.testb")[0:1000]
print "tagger accuracy on test-set: " + str(tagger.evaluate(test_sents))

test_sents = conll2002.chunked_sents(fileids="ned.testb")[0:1000]
print "tagger accuracy on test-set: " + str(chunker.evaluate(test_sents))

这可以从nltk-trainer-master文件夹中正常工作,但是当我将脚本移动到其他地方时,我收到导入错误:

ImportError: No module named nltk_trainer.chunking.chunkers

如何在nltk-trainer-master文件夹之外完成此工作,而不复制nltk_trainer文件夹?

(Python 2.7,nltk 3.2.1)