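To run a CRF on Windows, I needed to install sklearn-crfsuite. My first idea was to set up the environment through PyCharm, which installed sklearn-crfsuite without complaint. Then I ran the code and, well, it failed: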
正在训练评估CRF模型... (Training and evaluating the CRF model...)
Traceback (most recent call last):
  File "C:/Users/cc/Documents/xxx/yyy/window_version/main.py", line 73, in <module>
    main()
  File "C:/Users/cc/Documents/xxx/yyy/window_version/main.py", line 31, in main
    (test_word_lists, test_tag_lists)
  File "C:\Users\cc\Documents\xxx\yyy\window_version\evaluate.py", line 43, in crf_train_eval
    crf_model.train(train_word_lists, train_tag_lists)
  File "C:\Users\cc\Documents\xxx\yyy\window_version\models\crf.py", line 23, in train
    self.model.fit(features, tag_lists)
  File "C:\Users\cc\AppData\Local\Programs\Python\Python36\lib\site-packages\sklearn_crfsuite\estimator.py", line 331, in fit
    trainer.train(self.modelfile.name, holdout=-1 if X_dev is None else 1)
  File "pycrfsuite\_pycrfsuite.pyx", line 359, in pycrfsuite._pycrfsuite.BaseTrainer.train
  File "stringsource", line 15, in string.from_py.__

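Training a CRF model with sklearn-crfsuite on Windows ran into a Unicode encoding error. Switching to a different machine and installing the package with pip resolved it, and along the way the difference between _pycrfsuite.pyd and _pycrfsuite.pyx became clear.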
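For context on that last point: a .pyx file is Cython source, while a .pyd file is the compiled extension module that Python actually loads on Windows. The traceback names _pycrfsuite.pyx only because Cython records the source file name in the compiled module. The following is a minimal sketch, not part of the original fix, for checking which file a given installation really loads. The helper name describe_module is hypothetical, and it relies only on the standard library.

```python
import importlib.util
from importlib.machinery import EXTENSION_SUFFIXES


def describe_module(name):
    """Return (origin, is_compiled) for a module, or None if it cannot be found.

    origin is the file the import system would load; is_compiled is True when
    that file has a compiled-extension suffix (.pyd on Windows, .so elsewhere).
    """
    try:
        # For a dotted name, find_spec imports the parent package first,
        # so a missing package surfaces here as an ImportError.
        spec = importlib.util.find_spec(name)
    except (ImportError, ValueError):
        return None
    if spec is None:
        return None
    origin = spec.origin or ""
    is_compiled = any(origin.endswith(suffix) for suffix in EXTENSION_SUFFIXES)
    return origin, is_compiled


# Module names taken from the traceback above; prints None if not installed.
for module_name in ("pycrfsuite._pycrfsuite", "sklearn_crfsuite"):
    print(module_name, "->", describe_module(module_name))
```

On a working Windows installation, the first entry would be expected to point at a .pyd file inside site-packages, and the second at a plain .py file. Comparing this output between the failing and the working machine is one way to see whether the two environments load the same build.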