nlup contains some base libraries I use in natural language processing projects. Some highlights:
- confusion.py: classifier evaluation objects
- decorators.py: clever decorators for various purposes
- jsonable.py: a mix-in which allows the state of most objects to be serialized to (and deserialized from) compressed JSON
- perceptron.py: perceptron-like classifiers (binary and multiclass), including some forms of structured prediction
- reader.py: classes and readers for tagged and dependency-parsed data
- timer.py: a with-block that logs wall clock time elapsed
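
To illustrate the kind of with-block that timer.py is described as providing, here is a minimal sketch using only the standard library. It is not nlup's actual code: the name `log_elapsed` and its `label` and `logger` parameters are hypothetical, chosen only for this example.

```python
import logging
import time
from contextlib import contextmanager


@contextmanager
def log_elapsed(label="block", logger=None):
    """Log the wall clock time spent inside a with-block.

    Hypothetical helper for illustration; not part of nlup's API.
    """
    logger = logger or logging.getLogger(__name__)
    start = time.perf_counter()
    try:
        yield
    finally:
        # Runs even if the body raises, so the timing is always logged.
        elapsed = time.perf_counter() - start
        logger.info("%s: %.3f s elapsed", label, elapsed)
```

With logging configured at the INFO level, wrapping any code in `with log_elapsed("tagging"):` would emit one log line with the elapsed time when the block exits.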
All have been tested on CPython 3.4.1 and PyPy 3.2.5 (PyPy version 2.3.1). They will not work on Python 2 without modification.
Some projects using nlup:
- Detector Morse: simple sentence boundary detection
- Perceptronix Point Never: simple part of speech tagging
- Where's Yr Head At: simple transition-based dependency parsing