Here is an overview of the natural language processing software currently used for the automatic annotation of the Groningen Meaning Bank.
elephant is the statistical tool for word and sentence
segmentation used in the GMB.
The C&C tools
provide the automatic annotation of the
GMB: POS tagging, NE tagging, and syntactic parsing (CCG).
We use morpha for morphological analysis.
- Further included in the suite is
which prodices semantic representations (DRS) on top of the CCG parse trees.
- An online demo of the C&C and Boxer pipeline is available here. The demo is based on a REST API.