Home

A free semantically annotated corpus that anyone can edit!

The current (development) version of the GMB is accessible via the GMB Explorer, and comprises thousands of texts in raw and tokenised format, tags for part of speech, named entities and lexical categories, and discourse representation structures compatible with first-order logic.



You're welcome to contribute to the GMB by providing corrections or opinions to existing linguistic annotations in a wiki-like environment (we kindly ask you to register though). Stable releases are available from the downloads page. Future stable releases will be made available periodically.