The AnnCor Project
The AnnCor project creates a repository with richly annotated text data for the Dutch langauge, together with software to automatically and manually enrich the data with morpho-syntactic, syntactic and discourse properties, as well as advanced software to search in the data and to analyze search results on orthographic, morpho-syntactic, syntactic and discourse properties. The existing treebnak search application GrETEL, developed by KU Leuven, has been taken as a basis but is being extende with new features. The repository will be made fuly compatible with requirements of the CLARIN infrastructure.
Four types of annotations are being created:
- morpho-syntactic and syntactic annotations
- discourse annotations
- annotation of learner corpora for errors and their corrections
- annotation of narrative corpora