The AnnCor Project

The AnnCor project creates a repository with richly annotated text data for the Dutch langauge, together with software to automatically and manually enrich the data with morpho-syntactic, syntactic and discourse properties, as well as advanced software to search in the data and to analyze search results on orthographic, morpho-syntactic, syntactic and discourse properties. The existing treebnak search application GrETEL, developed by KU Leuven, has been taken as a basis but is being extende with new features. The repository will be made fuly compatible with requirements of the CLARIN infrastructure.

Four types of annotations are being created:

morpho-syntactic and syntactic annotations
discourse annotations
annotation of learner corpora for errors and their corrections
annotation of narrative corpora