Logo Utrecht University

AnnCor

The AnnCor Project

The AnnCor project creates a repository with richly annotated text  data for the Dutch langauge, together with software to automatically and manually enrich  the data with morpho-syntactic, syntactic and discourse properties, as well as advanced software to search in the data and to analyze search results on orthographic, morpho-syntactic, syntactic and discourse properties. The existing treebnak search application GrETEL, developed by KU Leuven, has been taken as a basis but is being extende with new features. The repository will be made fuly compatible with requirements of the  CLARIN infrastructure.

Four types of annotations are being created:

  • morpho-syntactic and syntactic annotations
  • discourse annotations
  • annotation of learner corpora for errors and their corrections
  • annotation of narrative corpora