Orthographic Transcription




The MIMORE tool enables researchers to investigate morphosyntactic variation in the Dutch dialects by searching three related databases with a common on-line search engine. The three databases involved are also available as XML: DynaSAND (the dynamic syntactic atlas of the Dutch dialects), DiDDD (Diversity in Dutch DP Design) and GTRP (Goeman, Taeldeman, van Reenen Project).


A corpus of 250 interviews from the Living Oral History Workbench enriched with commentary in the Oral History Annotation tool, developed by the Centre for Language and Speech Technology (CLST) at the Radboud University Nijmegen. All 250 interviews are searchable through a fragment finder and can be annotated. These annotations can be shared with other researchers, making the interviews available and easier accessible for a much wider range of researchers in the humanities in general and in linguistics in particular. The Annotation Tool is only available for scientific research and only after approval by the Veterans Institute.


LAISEANG: Language Archive of Insular South East Asia and West New Guinea


The LAISEANG corpus contains an unrivaled collection of multimedia materials and written documents from over 50 languages in Insular South East Asia and West New Guinea.


VALID - vulnerability in language acquisition: language impairments in Dutch


An open access multimedia archive of language pathology data collected in the Netherlands, primarily on Dutch, audio files and transcripts. Currently, this corpus contains 5 different data sets. In the VALID data archive old, current and future data can be brought together.


IPNV - Interview Project Dutch Veterans


The IPNV data set contains the public part of a collection of interviews collected by the Dutch Veteran Institute. They contain stories covering almost all conflicts and military missions where the Netherlands were involved. The public part of this collection of about 500 interviews was made available via the internet the non-public part. For CLARIN the data was curated by the DCS (http://dev.clarin.nl/node/1963) in May 2013.