Textual data

WFT-GTB Data

The Dictionary of the Frisian Language (Wurdboek fan de Fryske Taal) is available online via the GTB dictionary web application. The GTB also holds other major Dutch historical dictionaries, such as the Dictionary of Old Dutch (ONW), the Dictionary of Early Middle Dutch (VMNW), the Dictionary of Middle Dutch (MNW), and the Dictionary of the Dutch Language (WNT). The digital environment supports a wide range of free and structured search queries, including comparative studies with Dutch materials.

DiscAn

DiscAn: Towards a Discourse Annotation system for Dutch language corpora

Summary

LAISEANG

LAISEANG: Language Archive of Insular South East Asia and West New Guinea

Summary

The LAISEANG corpus contains an unrivaled collection of multimedia materials and written documents from over 50 languages in Insular South East Asia and West New Guinea.

EMIT-X

EMIT-X: Early-Modern Image and Text eXchange

Summary

VU-DNC

VU-DNC: VU Diachronic Newspaper Corpus

Summary

VU-DNC is a diachronic corpus of articles from five major Dutch newspapers, covering 1950/1951 and 2002 (2 million words). The VU-DNC has been annotated for quotations, which enables the researcher to differentiate between quoted material and the words directly under the responsibility of the journalist.

NEHOL

NEHOL: Negerhollands Database

Summary

C-DSD

C-DSD: Curating the Dutch Song Database

Summary

D-LUCEA

D-LUCEA: Database of the Longitudinal Utrecht Collection of English Accents

Summary

VALID

VALID - vulnerability in language acquisition: language impairments in Dutch

Summary

An open-access multimedia archive of language pathology data collected in the Netherlands, primarily on Dutch, consisting of audio files and transcripts. The corpus currently contains 5 different data sets. The VALID data archive can bring together old, current, and future data.

DBD/TCULT

DBD - Dutch Bilingualism Database, TCULT - Talen en Culturen in Utrechtse Lombok en Transvaal

Summary

The CLARIN NL supported data sets are part of an existing collection, the Dutch Bilingualism Database housed at the MPI for Psycholinguistics, and both are also CLARIN compatible. The additional DBD / TCULT data were curated by the CLARIN DCS (http://dev.clarin.nl/node/1963) and delivered in February 2014.
