Part-of-speech

Nederlab





Nederlab, online laboratory for humanities research on Dutch text collections

Summary

A user-friendly and tool-enriched open access web interface that aims at containing all digitized texts relevant for the Dutch national heritage and the history of Dutch language and culture (c. 800 - present).

MIMORE Data

MIMORE

Summary

The MIMORE tool enables researchers to investigate morphosyntactic variation in the Dutch dialects by searching three related databases with a common on-line search engine. The three databases involved are also available as XML: DynaSAND (the dynamic syntactic atlas of the Dutch dialects), DiDDD (Diversity in Dutch DP Design) and GTRP (Goeman, Taeldeman, van Reenen Project).

SHEBANQ Data







SHEBANQ: System for HEBrew Text: ANnotations for Queries and Markup

Summary

The WIVU Hebrew Text Database contains the Hebrew text of the Old Testament enriched with many linguistic features at the morpheme level up to the discourse level.

VK Data

The enriched publication of the important Dutch historiographical work Het Koninkrijk der Nederlanden in de Tweede Wereldoorlog (The Kingdom of the Netherlands in WWII) by Dr. Loe de Jong.

Cornetto Data

Cornetto is a lexical resource for the Dutch language which combines two resources with different semantic organisations: the Dutch Wordnet with its synset organisation and the Dutch Reference Lexicon which includes definitions, usage constraints, selectional restrictions, syntactic behaviours, illustrative contexts, etc. The Cornetto database contains over 92K lemmas and almost 120K word meanings.

GrNe Data

Online dictionary (ancient) Greek - Dutch for the letter Pi. Search functions include searches for Greek lemmata; search of Greek declined or conjugated word-forms that lead to the correct lemma (‘lemmatizer’); searches for Dutch words leading to different Greek lemmata; etymological searches. The dictionary is linked to Logeion, the international website of Greek dictionaries at the University of Chicago. The developers estimate that a complete version of the dictionary will be finished by the end of 2015 and that it will be published by the end of 2016.

FESLI Data

FESLI: Functional elements in Specific Language Impairment

Summary

A corpus with data from monolingual and bilingual children (Dutch - Turkish) with and without Specific Language Impairment (SLI).

Background

The FESLI-data come from two NWO-sponsored projects: BiSLI and Variflex. The numbers of children included in the resources are:

DUELME Data

DUELME is an electronic lexicon that contains more than 5,000 Dutch multiword expressions (MWEs). The DUELME lexicon is suitable for theoretical research on multiword expressions as well as for use in NLP systems. Multiword expressions with similar syntactic patterns are grouped in equivalence classes. Semantic restrictions on variable arguments are encoded.

VU-DNC






VU-DNC: VU Diachronic Newspaper Corpus

Summary

VU-DNC is a unique diachronic corpus of Dutch newspaper articles from five major Dutch newspapers from 1950/1951 and 2002 (2 MW). The VU-DNC has been annotated for quotations, which enables the researcher to differentiate between the words directly under responsibility of the journalist.

Pages