lemma

A string selected from an inflectional paradigm to represent the inflectional paradigm as a whole. A inflectional paradigm is a set of words that differ only in grammatical properties. For example, in English buy is the lemma for the inflectional paradigm { buy, buys, buying, bought}

SHEBANQ Data







SHEBANQ: System for HEBrew Text: ANnotations for Queries and Markup

Summary

The WIVU Hebrew Text Database contains the Hebrew text of the Old Testament enriched with many linguistic features at the morpheme level up to the discourse level.

VK Data

The enriched publication of the important Dutch historiographical work Het Koninkrijk der Nederlanden in de Tweede Wereldoorlog (The Kingdom of the Netherlands in WWII) by Dr. Loe de Jong.

Cornetto Data

Cornetto is a lexical resource for the Dutch language which combines two resources with different semantic organisations: the Dutch Wordnet with its synset organisation and the Dutch Reference Lexicon which includes definitions, usage constraints, selectional restrictions, syntactic behaviours, illustrative contexts, etc. The Cornetto database contains over 92K lemmas and almost 120K word meanings.

GrNe Data

Online dictionary (ancient) Greek - Dutch for the letter Pi. Search functions include searches for Greek lemmata; search of Greek declined or conjugated word-forms that lead to the correct lemma (‘lemmatizer’); searches for Dutch words leading to different Greek lemmata; etymological searches. The dictionary is linked to Logeion, the international website of Greek dictionaries at the University of Chicago. The developers estimate that a complete version of the dictionary will be finished by the end of 2015 and that it will be published by the end of 2016.

FESLI Data

FESLI: Functional elements in Specific Language Impairment

Summary

A corpus with data from monolingual and bilingual children (Dutch - Turkish) with and without Specific Language Impairment (SLI).

Background

The FESLI-data come from two NWO-sponsored projects: BiSLI and Variflex. The numbers of children included in the resources are:

DUELME Data

DUELME is an electronic lexicon that contains more than 5,000 Dutch multiword expressions (MWEs). The DUELME lexicon is suitable for theoretical research on multiword expressions as well as for use in NLP systems. Multiword expressions with similar syntactic patterns are grouped in equivalence classes. Semantic restrictions on variable arguments are encoded.