Lexicology

Dictionary of the Limburgian Dialects, Part III (General Vocabulary)

Dictionary of the Limburgian Dialects

Summary

Dictionary of the Limburgian Dialects, Part III (General Vocabulary)

Background

The dictionary of the Limburgian dialects, part II (General Vocabulary), abbreviated as WLD, is a semasiologically oriented dictionary. It consists of 14 subcollections, each covering a certain topic. The topics covered are:

Dictionary of the Brabantic Dialects, Part III (General Vocabulary)

Dictionary of the Brabantic Dialects

Summary

Dictionary of the Brabantic Dialects, Part III (General Vocabulary)

Background

The dictionary of the Brabantic dialects, part II (General Vocabulary), abbreviated as WBD, is a semasiologically oriented dictionary. It consists of 14 subcollections, each covering a certain topic. The topics covered are:

AutoSearch




















AutoSearch

Summary

AutoSearch allows users to upload corpora annotated at the token level for (extended) part of speech, lemma and word form in FoLiA or

DUELME Data

DUELME is an electronic lexicon that contains more than 5,000 Dutch multiword expressions (MWEs). The DUELME lexicon is suitable for theoretical research on multiword expressions as well as for use in NLP systems. Multiword expressions with similar syntactic patterns are grouped in equivalence classes. Semantic restrictions on variable arguments are encoded.

WFT-GTB Data

The Dictionary of the Frisian Language (Wurdboek fan de Fryske Taal) is online available via the GTB dictionary web application. The GTB also holds other major Dutch historical dictionaries, such as the Dictionary of Old Dutch (ONW), the Dictionary of early Middle Dutch (VMNW), the Dictionary of Middle Dutch (MNW), and the Dictionary of the Dutch language (WNT). The digital surrounding enables extensive forms of free and structured search queries, including comparative studies with Dutch materials.

OpenSONAR








OpenSONAR: a 500 MW reference corpus

Summary

OpenSoNaR is an online system that allows for analyzing and searching the large scale Dutch reference corpus SoNaR. Due to the size of the corpus (500 million words), accessing the information contained in the dataset has proven to be difficult for less technically inclined researchers. OpenSoNaR facilitates the use of the SoNaR corpus by providing a user-friendly online interface.

COAVA

In COAVA two sets of databases are made available in a standardized way: one with historical dialect data (the databases WBD and WLD with lexical data of the Brabantish and Limburgian dialect between 1880-1980) and one with first language acquisition data (four databases form the CHILDES project). The databases contain linguistic information (dialect form, standardised form (“Dutchified”), lexical meaning), geographical information (locality, dialect area, province) and information on the source (inquiry forms or monotopic dictionaries and the date of documentation). The visualisation of the first two sets of information will lead to lexical maps. The most typical way for the user to get to the data will be with the use of the browsable concept taxonomy. The databases are, in other words, approachable via search tools but also via a thematic taxonomy. This taxonomy was developed for the dialect databases and covers the general vocabulary.

WFT-GTB

The Dictionary of the Frisian Language (Wurdboek fan de Fryske Taal) is online available via the GTB dictionary web application. The GTB also holds other major Dutch historical dictionaries, such as the Dictionary of Old Dutch (ONW), the Dictionary of early Middle Dutch (VMNW), the Dictionary of Middle Dutch (MNW), and the Dictionary of the Dutch language (WNT). The digital surrounding enables extensive forms of free and structured search queries, including comparative studies with Dutch materials.

DuELME

The DUELME search interface provides access to the DUELME electronic lexicon, which contains more than 5,000 Dutch multiword expressions (MWEs). The search interface enables users to search for MWEs on the basis of a range of syntactic and semantic criteria, among them expression, pattern id, written form, type, conjugation, polarity, parameters, form, etc. Extensive documentation on the structure of the database is available.