Syntax

MIMORE Data

MIMORE

Summary

The MIMORE tool enables researchers to investigate morphosyntactic variation in the Dutch dialects by searching three related databases with a common on-line search engine. The three databases involved are also available as XML: DynaSAND (the dynamic syntactic atlas of the Dutch dialects), DiDDD (Diversity in Dutch DP Design) and GTRP (Goeman, Taeldeman, van Reenen Project).

SHEBANQ Data







SHEBANQ: System for HEBrew Text: ANnotations for Queries and Markup

Summary

The WIVU Hebrew Text Database contains the Hebrew text of the Old Testament enriched with many linguistic features at the morpheme level up to the discourse level.

FESLI Data

FESLI: Functional elements in Specific Language Impairment

Summary

A corpus with data from monolingual and bilingual children (Dutch - Turkish) with and without Specific Language Impairment (SLI).

Background

The FESLI-data come from two NWO-sponsored projects: BiSLI and Variflex. The numbers of children included in the resources are:

DUELME Data

DUELME is an electronic lexicon that contains more than 5,000 Dutch multiword expressions (MWEs). The DUELME lexicon is suitable for theoretical research on multiword expressions as well as for use in NLP systems. Multiword expressions with similar syntactic patterns are grouped in equivalence classes. Semantic restrictions on variable arguments are encoded.

OpenSONAR








OpenSONAR: a 500 MW reference corpus

Summary

OpenSoNaR is an online system that allows for analyzing and searching the large scale Dutch reference corpus SoNaR. Due to the size of the corpus (500 million words), accessing the information contained in the dataset has proven to be difficult for less technically inclined researchers. OpenSoNaR facilitates the use of the SoNaR corpus by providing a user-friendly online interface.

PaQu





PaQu - Parse and Query

Summary

The PaQu web service makes it possible to search in syntactically annotated corpora in Dutch. You can parse your own Dutch text corpus or use one of two corpora provided by the developers.

SHEBANQ







SHEBANQ: System for HEBrew Text: ANnotations for Queries and Markup

Summary

A web application that enables researchers to perform linguistic queries on the WIVU Hebrew Text Database and preserve significant results as annotations to this resource. This database contains the Hebrew text of the Old Testament enriched with many linguistic features at the morpheme level up to the discourse level.

GrETEL

GrETEL is a query engine in which linguists can use a natural language example as a starting point for searching a treebank with limited knowledge about tree representations and formal query languages. By allowing users to search for constructions which are similar to the example they provide, we hope to bridge the gap between traditional and computational linguistics.

LASSY Word Relations Search

The LASSY word relations web application makes it possible to search for sentences that contain pairs of words between which there is a grammatical relation. One can search in the Dutch LASSY-SMALL Treebank (1 million tokens), in which the syntactic parse of each sentence has been manually verified, and in (a part of) the LASSY-LARGE Treebank (700 million tokens ),in which the syntactic parse of each sentence has been added by the automatic parser Alpino.

TTNWW

TTNWW integrates and makes available existing Language Technology (LT) software components for the Dutch language that have been developed in the STEVIN and CGN projects. The LT components are made available as web-services in a simplified workflow system that enables researchers without much technical background to use standard LT workflow recipes. The web services are available in two separate domains: "Text" and "Speech" processing. The TTNWW services have been created in a Dutch and Flemish collaboration project building on the results of past Dutch and Flemish projects. The web services are partly deployed in the SURF-SARA BiG-Grid cloud or at CLARIN centres in the Netherlands and at CLARIN VL University partners.

Pages