Cornetto Data

Cornetto: Combinatorial and Relational Network as Toolkit for Dutch Language Technology

Summary

The data used in the Cornetto lexical resource (92K lemmas and almost 120K word meanings) are no longer available. The best alternative data set is the Open Dutch WordNet. It consists of 116,992 synsets, of which 95,356 originate from WordNet 3.0.

Background

Open Source Dutch WordNet is a Dutch lexical semantic database. It was created by removing the proprietary content from Cornetto (http://dev.clarin.nl/node/1944) and replacing it with content from open source resources. Open Source Dutch WordNet contains 116,992 synsets, of which 95,356 originate from WordNet 3.0 and 21,636 are new. The number of English (WordNet 3.0) synsets without Dutch synonyms is 60,743, which means that 34,613 WordNet 3.0 synsets have been filled with at least one Dutch synonym.
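The counts in this paragraph are related by simple subtraction. The short Python sketch below reproduces the derived figures from the stated ones; the variable names are illustrative and are not part of the resource itself.

```python
# Figures as stated in this section.
total_synsets = 116_992
from_wordnet_30 = 95_356
without_dutch_synonyms = 60_743

# Synsets that do not originate from WordNet 3.0 (the new synsets).
new_synsets = total_synsets - from_wordnet_30

# WordNet 3.0 synsets that have at least one Dutch synonym.
with_dutch_synonyms = from_wordnet_30 - without_dutch_synonyms

print(new_synsets, with_dutch_synonyms)
```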

This project has been co-funded by the Nederlandse Taalunie (http://taalunie.org/). The Nederlandse Taalunie and the Free University of Amsterdam share the ownership of Open Source Dutch WordNet.

Contacts
  • Project leader: Prof. dr. Piek Vossen (VU University Amsterdam)
  • CLARIN center: Institute for Dutch Lexicology
  • Help contact:

Links

Country

Netherlands

CLARIN centre

Dutch Language Institute

Language

Research domain

Resource tags

Annotations

Format

XML
LMF
RDF