<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://portal.clarin.nl"  xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>CLARIN-NL - text processing</title>
 <link>http://portal.clarin.nl/taxonomy/term/40</link>
 <description>Text processing services</description>
 <language>en</language>
<item>
 <title>OpenConvert</title>
 <link>http://portal.clarin.nl/node/4224</link>
 <description>
  &lt;div class=&quot;field-body&quot;&gt;
     &lt;img src=&quot;http://dev.clarin.nl/sites/default/files/picture.jpg&quot; height=&quot;400&quot; width=&quot;400&quot; /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;h4&gt;OpenConvert&lt;/h4&gt;
&lt;b&gt;Summary&lt;/b&gt;   
&lt;p style=&quot;text-align:justify&quot;&gt;
The OpenConvert tools convert to &lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/167&quot;&gt;&lt;dfn title=&quot;Text Encoding Initiative
See: http://en.wikipedia.org/wiki/Text_Encoding_Initiative&quot;&gt;TEI&lt;/dfn&gt;&lt;/a&gt; or FOLiA from a number of input formats (ALTO, text, Word, HTML, ePub). The tools are available as a Java command-line tool, a web service and a web application.
&lt;/p&gt;
&lt;b&gt;Background&lt;/b&gt;
&lt;p style=&quot;text-align:justify&quot;&gt;
The OpenConvert Tools were created by INL in the OpenConvert project.
Furthermore, as a proof of concept, the website currently provides two annotation tools: a simple Tokenizer for &lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/167&quot;&gt;&lt;dfn title=&quot;Text Encoding Initiative
See: http://en.wikipedia.org/wiki/Text_Encoding_Initiative&quot;&gt;TEI&lt;/dfn&gt;&lt;/a&gt; files and a part-of-speech tagger for modern Dutch.
&lt;/p&gt;
&lt;b&gt;Contacts&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Project leader: Jan Theo Bakker &lt;/li&gt;&lt;li&gt;CLARIN center:
&lt;/li&gt;&lt;li&gt;Help contact : &lt;a href=&quot;mailto:helpdesk@clarin.nl&quot;&gt;helpdesk@clarin.nl&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Links&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Web-sites: n.a.
&lt;/li&gt;&lt;li&gt;User scenarios (screencasts, screenshots): n.a. 
&lt;/li&gt;&lt;li&gt;Manual: &lt;a href=&quot;http://openconvert.clarin.inl.nl/openconvert/web/help.html&quot;&gt;http://openconvert.clarin.inl.nl/openconvert/web/help.html&lt;/a&gt; 
&lt;/li&gt;&lt;li&gt;Tool/Service link: 
&lt;ul&gt;&lt;li&gt;web application: &lt;a href=&quot;http://openconvert.clarin.inl.nl/&quot;&gt;http://openconvert.clarin.inl.nl/&lt;/a&gt; &lt;/li&gt;
&lt;li&gt;web services: see &lt;a href=&quot;http://openconvert.clarin.inl.nl/openconvert/web/help.html&quot;&gt;http://openconvert.clarin.inl.nl/openconvert/web/help.html&lt;/a&gt; &lt;/li&gt;
&lt;li&gt;command line tools: &lt;a href=&quot;https://github.com/INL/OpenConvert&quot;&gt;https://github.com/INL/OpenConvert&lt;/a&gt; &lt;/li&gt;
&lt;/ul&gt;&lt;/li&gt;&lt;li&gt;Publications: n.a.
&lt;/li&gt;&lt;/ul&gt;   &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Research domain  &lt;/h3&gt;

  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/213&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Religion Studies&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/45&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Communication &amp;amp; Media Studies&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/49&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Computational Linguistics&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/56&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Cultural Sciences&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/50&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;History&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/44&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Literary Studies&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/57&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Philosophy&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/55&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Political Studies&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/58&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Linguistics&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Language  &lt;/h3&gt;

  &lt;div class=&quot;field-language&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/67&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Dutch&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-language&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/92&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Language independent&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Resource tags  &lt;/h3&gt;

  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/41&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;service&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/40&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/37&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;web-application&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/24&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;web-service&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/25&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;local tool&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Tool task  &lt;/h3&gt;

  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/80&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/81&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;POS tagging&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/384&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text conversion&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/73&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;tokenization&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    CLARIN centre  &lt;/h3&gt;

  &lt;div class=&quot;field-clarin-centre-ref&quot;&gt;
    &lt;a href=&quot;/node/1936&quot;&gt;Dutch Language Institute&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Country  &lt;/h3&gt;

  &lt;div class=&quot;field-country&quot;&gt;
    Netherlands  &lt;/div&gt;
</description>
 <pubDate>Tue, 21 Apr 2015 12:29:10 +0000</pubDate>
 <dc:creator>Jan Odijk</dc:creator>
 <guid isPermaLink="false">4224 at http://portal.clarin.nl</guid>
 <comments>http://portal.clarin.nl/node/4224#comments</comments>
</item>
<item>
 <title>PaQu</title>
 <link>http://portal.clarin.nl/node/4182</link>
 <description>
  &lt;div class=&quot;field-body&quot;&gt;
     &lt;img src=&quot;http://dev.clarin.nl/sites/default/files/PaQu.jpg&quot; /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;h4&gt;PaQu - Parse and Query&lt;/h4&gt;
&lt;b&gt;Summary&lt;/b&gt;   
&lt;p style=&quot;text-align:justify&quot;&gt;
The PaQu web service makes it possible to search in syntactically annotated corpora in Dutch. You can parse your own Dutch text corpus or use one of two corpora provided by the developers.
&lt;/p&gt;
&lt;b&gt;Background&lt;/b&gt;
&lt;p style=&quot;text-align:justify&quot;&gt;
Please note: This is a preliminary version of this application. Use Lassy (&lt;a href=&quot;http://dev.clarin.nl/node/1966&quot;&gt;http://dev.clarin.nl/node/1966&lt;/a&gt;) for searching the Lassy Treebanks.&lt;br /&gt;&lt;br /&gt;

PaQu uses the Alpino parser to build treebanks from your own text corpus and lets you search these treebanks through an interface based on the LASSY Word Relations Search interface (&lt;a href=&quot;http://dev.clarin.nl/node/1966&quot;&gt;http://dev.clarin.nl/node/1966&lt;/a&gt;). Two treebanks are already available in the application: Lassy Klein (1M words, manually checked syntactic analysis) and Lassy Groot (700M words, syntactic analysis automatically assigned by Alpino).&lt;br /&gt;
PaQu offers two ways to search through the syntactically annotated texts. The first option is to use the search bar to look for word pairs, optionally complemented by their syntactic relationship. The second search option is to use the query language XPath.
&lt;/p&gt;
&lt;b&gt;Contacts&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Project leader: prof.dr. Gertjan van Noord  
&lt;/li&gt;&lt;li&gt; Developer: Drs. P. Kleiweg (Groningen University)
&lt;/li&gt;&lt;li&gt;CLARIN center: none
&lt;/li&gt;&lt;li&gt;Help contact : &lt;a href=&quot;mailto:p.c.j.kleiweg@rug.nl&quot;&gt;p.c.j.kleiweg@rug.nl&lt;/a&gt; &lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Links&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Web-sites: n.a.
&lt;/li&gt;&lt;li&gt;User scenarios (screencasts, screenshots): n.a. 
&lt;/li&gt;&lt;li&gt;Manual: &lt;a href=&quot;http://www.let.rug.nl/alfa/paqu/info.html&quot;&gt;http://www.let.rug.nl/alfa/paqu/info.html&lt;/a&gt;  (&lt;a href=&quot;http://zardoz.service.rug.nl:8067/info.html&quot;&gt;http://zardoz.service.rug.nl:8067/info.html&lt;/a&gt; )
&lt;/li&gt;&lt;li&gt;Tool/Service link: &lt;a href=&quot;http://www.let.rug.nl/alfa/paqu&quot;&gt;http://www.let.rug.nl/alfa/paqu&lt;/a&gt; (&lt;a href=&quot;http://zardoz.service.rug.nl:8067/&quot;&gt;http://zardoz.service.rug.nl:8067/&lt;/a&gt;)
&lt;/li&gt;&lt;li&gt;Publications: n.a.
&lt;/li&gt;&lt;/ul&gt;   &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Research domain  &lt;/h3&gt;

  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/49&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Computational Linguistics&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/58&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Linguistics&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/51&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Syntax&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Language  &lt;/h3&gt;

  &lt;div class=&quot;field-language&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/67&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Dutch&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Resource tags  &lt;/h3&gt;

  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/72&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;data&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/26&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;corpus&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/33&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text corpus&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/41&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;service&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/40&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/36&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;mono-lingual&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Tool task  &lt;/h3&gt;

  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/86&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;annotation&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/74&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;corpus exploration&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/77&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;browse&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/76&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;search&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/80&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/220&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;grammatical relation assignment&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/84&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;parsing&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Country  &lt;/h3&gt;

  &lt;div class=&quot;field-country&quot;&gt;
    Netherlands  &lt;/div&gt;
</description>
 <pubDate>Thu, 15 Jan 2015 13:00:04 +0000</pubDate>
 <dc:creator>Erica Renckens</dc:creator>
 <guid isPermaLink="false">4182 at http://portal.clarin.nl</guid>
 <comments>http://portal.clarin.nl/node/4182#comments</comments>
</item>
<item>
 <title>LASSY Word Relations Search</title>
 <link>http://portal.clarin.nl/node/1966</link>
 <description>
  &lt;div class=&quot;field-body&quot;&gt;
     &lt;img src=&quot;http://dev.clarin.nl/sites/default/files/Lassy.jpg&quot; width=&quot;400&quot; /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;h4&gt;LASSY Word Relations Search Web Application&lt;/h4&gt;
&lt;b&gt;Summary&lt;/b&gt;   
&lt;p style=&quot;text-align:justify&quot;&gt;
The LASSY word relations web application makes it possible to search for sentences that contain pairs of words between which there is a grammatical relation. One can search in the Dutch LASSY-SMALL Treebank (1 million tokens), in which the syntactic parse of each sentence has been manually verified, and in (a part of) the LASSY-LARGE Treebank (700 million tokens), in which the syntactic parse of each sentence has been added by the automatic parser Alpino.
&lt;/p&gt;
&lt;b&gt;Background&lt;/b&gt;
&lt;p style=&quot;text-align:justify&quot;&gt;
One can restrict the query to words of a particular part of speech, which is very useful in the case of syntactic ambiguities. One can also leave out the string of the word, to obtain e.g. a list of sentences in which any adverb modifies a given verb, or even any word modifies a given verb.
On the page that lists the sentences found, one can view the exact syntactic structure of each sentence with a single click. The application also provides detailed frequency information for all sentences and word pairs found.&lt;br /&gt;
The Lassy treebanks were created by KU Leuven and the Rijksuniversiteit Groningen with funding from the Dutch Language Union. One can obtain these treebanks through the HLT Agency (TST-Centrale).&lt;br /&gt;
Use PaQu (&lt;a href=&quot;http://dev.clarin.nl/node/4182&quot;&gt;http://dev.clarin.nl/node/4182&lt;/a&gt;) if you want to search for word pairs in your own text corpus. 
&lt;/p&gt;
&lt;b&gt;Contacts&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Project leader:  Gertjan van Noord &lt;/li&gt;&lt;li&gt;CLARIN center: none
&lt;/li&gt;&lt;li&gt;Help contact : &lt;a href=&quot;mailto:helpdesk@clarin.nl&quot;&gt;helpdesk@clarin.nl&lt;/a&gt; &lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Links&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Web-sites:  &lt;a href=&quot;http://www.let.rug.nl/~alfa/lassy/bin/lassy&quot;&gt;http://www.let.rug.nl/~alfa/lassy/bin/lassy&lt;/a&gt; 
&lt;/li&gt;&lt;li&gt;User scenarios (screencasts, screenshots):  &lt;a href=&quot;http://www.let.rug.nl/~alfa/lassy/&quot;&gt;http://www.let.rug.nl/~alfa/lassy/&lt;/a&gt; 
&lt;/li&gt;&lt;li&gt;Manual:  &lt;a href=&quot;http://www.let.rug.nl/~alfa/lassy/&quot;&gt;http://www.let.rug.nl/~alfa/lassy/&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;Tool/Service link: &lt;a href=&quot;http://www.let.rug.nl/~alfa/lassy/bin/lassy&quot;&gt;http://www.let.rug.nl/~alfa/lassy/bin/lassy&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;Publications:  see &lt;a href=&quot;http://www.let.rug.nl/vannoord/Lassy/&quot;&gt;http://www.let.rug.nl/vannoord/Lassy/&lt;/a&gt; 
&lt;/li&gt;&lt;/ul&gt;   &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Research domain  &lt;/h3&gt;

  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/58&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Linguistics&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/51&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Syntax&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Language  &lt;/h3&gt;

  &lt;div class=&quot;field-language&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/67&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Dutch&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Resource tags  &lt;/h3&gt;

  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/41&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;service&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/40&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/37&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;web-application&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/36&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;mono-lingual&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Tool task  &lt;/h3&gt;

  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/83&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;analysis&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/74&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;corpus exploration&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/76&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;search&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Country  &lt;/h3&gt;

  &lt;div class=&quot;field-country&quot;&gt;
    Netherlands  &lt;/div&gt;
</description>
 <pubDate>Tue, 17 Jun 2014 09:41:35 +0000</pubDate>
 <dc:creator>Jan Odijk</dc:creator>
 <guid isPermaLink="false">1966 at http://portal.clarin.nl</guid>
 <comments>http://portal.clarin.nl/node/1966#comments</comments>
</item>
<item>
 <title>TTNWW</title>
 <link>http://portal.clarin.nl/node/1964</link>
 <description>
  &lt;div class=&quot;field-body&quot;&gt;
     &lt;img src=&quot;http://dev.clarin.nl/sites/default/files/ttnww.jpg&quot; height=&quot;400&quot; width=&quot;400&quot; /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;h4&gt;TTNWW - TST Tools voor het Nederlands als Webservices in een Workflow&lt;/h4&gt;
&lt;b&gt;Summary&lt;/b&gt;   
&lt;p style=&quot;text-align:justify&quot;&gt;
TTNWW integrates and makes available existing Language Technology (LT) software components for the Dutch language that have been developed in the STEVIN and CGN projects.
The LT components (for text and speech) are made available as web-services in a simplified &lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/122&quot;&gt;&lt;dfn title=&quot;A workflow consists of a sequence of concatenated (connected) steps. Emphasis is on the flow paradigm, where each step follows the precedent without delay or gap and ends just before the subsequent step may begin. This concept is related to non overlapping tasks of single resources.
&quot;&gt;workflow&lt;/dfn&gt;&lt;/a&gt; system that enables researchers without much technical background to use standard LT &lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/122&quot;&gt;&lt;dfn title=&quot;A workflow consists of a sequence of concatenated (connected) steps. Emphasis is on the flow paradigm, where each step follows the precedent without delay or gap and ends just before the subsequent step may begin. This concept is related to non overlapping tasks of single resources.
&quot;&gt;workflow&lt;/dfn&gt;&lt;/a&gt; recipes.
&lt;/p&gt;
&lt;p&gt;&lt;b&gt;Background&lt;/b&gt;&lt;/p&gt;
&lt;p style=&quot;text-align:justify&quot;&gt;
The web services are available in two separate domains: &quot;Text&quot; and &quot;Speech&quot; processing. For &quot;Text&quot;, TTNWW offers workflows for the following functionality:
&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Orthographic Normalisation using TICCLops (version CLARIN-NL 1.0) &lt;/li&gt;
&lt;li&gt;Part of Speech Tagging, Lemmatisation, Chunking, limited Multiword Unit Recognition, and Grammatical Relation Assignment by Frog (Version 012.012) &lt;/li&gt;
&lt;li&gt;Syntactic Parsing (including grammatical relation assignment, limited named entity recognition, and limited multiword unit recognition) by the Alpino Parser (version 1.3)&lt;/li&gt;
&lt;li&gt;Semantic Annotation&lt;/li&gt;
&lt;li&gt;Named Entity Recognition&lt;/li&gt;
&lt;li&gt;Co-reference Assignment&lt;/li&gt;
&lt;/ul&gt;

For &quot;Speech&quot;, the following workflows are offered:
&lt;ul&gt;&lt;li&gt;Automatic Transcription of speech files using a Netherlands Dutch acoustic model&lt;/li&gt;
&lt;li&gt;Automatic Transcription of speech files using a Flemish Dutch acoustic model&lt;/li&gt;
&lt;li&gt;Conversion of the input speech file to the required sampling rate, followed by automatic transcription&lt;/li&gt;
&lt;/ul&gt;
 
The TTNWW services have been created in a Dutch and Flemish collaboration project building on the results of past Dutch and Flemish projects. The web services are partly deployed in the SURF-SARA BiG-Grid cloud or at CLARIN centres in the Netherlands and at CLARIN VL University partners.&lt;br /&gt;&lt;br /&gt;
The architecture of the TTNWW portal consists of several components and follows the principles of Service Oriented Architecture (&lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/168&quot;&gt;&lt;dfn title=&quot;Service-oriented architecture
See: http://en.wikipedia.org/wiki/Service-oriented_architecture&quot;&gt;SOA&lt;/dfn&gt;&lt;/a&gt;). The TTNWW GUI front-end is a Flex module that communicates with the TTNWW web application, which keeps track of the different sessions and knows which LT recipes are available. TTNWW communicates assignments (&lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/122&quot;&gt;&lt;dfn title=&quot;A workflow consists of a sequence of concatenated (connected) steps. Emphasis is on the flow paradigm, where each step follows the precedent without delay or gap and ends just before the subsequent step may begin. This concept is related to non overlapping tasks of single resources.
&quot;&gt;workflow&lt;/dfn&gt;&lt;/a&gt; specifications) to the WorkflowService, which evaluates the requested &lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/122&quot;&gt;&lt;dfn title=&quot;A workflow consists of a sequence of concatenated (connected) steps. Emphasis is on the flow paradigm, where each step follows the precedent without delay or gap and ends just before the subsequent step may begin. This concept is related to non overlapping tasks of single resources.
&quot;&gt;workflow&lt;/dfn&gt;&lt;/a&gt; and requests the DeploymentService to start the required LT web services. After initialization of the LT web services, the &lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/122&quot;&gt;&lt;dfn title=&quot;A workflow consists of a sequence of concatenated (connected) steps. Emphasis is on the flow paradigm, where each step follows the precedent without delay or gap and ends just before the subsequent step may begin. This concept is related to non overlapping tasks of single resources.
&quot;&gt;workflow&lt;/dfn&gt;&lt;/a&gt; specification is sent to the Taverna Server, which takes further care of the &lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/122&quot;&gt;&lt;dfn title=&quot;A workflow consists of a sequence of concatenated (connected) steps. Emphasis is on the flow paradigm, where each step follows the precedent without delay or gap and ends just before the subsequent step may begin. This concept is related to non overlapping tasks of single resources.
&quot;&gt;workflow&lt;/dfn&gt;&lt;/a&gt;.
&lt;br /&gt;&lt;br /&gt;
To facilitate the process of wrapping applications that were originally designed as standalone applications into web services, the CLAM (Computational Linguistics Application Mediator) wrapper software allows for easy and transparent transformation of applications into RESTful web services.
The CLAM software has been used extensively in the TTNWW project for both text and speech processing tools. With the exception of Alpino and MBSRL, all web services run on CLAM wrappers.
&lt;br /&gt;&lt;br /&gt;
Given the number of web services involved in the TTNWW project and the possibilities offered by the cloud environment, the preferred method of delivering the web service installations was for the LT providers to deliver complete virtual machine images. These could be uploaded directly into the cloud environment, thus relieving the CLARIN centres and LT providers of the originally foreseen task of running the web services themselves. A potential advantage of this method, which has not yet been exploited in the project, is that these images may also be delivered directly to end users, who can then run them in a local configuration using virtualization software such as VMware or VirtualBox.
&lt;br /&gt;&lt;br /&gt;
The &lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/122&quot;&gt;&lt;dfn title=&quot;A workflow consists of a sequence of concatenated (connected) steps. Emphasis is on the flow paradigm, where each step follows the precedent without delay or gap and ends just before the subsequent step may begin. This concept is related to non overlapping tasks of single resources.
&quot;&gt;workflow&lt;/dfn&gt;&lt;/a&gt; engine used in the project was Taverna. Built on top of this were a number of selectable task recipes, following a task-oriented approach in line with the premise that users with little or no technical expertise should be able to use the system. In this context, tasks are understood in terms of the end results of processes such as semantic role labelling, POS tagging or syntactic analysis, and ready-made workflows are constructed that can be readily used by the end user.

&lt;b&gt;Contacts&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Project leader: Marc Kemps-Snijders (NL), Ineke Schuurman (VL)
 &lt;/li&gt;&lt;li&gt;CLARIN center: Meertens Institute (portal) and others (web-services).
&lt;/li&gt;&lt;li&gt;Help contact : n.a.&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Links&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Web-sites: n.a.
&lt;/li&gt;&lt;li&gt;User scenarios (screencasts, screenshots): n.a. 
&lt;/li&gt;&lt;li&gt;Manual: &lt;a href=&quot;http://yago.meertens.knaw.nl/apache/TTNWW/assets/TTNWW.pdf&quot;&gt;http://yago.meertens.knaw.nl/apache/TTNWW/assets/TTNWW.pdf&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;Tool/Service link: &lt;a href=&quot;http://yago.meertens.knaw.nl/apache/TTNWW/&quot;&gt;http://yago.meertens.knaw.nl/apache/TTNWW/&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;Publications: n.a.
&lt;/li&gt;&lt;/ul&gt;   &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Research domain  &lt;/h3&gt;

  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/45&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Communication &amp;amp; Media Studies&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/50&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;History&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/209&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Oral History&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/58&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Linguistics&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/218&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Morpho-syntax&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/207&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Orthography&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/54&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Discourse&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/53&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Semantics&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/51&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Syntax&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Language  &lt;/h3&gt;

  &lt;div class=&quot;field-language&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/67&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Dutch&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Resource tags  &lt;/h3&gt;

  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/41&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;service&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/39&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;speech processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/40&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/37&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;web-application&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/24&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;web-service&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Tool task  &lt;/h3&gt;

  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/84&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;parsing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/89&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;audio-visual processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/90&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;speech recognition&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/223&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;speech transcription&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/224&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;up/down sampling&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/80&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/222&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;chunking&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/219&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;co-reference assignment&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/220&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;grammatical relation assignment&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/221&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;multiword unit identification&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/208&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;orthographic normalisation&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/81&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;POS tagging&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/73&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;tokenization&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/79&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;lemmatisation&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/91&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;NE recognition&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    CLARIN centre  &lt;/h3&gt;

  &lt;div class=&quot;field-clarin-centre-ref&quot;&gt;
    &lt;a href=&quot;/node/1935&quot;&gt;Meertens/HuC &lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Country  &lt;/h3&gt;

  &lt;div class=&quot;field-country&quot;&gt;
    Netherlands  &lt;/div&gt;
</description>
 <pubDate>Sun, 15 Jun 2014 13:01:54 +0000</pubDate>
 <dc:creator>root</dc:creator>
 <guid isPermaLink="false">1964 at http://portal.clarin.nl</guid>
 <comments>http://portal.clarin.nl/node/1964#comments</comments>
</item>
<item>
 <title>DCS</title>
 <link>http://portal.clarin.nl/node/1963</link>
 <description>
  &lt;div class=&quot;field-body&quot;&gt;
     &lt;h4&gt;DCS - Data Curation Service&lt;/h4&gt;
&lt;b&gt;Summary&lt;/b&gt;   
&lt;p style=&quot;text-align:justify&quot;&gt;
The Data Curation Service (DCS) was a CLARIN task force responsible for curating several language resources that were not available online and/or were endangered for lack of a proper documentation context (curation involved adding &lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/130&quot;&gt;&lt;dfn title=&quot;Component Metadata Infrastructure
See: http://www.clarin.eu/cmdi&quot;&gt;CMDI&lt;/dfn&gt;&lt;/a&gt; metadata, among other things). The DCS curated these data and made them available by depositing them at different CLARIN centres. 
&lt;/p&gt;
&lt;b&gt;Background&lt;/b&gt;
&lt;p style=&quot;text-align:justify&quot;&gt;
In October 2011 the Data Curation Service (DCS) was established. The DCS was supported by CLARIN-NL until 1 January 2014. The DCS was charged with:
&lt;/p&gt;&lt;ul&gt;&lt;li&gt;the curation of resources, especially those presently held by individual researchers or research groups;
&lt;/li&gt;&lt;li&gt;assisting in the curation efforts of CLARIN centres (if and when such is desired);
&lt;/li&gt;&lt;li&gt;advising researchers who wanted to undertake the curation of their resources themselves.
&lt;/li&gt;&lt;/ul&gt;
The curation of resources held by individual researchers or research groups formed the core of the work undertaken by the DCS. 
The project was run from the Centre for Language and Speech Technology (CLST) at Radboud University Nijmegen and was led by Nelleke Oostdijk (head) and Henk van den Heuvel (deputy head).
&lt;br /&gt;&lt;br /&gt;
The DCS was involved in the curation of the following resources:&lt;br /&gt;
Completed:
&lt;ul&gt;&lt;li&gt;LESLLA (delivered in August 2013)
&lt;ul&gt;&lt;li&gt;CLARIN centre: MPI for Psycholinguistics
&lt;/li&gt;&lt;li&gt;link: &lt;a href=&quot;http://hdl.handle.net/1839/00-37EBCC6D-04A5-4598-88E2-E0F390D5FCE1@format=imdi&quot;&gt;http://hdl.handle.net/1839/00-37EBCC6D-04A5-4598-88E2-E0F390D5FCE1@forma...&lt;/a&gt;@view
&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li&gt;IPNV Interviews with veterans (delivered in May 2013)
&lt;ul&gt;&lt;li&gt;CLARIN centre: &lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/131&quot;&gt;&lt;dfn title=&quot;Data Archiving and Networked Services
See: http://www.dans.knaw.nl&quot;&gt;DANS&lt;/dfn&gt;&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;link: &lt;a href=&quot;https://easy.dans.knaw.nl/ui/?wicket:bookmarkablePage=:nl.knaw.dans.easy.web.search.pages.PublicSearchResultPage&amp;amp;q=IPNV&quot;&gt;https://easy.dans.knaw.nl/ui/?wicket:bookmarkablePage=:nl.knaw.dans.easy...&lt;/a&gt;
&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li&gt;Woordenboek Gelderse Dialecten, Rivierengebied (delivered in June 2013)
&lt;ul&gt;&lt;li&gt;CLARIN centre: Meertens Institute &lt;/li&gt;&lt;li&gt;link: ? &lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li&gt;Woordenboek Gelderse Dialecten, Veluwe (part)  (delivered in June 2013)
&lt;ul&gt;&lt;li&gt;CLARIN centre: Meertens Institute &lt;/li&gt;&lt;li&gt;link: ? &lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li&gt;Curation organisation names for OpenSkos (delivered in June 2013; finalised in August 2013)
&lt;/li&gt;&lt;li&gt;Six dialect dictionaries from Brabant (delivered in October 2013)
&lt;ul&gt;&lt;li&gt;CLARIN centre: Meertens Institute &lt;/li&gt;&lt;li&gt;link: ? &lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li&gt;WLD and WBD part III (delivered in December 2013)
&lt;ul&gt;&lt;li&gt;CLARIN centre: Meertens Institute &lt;/li&gt;&lt;li&gt;link: ? &lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li&gt;DBD/TCULT (delivered in February 2014)
&lt;ul&gt;&lt;li&gt;CLARIN centre: MPI for Psycholinguistics
&lt;/li&gt;&lt;li&gt;link: &lt;a href=&quot;https://corpus1.mpi.nl/ds/asv/?openpath=node:84720&quot;&gt;https://corpus1.mpi.nl/ds/asv/?openpath=node:84720&lt;/a&gt;
&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;
Unfinished (and being considered by CLARIN centres for adoption):
&lt;ul&gt;&lt;li&gt;Roots of Ethnolects (collection Linda van Meel) 
&lt;/li&gt;&lt;li&gt;Traces of Contact (collection Kofi Yakpo)
&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Contacts&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Project leader: Nelleke Oostdijk
&lt;/li&gt;&lt;li&gt;CLARIN center: several
&lt;/li&gt;&lt;li&gt;Help contact : &lt;a href=&quot;https://www.ru.nl/letteren/datacuratieservice/informatie/contact/&quot;&gt;https://www.ru.nl/letteren/datacuratieservice/informatie/contact/&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Links&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Web-sites: &lt;a href=&quot;http://www.ru.nl/letteren/datacuratieservice/&quot;&gt;http://www.ru.nl/letteren/datacuratieservice/&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;User scenario&#039;s (screencasts, screenshots): n.a. 
&lt;/li&gt;&lt;li&gt;Manual: n.a.
&lt;/li&gt;&lt;li&gt;Tool/Service link: n.a.
&lt;/li&gt;&lt;li&gt;Publications: &lt;ul&gt;&lt;li&gt;Oostdijk, N. &amp;amp; H. van den Heuvel. 2012. Introducing the CLARIN-NL Data Curation Service. In Proceedings of the Workshop Challenges in the management of large corpora. LREC2012, Istanbul, 22 May 2012. &lt;a href=&quot;http://www.lrec-conf.org/proceedings/lrec2012/index.html&quot;&gt;http://www.lrec-conf.org/proceedings/lrec2012/index.html&lt;/a&gt;. Retrieval date: 20 March 2014. 
&lt;/li&gt;&lt;li&gt;Heuvel, H. van den, Oostdijk, N.H.J., Sanders, E.P. &amp;amp; Lint, V. de (2014). Data curations by the Dutch Data Curation Service. In Proceedings CLARIN Annual Conference (pp. online). &lt;a href=&quot;http://www.clarin.eu/sites/default/files/cac2014_submission_15_0.pdf&quot;&gt;http://www.clarin.eu/sites/default/files/cac2014_submission_15_0.pdf&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;Oostdijk, N., Heuvel, H. van den, Treurniet, M. (2013): The CLARIN-NL Data Curation Service. Bringing data to the foreground. International Journal of Digital Curation, Vol 8, No 2 (2013), pp. 134-145.&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;   &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Resource tags  &lt;/h3&gt;

  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/41&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;service&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/39&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;speech processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/40&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Country  &lt;/h3&gt;

  &lt;div class=&quot;field-country&quot;&gt;
    Netherlands  &lt;/div&gt;
</description>
 <pubDate>Mon, 09 Jun 2014 09:40:50 +0000</pubDate>
 <dc:creator>root</dc:creator>
 <guid isPermaLink="false">1963 at http://portal.clarin.nl</guid>
 <comments>http://portal.clarin.nl/node/1963#comments</comments>
</item>
<item>
 <title>NameScape</title>
 <link>http://portal.clarin.nl/node/1940</link>
 <description>
  &lt;div class=&quot;field-body&quot;&gt;
     &lt;img src=&quot;http://dev.clarin.nl/sites/default/files/Namescape.jpg&quot; height=&quot;400&quot; width=&quot;400&quot; /&gt;&lt;h4&gt;NameScape: Mapping the Landscape of Names in Modern Dutch Literature&lt;/h4&gt;
&lt;b&gt;Summary&lt;/b&gt;   
&lt;p&gt;
Searching and visualizing Named Entities in modern Dutch novels.
&lt;/p&gt;
&lt;b&gt;Background&lt;/b&gt;
&lt;p&gt;
The named entity (NE) tagging and resolution in NameScape enable quantitative and repeatable research where previously only guesswork and anecdotal evidence were available. The visualisation module enables researchers with a less technical background to draw conclusions about the functions of names in literary works and helps them explore the material in search of more interesting questions (and answers).
Users from other communities (sociolinguistics, sentiment analysis, …) also benefit from the NE-tagged data, especially since the NE recognizer is available as a web service, enabling researchers to annotate their own research data.

Datasets in NameScape (total of 1,129 books):
&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Corpus Sanders: A corpus of 582 Dutch novels written and published between 1970 and 2009.
&lt;/li&gt;&lt;li&gt;Corpus Huygens: Consists of 22 novels manually tagged with detailed named entity information. IPR restrictions on this corpus do not allow distribution.
&lt;/li&gt;&lt;li&gt;Corpus eBooks: Consists of 7000+ Dutch eBooks tagged automatically with basic NER features and person name part information. IPR restrictions on this corpus do not allow distribution.
&lt;/li&gt;&lt;li&gt;Corpus SoNaR Books: 105 Dutch books, NE-tagged.
&lt;/li&gt;&lt;li&gt;Corpus Gutenberg Dutch: Consists of 530 NE-tagged &lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/167&quot;&gt;&lt;dfn title=&quot;Text Encoding Initiative&amp;#13;&amp;#10;See: http://en.wikipedia.org/wiki/Text_Encoding_Initiative&quot;&gt;TEI&lt;/dfn&gt;&lt;/a&gt; files converted from the ePub versions of the corresponding Gutenberg documents.
&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Contacts&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Project leader:  dr. Karina van Dalen-Oskam (Huygens ING)
&lt;/li&gt;&lt;li&gt;CLARIN center: Huygens ING
&lt;/li&gt;&lt;li&gt;Help contact : &lt;a href=&quot;mailto:karina.van.dalen@huygens.knaw.nl&quot;&gt;karina.van.dalen@huygens.knaw.nl&lt;/a&gt; 
&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Links&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Web-sites: &lt;a href=&quot;http://www.namescape.nl/&quot;&gt;http://www.namescape.nl/&lt;/a&gt; 
&lt;/li&gt;&lt;li&gt;User scenario&#039;s (demo): &lt;a href=&quot;http://dev.clarin.nl/sites/default/files/Namescape_demonstration_scenario.pdf&quot;&gt;http://dev.clarin.nl/sites/default/files/Namescape_demonstration_scenari...&lt;/a&gt; 
&lt;/li&gt;&lt;li&gt;Manual: &lt;a href=&quot;http://ner.namescape.nl/namescape/web/help.html&quot;&gt;http://ner.namescape.nl/namescape/web/help.html&lt;/a&gt;  
&lt;/li&gt;&lt;li&gt;Metadata (&lt;a class=&quot;lexicon-term&quot; href=&quot;/taxonomy/term/155&quot;&gt;&lt;dfn title=&quot;Virtual Language Observatory&amp;#13;&amp;#10;See: http://www.clarin.eu/vlo&quot;&gt;VLO&lt;/dfn&gt;&lt;/a&gt;): &lt;a href=&quot;http://catalog.clarin.eu/vlo/search?fq=collection:Namescape:+mapping+the+onymic+landscape&quot;&gt;http://catalog.clarin.eu/vlo/search?fq=collection:Namescape:+mapping+the...&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;Tool/Service link:
&lt;ul&gt;&lt;li&gt;Named entity tagger: &lt;a href=&quot;http://ner.namescape.nl/namescape/tagger&quot;&gt;http://ner.namescape.nl/namescape/tagger&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;Search: &lt;a href=&quot;http://search.namescape.nl/&quot;&gt;http://search.namescape.nl/&lt;/a&gt; 
&lt;/li&gt;&lt;li&gt;Barcode browser: &lt;a href=&quot;http://barcode-browser.namescape.nl/index.xql&quot;&gt;http://barcode-browser.namescape.nl/index.xql&lt;/a&gt; 
&lt;/li&gt;&lt;li&gt;Visualiser: &lt;a href=&quot;http://visualizer.namescape.nl/&quot;&gt;http://visualizer.namescape.nl/&lt;/a&gt; 
&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li&gt;Publications: Karina van Dalen-Oskam (2013), Nordic Noir: a background check on Inspector Van Veeteren, 31 May 2012, &lt;a href=&quot;http://blog.namescape.nl/?p=47&quot;&gt;http://blog.namescape.nl/?p=47&lt;/a&gt;
&lt;/li&gt;&lt;/ul&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Research domain  &lt;/h3&gt;

  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/49&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Computational Linguistics&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/44&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Literary Studies&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Language  &lt;/h3&gt;

  &lt;div class=&quot;field-language&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/67&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Dutch&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Resource tags  &lt;/h3&gt;

  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/26&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;corpus&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/33&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text corpus&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/41&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;service&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/40&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/37&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;web-application&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/24&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;web-service&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/36&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;mono-lingual&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Tool task  &lt;/h3&gt;

  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/76&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;search&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/80&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/91&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;NE recognition&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    CLARIN centre  &lt;/h3&gt;

  &lt;div class=&quot;field-clarin-centre-ref&quot;&gt;
    &lt;a href=&quot;/node/1938&quot;&gt;Huygens ING&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Country  &lt;/h3&gt;

  &lt;div class=&quot;field-country&quot;&gt;
    Netherlands  &lt;/div&gt;
</description>
 <pubDate>Sun, 27 Apr 2014 07:52:07 +0000</pubDate>
 <dc:creator>root</dc:creator>
 <guid isPermaLink="false">1940 at http://portal.clarin.nl</guid>
 <comments>http://portal.clarin.nl/node/1940#comments</comments>
</item>
<item>
 <title>INPOLDER</title>
 <link>http://portal.clarin.nl/node/1927</link>
 <description>
  &lt;div class=&quot;field-body&quot;&gt;
     &lt;!--
&lt;img src=&quot;http://dev.clarin.nl/sites/default/files/mouths.jpg&quot; height=&quot;400&quot; width=&quot;400&quot;/&gt;
&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;
--&gt;
&lt;h4&gt;INPOLDER: Integrated Parser and Lemmatizer Dutch in Retrospect&lt;/h4&gt;
&lt;b&gt;Summary&lt;/b&gt;   
&lt;p style=&quot;text-align:justify&quot;&gt;
INPOLDER (Integrated Parser and Lemmatizer of Dutch in Retrospect) provides a tool that performs morphological tagging, lemmatization and syntactic parsing of historical Dutch texts. It is built on the Adelheid tool (tagging and lemmatization) and the Collins-Bikel statistical parser.
&lt;/p&gt;
&lt;b&gt;Background&lt;/b&gt;
&lt;p style=&quot;text-align:justify&quot;&gt;
The Dutch historical record is an essential part of Dutch cultural heritage, and it is vitally important that it be made accessible for research into a wide range of historical and linguistic questions. In the transition from the Middle Ages to the modern era, the Netherlands developed from an area where a diverse group of dialects was spoken (Hollandic, Brabantic, Flemish, North-eastern, Limburgian) into a country with a standard language, and there is good reason to believe that this process was an extremely dynamic one. Systematic research into these processes, which affected syntax, phonology, morphology and spelling, cannot be done without access to lemmatized, tagged and parsed corpora of historical Dutch. In recent years, a tagger-lemmatizer has been developed by Hans van Halteren (Adelheid, also available in the CLARIN infrastructure). INPOLDER complements this enrichment tool with a parser for historical Dutch. 
&lt;br /&gt;&lt;br /&gt;
The INPOLDER parser was trained on a subset of the corpus of fourteenth-century texts (Corpus van Reenen/Mulder, CRM; van Reenen and Mulder, 1993; Rem, 2003) and a subset of the Drenthe corpus (DC). CRM consists of 2700 charters from 345 places of origin. The corpus was designed to be representative of local language use in Middle Dutch and to be suitable for all types of linguistic research.

&lt;/p&gt;
&lt;b&gt;Contacts&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Project leader:  Prof. Dr. Ans van Kemenade (Radboud University)
&lt;/li&gt;&lt;li&gt;CLARIN center: Meertens Institute
&lt;/li&gt;&lt;li&gt;Help contact : gertjan.postmaATmeertens.knaw.nl (linguistic issues), marc.kemps.snijdersATmeertens.knaw.nl (tech issues)
&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Links&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Web-sites: &lt;a href=&quot;http://194.171.119.69/InPolderClient/inpolder.html&quot;&gt;http://194.171.119.69/InPolderClient/inpolder.html&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;User scenario&#039;s (screencasts, screenshots): &lt;a href=&quot;http://dev.clarin.nl/sites/default/files/User%20manual-demonstatiescenarioINPOLDER.pdf&quot;&gt;http://dev.clarin.nl/sites/default/files/User%20manual-demonstatiescenar...&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;Manual: &lt;a href=&quot;http://dev.clarin.nl/sites/default/files/User%20manual-demonstatiescenarioINPOLDER.pdf&quot;&gt;http://dev.clarin.nl/sites/default/files/User%20manual-demonstatiescenar...&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;Tool/Service link: &lt;a href=&quot;http://194.171.119.69/InPolderClient/inpolder.html&quot;&gt;http://194.171.119.69/InPolderClient/inpolder.html&lt;/a&gt; 
&lt;/li&gt;&lt;li&gt;Publications: n.a.
&lt;/li&gt;&lt;/ul&gt;   &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Research domain  &lt;/h3&gt;

  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/58&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Linguistics&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/51&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Syntax&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/66&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;historical linguistics&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Language  &lt;/h3&gt;

  &lt;div class=&quot;field-language&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/67&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Dutch&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Resource tags  &lt;/h3&gt;

  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/41&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;service&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/40&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/37&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;web-application&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/36&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;mono-lingual&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Tool task  &lt;/h3&gt;

  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/84&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;parsing&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    CLARIN centre  &lt;/h3&gt;

  &lt;div class=&quot;field-clarin-centre-ref&quot;&gt;
    &lt;a href=&quot;/node/1935&quot;&gt;Meertens/HuC &lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Country  &lt;/h3&gt;

  &lt;div class=&quot;field-country&quot;&gt;
    Netherlands  &lt;/div&gt;
</description>
 <pubDate>Mon, 21 Apr 2014 14:39:57 +0000</pubDate>
 <dc:creator>root</dc:creator>
 <guid isPermaLink="false">1927 at http://portal.clarin.nl</guid>
 <comments>http://portal.clarin.nl/node/1927#comments</comments>
</item>
<item>
 <title>TICClops</title>
 <link>http://portal.clarin.nl/node/1914</link>
 <description>
  &lt;div class=&quot;field-body&quot;&gt;
     &lt;img src=&quot;http://dev.clarin.nl/sites/default/files/TICClops.jpg&quot; width=&quot;150&quot; /&gt;&lt;h4&gt;TiCCLops: Text-Induced Corpus Clean-up online processing system&lt;/h4&gt;
&lt;b&gt;Summary&lt;/b&gt;   
&lt;p style=&quot;text-align:justify&quot;&gt;
TICCL (Text-Induced Corpus Clean-up) is a system designed to detect and correct typographical errors (misprints) and OCR (optical character recognition) errors in texts. It searches a corpus for all existing variants of (potentially) all words occurring in it. The corpus can be one text or several, in one or more directories, located on one or more machines. TICCL creates word frequency lists, listing for each word type how often it occurs in the corpus. The frequency of a normalized word form is the sum of the frequencies of the actual word forms found in the corpus. When books or other texts are scanned from paper and the resulting images are converted into digital text files, errors occur. For instance, the letter combination &#039;in&#039; can be read as &#039;m&#039;, so that the word &#039;regeering&#039; is incorrectly reproduced as &#039;regeermg&#039;. TICCL can be used to detect such errors and to suggest a correct form.
&lt;/p&gt;
&lt;b&gt;Background&lt;/b&gt;
&lt;p style=&quot;text-align:justify&quot;&gt;
Text-Induced Corpus Clean-up (TICCL) was first developed as a prototype at the request of the Koninklijke Bibliotheek - The Hague (KB) and was reworked into a production tool according to KB specifications, mainly during the second half of 2008 (currently at production version 2.0). It is a fully functional environment for processing possibly very large corpora in order to remove most of the undesirable lexical variation in them. It has provisions for various input and output formats, is flexible and robust, and has very high recall and acceptable precision. As a spelling variation detection system it is, to the developer’s knowledge, unique in making principled use of the input text as a possible source of canonical target forms. As such it is far less domain-sensitive than other approaches: the domain is largely covered by the input text collection.
&lt;/p&gt;
&lt;b&gt;Contacts&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Project leader: dr. Martin Reynaert (Tilburg University)
 &lt;/li&gt;&lt;li&gt;CLARIN center: The Institute for Dutch Lexicology (INL)
&lt;/li&gt;&lt;li&gt;Help contact: n.a.
 &lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Links&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Web-sites: n.a.
&lt;/li&gt;&lt;li&gt;User scenarios (screencasts, screenshots): n.a.
&lt;/li&gt;&lt;li&gt;Manual: &lt;a href=&quot;http://ticclops.uvt.nl/ticclops_manual.v101.pdf&quot;&gt;http://ticclops.uvt.nl/ticclops_manual.v101.pdf&lt;/a&gt; 
&lt;/li&gt;&lt;li&gt;Tool/Service link: (classic CLAM interface) &lt;a href=&quot;http://ticclops.clarin.inl.nl/ticclops/&quot;&gt;http://ticclops.clarin.inl.nl/ticclops/&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;Tool/Service link: (@PHILOSTEI interface) &lt;a href=&quot;http://ticclops.clarin.inl.nl/philostei/&quot;&gt;http://ticclops.clarin.inl.nl/philostei/&lt;/a&gt;
&lt;/li&gt;&lt;li&gt;Publications: 
&lt;ul&gt;&lt;li&gt;Reynaert, M. (2008). All, and only, the errors: More complete and consistent spelling and OCR-error correction evaluation. In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC’08), Marrakech, Morocco.
&lt;/li&gt;&lt;li&gt;Reynaert, M. (2010). Character confusion versus focus word-based correction of spelling and OCR variants in corpora. International Journal on Document Analysis and Recognition, pp. 1-15. DOI: &lt;a href=&quot;http://dx.doi.org/10.1007/s10032-010-0133-5&quot;&gt;http://dx.doi.org/10.1007/s10032-010-0133-5&lt;/a&gt;
&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;   &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Research domain  &lt;/h3&gt;

  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/58&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Linguistics&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-research-domain&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/207&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Orthography&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Language  &lt;/h3&gt;

  &lt;div class=&quot;field-language&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/67&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Dutch&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Resource tags  &lt;/h3&gt;

  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/41&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;service&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/40&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-resource-tags&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/37&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;web-application&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Tool task  &lt;/h3&gt;

  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/80&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;text processing&lt;/a&gt;  &lt;/div&gt;
  &lt;div class=&quot;field-tool-task&quot;&gt;
    &lt;a href=&quot;/taxonomy/term/208&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;orthographic normalisation&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    CLARIN centre  &lt;/h3&gt;

  &lt;div class=&quot;field-clarin-centre-ref&quot;&gt;
    &lt;a href=&quot;/node/1936&quot;&gt;Dutch Language Institute&lt;/a&gt;  &lt;/div&gt;
  &lt;h3 class=&quot;field-label&quot;&gt;
    Country  &lt;/h3&gt;

  &lt;div class=&quot;field-country&quot;&gt;
    Netherlands  &lt;/div&gt;
</description>
 <pubDate>Sun, 20 Apr 2014 14:24:40 +0000</pubDate>
 <dc:creator>root</dc:creator>
 <guid isPermaLink="false">1914 at http://portal.clarin.nl</guid>
 <comments>http://portal.clarin.nl/node/1914#comments</comments>
</item>
</channel>
</rss>
