RC-A1
From UNL Wiki
(Difference between revisions)
(→The corpus500) |
(→The corpus500) |
||
Line 2: | Line 2: | ||
== The corpus<sup>500</sup> == | == The corpus<sup>500</sup> == | ||
− | + | *Corpus 500 according to the complexity of the graphs | |
− | + | ||
− | + | ||
− | + | ||
− | *Corpus 500 according to the complexity of the graphs | + | |
{| border="1" cellpadding="2" align=center | {| border="1" cellpadding="2" align=center | ||
|+Corpus | |+Corpus | ||
Line 94: | Line 90: | ||
|[http://www.unlweb.net/resources/geneva2012/problems.txt problems_unl.txt] | |[http://www.unlweb.net/resources/geneva2012/problems.txt problems_unl.txt] | ||
|} | |} | ||
+ | *The whole corpus in one single file | ||
+ | **[http://www.unlweb.net/resources/geneva2012/corpus_eng.txt Corpus500 in English], experimental corpus in English (500 sentences), to be manually translated to the target languages, in order to be used as the input for [[IAN]] | ||
+ | **[http://www.unlweb.net/resources/geneva2012/corpus_unl.txt Corpus500 in UNL], experimental corpus in UNL (500 graphs), to be used as the input for [[EUGENE]] |
Revision as of 20:21, 23 July 2012
The Corpus500 is an experimental corpus used to prepare the initial versions of the grammar for sentence-based UNLization and NLization, using IAN and EUGENE, respectively. It comprises a list of 500 sentences in English and their corresponding graphs in UNL, and is supposed to cover very basic linguistic phenomena.
The corpus500
- Corpus 500 according to the complexity of the graphs
- The whole corpus in one single file
- Corpus500 in English, experimental corpus in English (500 sentences), to be manually translated to the target languages, in order to be used as the input for IAN
- Corpus500 in UNL, experimental corpus in UNL (500 graphs), to be used as the input for EUGENE