RC-A1
From UNL Wiki
(Difference between revisions)
Line 1: | Line 1: | ||
The Corpus<sup>500</sup> is an experimental corpus used to prepare the initial versions of the grammar for sentence-based [[UNLization]] and [[NLization]], using [[IAN]] and [[EUGENE]], respectively. It comprises a list of 500 sentences in English and their corresponding graphs in UNL, and is supposed to cover very basic linguistic phenomena. | The Corpus<sup>500</sup> is an experimental corpus used to prepare the initial versions of the grammar for sentence-based [[UNLization]] and [[NLization]], using [[IAN]] and [[EUGENE]], respectively. It comprises a list of 500 sentences in English and their corresponding graphs in UNL, and is supposed to cover very basic linguistic phenomena. | ||
− | == The | + | == The corpus<sup>500</sup> == |
+ | *The whole corpus in one single file | ||
+ | **[http://www.unlweb.net/resources/geneva2012/corpus_eng.txt Corpus 500] Experimental corpus in English (500 sentences), to be manually translated to the target languages, in order to be used as the input for IAN | ||
+ | *UNL | ||
+ | **[http://www.unlweb.net/resources/geneva2012/corpus_unl.txt Corpus 500], Experimental corpus in UNL (500 sentences), to be used as the input for EUGENE | ||
*Corpus 500 according to the complexity of the graphs (the same as above, but split in different files) | *Corpus 500 according to the complexity of the graphs (the same as above, but split in different files) | ||
{| border="1" cellpadding="2" align=center | {| border="1" cellpadding="2" align=center |
Revision as of 00:10, 23 July 2012
The Corpus500 is an experimental corpus used to prepare the initial versions of the grammar for sentence-based UNLization and NLization, using IAN and EUGENE, respectively. It comprises a list of 500 sentences in English and their corresponding graphs in UNL, and is supposed to cover very basic linguistic phenomena.
The corpus500
- The whole corpus in one single file
- Corpus 500 Experimental corpus in English (500 sentences), to be manually translated to the target languages, in order to be used as the input for IAN
- UNL
- Corpus 500, Experimental corpus in UNL (500 sentences), to be used as the input for EUGENE
- Corpus 500 according to the complexity of the graphs (the same as above, but split in different files)