VIII UNL School
From UNL Wiki
(Difference between revisions)
(→Slides) |
|||
Line 1: | Line 1: | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
== Goals == | == Goals == | ||
*To build the basic modules of a NL-UNL (analysis) grammar | *To build the basic modules of a NL-UNL (analysis) grammar | ||
*To build the basic modules of a UNL-UNL (generation) grammar | *To build the basic modules of a UNL-UNL (generation) grammar | ||
− | == | + | == Corpus == |
− | + | *[http://www.unlweb.net/resources/geneva2012/corpus_eng.txt Reference corpus in English (500 sentences)], to be manually translated to the target languages, in order to be used as the input for IAN | |
− | + | *[http://www.unlweb.net/resources/geneva2012/corpus_unl.txt Reference corpus in UNL (500 graphs)], to be used as the input for EUGENE | |
− | + | * Reference corpus according to the complexity of the graphs (same as the above, but split in different files) | |
− | + | ||
− | + | ||
− | + | ||
− | *[http://www.unlweb.net/resources/geneva2012/ | + | |
− | *[http://www.unlweb.net/resources/geneva2012/ | + | |
− | * | + | |
{| border="1" cellpadding="2" align=center | {| border="1" cellpadding="2" align=center | ||
|+Corpus | |+Corpus | ||
Line 129: | Line 90: | ||
|} | |} | ||
− | == | + | == Deliverables == |
+ | ;ANALYSIS (IAN) | ||
+ | *The manual translated version of the 500 sentences of the reference corpus (corpus_LID.txt) | ||
+ | *The analysis dictionary used to analyze those 500 sentences (ana_dic_LID.txt) | ||
+ | *The analysis grammar used to analyze those 500 sentences (ana_gra_LID.txt) | ||
+ | *The analysis disambiguation grammar, if any, used to analyze those 500 sentences (ana_dis_LID.txt) | ||
+ | *The UNL output for those 500 sentences generated from the dictionary and grammars above (ana_out_LID.txt) | ||
+ | ;GENERATION (EUGENE) | ||
+ | *The generation dictionary used to generate the reference corpus onto natural language (gen_dic_LID.txt) | ||
+ | *The generation grammar, including inflectional paradigms, used to generate the reference corpus onto natural language (gen_gra_LID.txt) | ||
+ | *The generation disambiguation grammar used to generate the reference corpus onto natural language (gen_dis_LID.txt) | ||
+ | *The natural language output generated from the dictionary and grammars above (gen_out_LID.txt) | ||
+ | |||
+ | == Presentations == | ||
*[http://www.unlweb.net/resources/geneva2012/Corpus.pdf Corpus] | *[http://www.unlweb.net/resources/geneva2012/Corpus.pdf Corpus] | ||
*[http://www.unlweb.net/resources/geneva2012/Dictionary1.pdf NL-UNL Dictionary] | *[http://www.unlweb.net/resources/geneva2012/Dictionary1.pdf NL-UNL Dictionary] | ||
*[http://www.unlweb.net/resources/geneva2012/Dictionary2.pdf UNL-NL Dictionary] | *[http://www.unlweb.net/resources/geneva2012/Dictionary2.pdf UNL-NL Dictionary] | ||
*[http://www.unlweb.net/resources/geneva2012/Grammar.pdf Grammar] | *[http://www.unlweb.net/resources/geneva2012/Grammar.pdf Grammar] | ||
+ | |||
+ | == Participants == | ||
+ | *Carolin ARNOLD (German) | ||
+ | *Ewa CZAJKOWSKA (Polish) | ||
+ | *Grega MILHARCIC (Slovenian) | ||
+ | *Luisa GOUVEIA (Portuguese) | ||
+ | *Martin LUTS (Estonian) | ||
+ | *Mihaela ILIOAIA (Romanian) | ||
+ | *Ofelia HOVHANNISYAN (Armenian) | ||
+ | *Olga VARTZIOTI (Greek) | ||
+ | *Polina LENKOVA (Russian) | ||
+ | *Ronaldo MARTINS (UNL) | ||
+ | *Sameh ALANSARY (Arabic) | ||
+ | *Sara STYMNE (Swedish) | ||
+ | *Yordanka STANCHEVA (Bulgarian) | ||
+ | |||
+ | == Schedule == | ||
+ | ;Feb 06th, 2012 - Monday | ||
+ | :09:00-10:00 Introduction | ||
+ | :10:00-12:00 I – Corpus | ||
+ | :14:00-17:00 II – UNL-NL dictionary | ||
+ | ;Feb 07th, 2012 - Tuesday | ||
+ | :09:00-12:00 III – Morphology (inflectional paradigms) | ||
+ | :14:00-17:00 IV – NL dictionary | ||
+ | ;Feb 08th, 2012- Wednesday | ||
+ | :09:00-12:00 V – UNL-NL grammar (I) | ||
+ | :14:00-17:00 V – UNL-NL grammar (II) | ||
+ | ;Feb 09th, 2012 - Thursday | ||
+ | :09:00-12:00 VI – NL-UNL grammar (I) | ||
+ | :14:00-17:00 VI – NL-UNL grammar (II) | ||
+ | ;Feb 10th, 2012 - Friday | ||
+ | :09:00-12:00 Evaluation | ||
+ | :14:00-17:00 Discussion |
Revision as of 12:59, 13 February 2012
Contents |
Goals
- To build the basic modules of a NL-UNL (analysis) grammar
- To build the basic modules of a UNL-UNL (generation) grammar
Corpus
- Reference corpus in English (500 sentences), to be manually translated to the target languages, in order to be used as the input for IAN
- Reference corpus in UNL (500 graphs), to be used as the input for EUGENE
- Reference corpus according to the complexity of the graphs (same as the above, but split in different files)
Deliverables
- ANALYSIS (IAN)
- The manual translated version of the 500 sentences of the reference corpus (corpus_LID.txt)
- The analysis dictionary used to analyze those 500 sentences (ana_dic_LID.txt)
- The analysis grammar used to analyze those 500 sentences (ana_gra_LID.txt)
- The analysis disambiguation grammar, if any, used to analyze those 500 sentences (ana_dis_LID.txt)
- The UNL output for those 500 sentences generated from the dictionary and grammars above (ana_out_LID.txt)
- GENERATION (EUGENE)
- The generation dictionary used to generate the reference corpus onto natural language (gen_dic_LID.txt)
- The generation grammar, including inflectional paradigms, used to generate the reference corpus onto natural language (gen_gra_LID.txt)
- The generation disambiguation grammar used to generate the reference corpus onto natural language (gen_dis_LID.txt)
- The natural language output generated from the dictionary and grammars above (gen_out_LID.txt)
Presentations
Participants
- Carolin ARNOLD (German)
- Ewa CZAJKOWSKA (Polish)
- Grega MILHARCIC (Slovenian)
- Luisa GOUVEIA (Portuguese)
- Martin LUTS (Estonian)
- Mihaela ILIOAIA (Romanian)
- Ofelia HOVHANNISYAN (Armenian)
- Olga VARTZIOTI (Greek)
- Polina LENKOVA (Russian)
- Ronaldo MARTINS (UNL)
- Sameh ALANSARY (Arabic)
- Sara STYMNE (Swedish)
- Yordanka STANCHEVA (Bulgarian)
Schedule
- Feb 06th, 2012 - Monday
- 09:00-10:00 Introduction
- 10:00-12:00 I – Corpus
- 14:00-17:00 II – UNL-NL dictionary
- Feb 07th, 2012 - Tuesday
- 09:00-12:00 III – Morphology (inflectional paradigms)
- 14:00-17:00 IV – NL dictionary
- Feb 08th, 2012- Wednesday
- 09:00-12:00 V – UNL-NL grammar (I)
- 14:00-17:00 V – UNL-NL grammar (II)
- Feb 09th, 2012 - Thursday
- 09:00-12:00 VI – NL-UNL grammar (I)
- 14:00-17:00 VI – NL-UNL grammar (II)
- Feb 10th, 2012 - Friday
- 09:00-12:00 Evaluation
- 14:00-17:00 Discussion