VIII UNL School

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
(Slides)
Line 1: Line 1:
== Participants ==
 
*Carolin ARNOLD (German)
 
*Ewa CZAJKOWSKA (Polish)
 
*Grega MILHARCIC (Slovenian)
 
*Luisa GOUVEIA (Portuguese)
 
*Martin LUTS (Estonian)
 
*Mihaela ILIOAIA (Romanian)
 
*Ofelia HOVHANNISYAN (Armenian)
 
*Olga VARTZIOTI (Greek)
 
*Polina LENKOVA (Russian)
 
*Ronaldo MARTINS (UNL)
 
*Sameh ALANSARY (Arabic)
 
*Sara STYMNE (Swedish)
 
*Yordanka STANCHEVA (Bulgarian)
 
 
== Schedule ==
 
;Feb 06th, 2012 - Monday
 
:09:00-10:00 Introduction
 
:10:00-12:00 I – Corpus
 
:14:00-17:00 II – UNL-NL dictionary
 
;Feb 07th, 2012 - Tuesday
 
:09:00-12:00 III – Morphology (inflectional paradigms)
 
:14:00-17:00 IV – NL dictionary
 
;Feb 08th, 2012- Wednesday
 
:09:00-12:00 V – UNL-NL grammar (I)
 
:14:00-17:00 V – UNL-NL grammar (II)
 
;Feb 09th, 2012 - Thursday
 
:09:00-12:00 VI – NL-UNL grammar (I)
 
:14:00-17:00 VI – NL-UNL grammar (II)
 
;Feb 10th, 2012 - Friday
 
:09:00-12:00 Evaluation
 
:14:00-17:00 Discussion
 
 
 
== Goals ==
 
== Goals ==
 
*To build the basic modules of a NL-UNL (analysis) grammar
 
*To build the basic modules of a NL-UNL (analysis) grammar
 
*To build the basic modules of a UNL-UNL (generation) grammar
 
*To build the basic modules of a UNL-UNL (generation) grammar
  
== Deliverables ==
+
== Corpus ==
*Reference corpus (500 sentences)
+
*[http://www.unlweb.net/resources/geneva2012/corpus_eng.txt Reference corpus in English (500 sentences)], to be manually translated to the target languages, in order to be used as the input for IAN
*UNL-NL dictionary  (corresponding to the corpus)
+
*[http://www.unlweb.net/resources/geneva2012/corpus_unl.txt Reference corpus in UNL (500 graphs)], to be used as the input for EUGENE
*NL dictionary (corresponding to the corpus)
+
* Reference corpus according to the complexity of the graphs (same as the above, but split in different files)
*UNL-NL grammar (corresponding to the corpus)
+
 
+
== Material ==
+
*[http://www.unlweb.net/resources/geneva2012/corpus.xls Reference corpus and wordlist (Excel worksheet)]
+
*[http://www.unlweb.net/resources/geneva2012/corpus_dic.txt EN-UNL Dictionary (for the reference corpus)]
+
*Corpus
+
 
{| border="1" cellpadding="2" align=center
 
{| border="1" cellpadding="2" align=center
 
|+Corpus
 
|+Corpus
Line 129: Line 90:
 
|}
 
|}
  
== Slides ==
+
== Deliverables ==
 +
;ANALYSIS (IAN)
 +
*The manual translated version of the 500 sentences of the reference corpus (corpus_LID.txt)
 +
*The analysis dictionary used to analyze those 500 sentences (ana_dic_LID.txt)
 +
*The analysis grammar used to analyze those 500 sentences (ana_gra_LID.txt)
 +
*The analysis disambiguation grammar, if any, used to analyze those 500 sentences (ana_dis_LID.txt)
 +
*The UNL output for those 500 sentences generated from the dictionary and grammars above (ana_out_LID.txt)
 +
;GENERATION (EUGENE)
 +
*The generation dictionary used to generate the reference corpus onto natural language (gen_dic_LID.txt)
 +
*The generation grammar, including inflectional paradigms, used to generate the reference corpus onto natural language (gen_gra_LID.txt)
 +
*The generation disambiguation grammar used to generate the reference corpus onto natural language (gen_dis_LID.txt)
 +
*The natural language output generated from the dictionary and grammars above (gen_out_LID.txt)
 +
 
 +
== Presentations ==
 
*[http://www.unlweb.net/resources/geneva2012/Corpus.pdf Corpus]
 
*[http://www.unlweb.net/resources/geneva2012/Corpus.pdf Corpus]
 
*[http://www.unlweb.net/resources/geneva2012/Dictionary1.pdf NL-UNL Dictionary]
 
*[http://www.unlweb.net/resources/geneva2012/Dictionary1.pdf NL-UNL Dictionary]
 
*[http://www.unlweb.net/resources/geneva2012/Dictionary2.pdf UNL-NL Dictionary]
 
*[http://www.unlweb.net/resources/geneva2012/Dictionary2.pdf UNL-NL Dictionary]
 
*[http://www.unlweb.net/resources/geneva2012/Grammar.pdf Grammar]
 
*[http://www.unlweb.net/resources/geneva2012/Grammar.pdf Grammar]
 +
 +
== Participants ==
 +
*Carolin ARNOLD (German)
 +
*Ewa CZAJKOWSKA (Polish)
 +
*Grega MILHARCIC (Slovenian)
 +
*Luisa GOUVEIA (Portuguese)
 +
*Martin LUTS (Estonian)
 +
*Mihaela ILIOAIA (Romanian)
 +
*Ofelia HOVHANNISYAN (Armenian)
 +
*Olga VARTZIOTI (Greek)
 +
*Polina LENKOVA (Russian)
 +
*Ronaldo MARTINS (UNL)
 +
*Sameh ALANSARY (Arabic)
 +
*Sara STYMNE (Swedish)
 +
*Yordanka STANCHEVA (Bulgarian)
 +
 +
== Schedule ==
 +
;Feb 06th, 2012 - Monday
 +
:09:00-10:00 Introduction
 +
:10:00-12:00 I – Corpus
 +
:14:00-17:00 II – UNL-NL dictionary
 +
;Feb 07th, 2012 - Tuesday
 +
:09:00-12:00 III – Morphology (inflectional paradigms)
 +
:14:00-17:00 IV – NL dictionary
 +
;Feb 08th, 2012- Wednesday
 +
:09:00-12:00 V – UNL-NL grammar (I)
 +
:14:00-17:00 V – UNL-NL grammar (II)
 +
;Feb 09th, 2012 - Thursday
 +
:09:00-12:00 VI – NL-UNL grammar (I)
 +
:14:00-17:00 VI – NL-UNL grammar (II)
 +
;Feb 10th, 2012 - Friday
 +
:09:00-12:00 Evaluation
 +
:14:00-17:00 Discussion

Revision as of 12:59, 13 February 2012

Contents

Goals

  • To build the basic modules of a NL-UNL (analysis) grammar
  • To build the basic modules of a UNL-UNL (generation) grammar

Corpus

Corpus
Order Analysis (English original) Generation (UNL) Word list (English original)
1 temp_org.txt temp_unl.txt temp_dic.txt
2 attribute0_org.txt attribute0_unl.txt attribute0_dic.txt
3 attribute1_org.txt attribute1_unl.txt attribute1_dic.txt
4 attribute2_org.txt attribute2_unl.txt attribute2_dic.txt
5 attribute3_org.txt attribute3_unl.txt attribute3_dic.txt
6 relation1_org.txt relation1_unl.txt relation1_dic.txt
7 relation2_org.txt relation2_unl.txt relation2_dic.txt
8 relation3_org.txt relation3_unl.txt relation3_dic.txt
9 relation4_org.txt relation4_unl.txt relation4_dic.txt
10 relation5_org.txt relation5_unl.txt relation5_dic.txt
11 relation6_org.txt relation6_unl.txt relation6_dic.txt
12 numbers_org.txt numbers_unl.txt numbers_dic.txt
13 time_org.txt time_unl.txt time_dic.txt
14 relatives_org.txt relatives_unl.txt relatives_dic.txt
15 problems_org.txt problems_unl.txt problems_dic.txt

Deliverables

ANALYSIS (IAN)
  • The manual translated version of the 500 sentences of the reference corpus (corpus_LID.txt)
  • The analysis dictionary used to analyze those 500 sentences (ana_dic_LID.txt)
  • The analysis grammar used to analyze those 500 sentences (ana_gra_LID.txt)
  • The analysis disambiguation grammar, if any, used to analyze those 500 sentences (ana_dis_LID.txt)
  • The UNL output for those 500 sentences generated from the dictionary and grammars above (ana_out_LID.txt)
GENERATION (EUGENE)
  • The generation dictionary used to generate the reference corpus onto natural language (gen_dic_LID.txt)
  • The generation grammar, including inflectional paradigms, used to generate the reference corpus onto natural language (gen_gra_LID.txt)
  • The generation disambiguation grammar used to generate the reference corpus onto natural language (gen_dis_LID.txt)
  • The natural language output generated from the dictionary and grammars above (gen_out_LID.txt)

Presentations

Participants

  • Carolin ARNOLD (German)
  • Ewa CZAJKOWSKA (Polish)
  • Grega MILHARCIC (Slovenian)
  • Luisa GOUVEIA (Portuguese)
  • Martin LUTS (Estonian)
  • Mihaela ILIOAIA (Romanian)
  • Ofelia HOVHANNISYAN (Armenian)
  • Olga VARTZIOTI (Greek)
  • Polina LENKOVA (Russian)
  • Ronaldo MARTINS (UNL)
  • Sameh ALANSARY (Arabic)
  • Sara STYMNE (Swedish)
  • Yordanka STANCHEVA (Bulgarian)

Schedule

Feb 06th, 2012 - Monday
09:00-10:00 Introduction
10:00-12:00 I – Corpus
14:00-17:00 II – UNL-NL dictionary
Feb 07th, 2012 - Tuesday
09:00-12:00 III – Morphology (inflectional paradigms)
14:00-17:00 IV – NL dictionary
Feb 08th, 2012- Wednesday
09:00-12:00 V – UNL-NL grammar (I)
14:00-17:00 V – UNL-NL grammar (II)
Feb 09th, 2012 - Thursday
09:00-12:00 VI – NL-UNL grammar (I)
14:00-17:00 VI – NL-UNL grammar (II)
Feb 10th, 2012 - Friday
09:00-12:00 Evaluation
14:00-17:00 Discussion
Software