X UNL School
(→Slides) |
(→Samples) |
||
Line 87: | Line 87: | ||
The following resources have been used to deal with UC-A2 in English and may be used as a sample of what is expected to be provided | The following resources have been used to deal with UC-A2 in English and may be used as a sample of what is expected to be provided | ||
*UNLization | *UNLization | ||
− | **[http://www.unlweb.net/resources/ | + | **[http://www.unlweb.net/resources/A2/eng_ana_tgrammar.txt ENG-UNL T-Grammar] (Transformation Grammar used for the UNLization of UC-A1) (Documentation available at [[English_grammar|English Grammar]]) |
− | + | ||
**[http://www.unlweb.net/resources/A2/UCA2t1_eng_ana_dic.txt ENG-UNL Dictionary] (Dictionary used for the UNLization of UC-A1) (Documentation available at [[English_dictionary|English Dictionary]]) | **[http://www.unlweb.net/resources/A2/UCA2t1_eng_ana_dic.txt ENG-UNL Dictionary] (Dictionary used for the UNLization of UC-A1) (Documentation available at [[English_dictionary|English Dictionary]]) | ||
+ | **[http://www.unlweb.net/resources/A2/ana_tgrammar.txt Default NL-NL T-Grammar] (Default Transformation Grammar used for the UNLization of UC-A1) | ||
*NLization | *NLization | ||
**[http://www.unlweb.net/resources/A1/eng_gen_tgrammar.txt UNL-ENG T-Grammar] (Transformation grammar used for the NLization of UC-A1) (Documentation available at [[English_grammar|English Grammar]]) | **[http://www.unlweb.net/resources/A1/eng_gen_tgrammar.txt UNL-ENG T-Grammar] (Transformation grammar used for the NLization of UC-A1) (Documentation available at [[English_grammar|English Grammar]]) |
Revision as of 07:07, 10 October 2012
The X UNL School will be dedicated to the development and improvement of the grammatical resources for the UNL framework. It will take place in the Library of Alexandria, in Alexandria, Egypt, from October 7th to 11th, 2012.
Contents |
Call for Participation
The CFP, already expired, is available at [1].
Important dates
01/09/2012: Deadline for the applications10/09/2012: Notification of accepted candidates30/09/2012: Deadline for the pre-workshop tasks (corpus translation and dictionary preparation)- 07-11/10/2012: X UNL School
- 15/01/2013: Deadline for the post-workshop tasks (Corpus UC-A2)
Pre-workshop tasks (to be completed before 30/09/2012)
The following files must be sent to r.martins@undlfoundation.org before 30/09/2012:
- <LID>_rca2t1.txt, with the human translation, to the target language, of the sentences of Text 1;
- <LID>_ana_dic.txt, with the natural language analysis dictionary corresponding to the translated version of Text 1;
- <LID>_gen_dic.txt, with the natural language generation dictionary corresponding to the UNL version of the Text 1.
Where <LID> is to be replaced by the three-character code of the ISO 639-2 of the participant's language.
The Text 1 and further instructions are available at Corpus UC-A2.
Workshop (Oct 7-11, 2012)
During the workshop, the participants are expected to provide the syntactic and semantic modules of the grammar necessary to generate the Text 1 from the Corpus A2 from UNL into their native language, and from their native language into UNL. The grammar is expected to comply with the formalism described at the Grammar Specs, and will be provided through the UNLdev, a web-based integrated development environment for creating and editing dictionary entries and grammar rules for natural language processing. The UNDL Foundation will provide all the training and support necessary for the accomplishment of the tasks.
Program
- Oct 07th, 2012 - Sunday
- 09:00-10:00 Introduction
- 10:00-11:00 Sentence 1
- 11:00-11:30 Coffee-break
- 11:30-13:00 Sentence 3
- 13:00-14:00 Lunch
- 14:00-16:00 Sentence 8
- 16:00-16:30 Coffee-break
- 16:30-18:00 Sentence 12
- Oct 08th, 2012 - Monday
- 09:00-11:00 Sentence 2
- 11:00-11:30 Coffee-break
- 11:30-13:00 Sentence 13
- 13:00-14:00 Lunch
- 14:00-16:00 Sentence 7
- 16:00-16:30 Coffee-break
- 16:30-18:00 Sentence 6
- Oct 09th, 2012- Tuesday
- 09:00-11:00 Sentence 9
- 11:00-11:30 Coffee-break
- 11:30-13:00 Sentence 10
- 13:00-14:00 Lunch
- 14:00-16:00 Sentence 5
- 16:00-16:30 Coffee-break
- 16:30-18:00 Sentence 5
- Oct 10th, 2012 - Wednesday
- 09:00-11:00 Sentence 11
- 11:00-11:30 Coffee-break
- 11:30-13:00 Sentence 11
- 13:00-14:00 Lunch
- 14:00-16:00 Sentence 4
- 16:00-16:30 Coffee-break
- 16:30-18:00 Sentence 4
- Oct 11th, 2012 - Thursday
- 09:00-11:00 Evaluation
- 11:00-11:30 Coffee-break
- 11:30-13:00 Discussion
Participants
- Alshaimaa Ahmed Gaber
- Athanasios Karasimos
- Grega Milharcic
- Israa El-Sayed El-Hosiny
- Martin Luts
- Marwa Saber Arafat
- Milica Denic
- Nahla Mohamed Mahmoud
- Ofelia Hovhannisyan
- Olga Vartzioti
- Omneya Hosni Zayyan
- Parameswarappa S
- Parteek Kumar
- Rehab Arafat Rashwan
Material
Slides
Samples
The following resources have been used to deal with UC-A2 in English and may be used as a sample of what is expected to be provided
- UNLization
- ENG-UNL T-Grammar (Transformation Grammar used for the UNLization of UC-A1) (Documentation available at English Grammar)
- ENG-UNL Dictionary (Dictionary used for the UNLization of UC-A1) (Documentation available at English Dictionary)
- Default NL-NL T-Grammar (Default Transformation Grammar used for the UNLization of UC-A1)
- NLization
- UNL-ENG T-Grammar (Transformation grammar used for the NLization of UC-A1) (Documentation available at English Grammar)
- UNL-ENG I-Grammar (Inflectional grammar used for the NLization of UC-A1) (Documentation available at English Grammar)
Certification
The UNDL Foundation will issue a Certificate of Participation, upon evaluation, for all the participants.
Venue
Library of Alexandria, Alexandria, Egypt
Local Organization
Prof. Dr. Sameh Alansary (University of Alexandria, Library of Alexandria)
Post-workshop tasks (to be completed before 15/01/2013)
After the workshop, the participants will be invited to extend their experimental grammars (for both analysis and generation) for the whole Corpus UC-A2.
Follow-up
The participants who accomplish the post-workshop tasks before January 15th, 2013, will be invited to sign a contract for the development of the grammar modules for their respective languages. This contract will include:
- Lemmatization of all word forms appearing in the corpus (USD0.10 per lemma, up to USD1,000.00 total, i.e., 10,000 lemmas)
- The development of a grammar for the analysis of the 1,000 most frequent syntactic structures of the source language, extracted from the corpus (USD2,000.00)
- The development of a grammar for the generation of the 1,000 most frequent semantic structures of UNL (USD2,000.00)
- The participation in the Advanced Level Grammar Workshop in 2013