XI UNL School
From UNL Wiki
(Difference between revisions)
(→Participants) |
|||
Line 9: | Line 9: | ||
*11-15/Mar/2013: XI UNL School | *11-15/Mar/2013: XI UNL School | ||
− | == | + | == Material == |
− | + | *Corpus | |
− | * | + | **For the X-bar exercises: [http://www.unlweb.net/resources/corpus/UCB1/UCB1_t1_eng.txt UCB1] |
− | * | + | **For UNLization and NLization: [[UCA1]] |
− | * | + | *Slides |
− | * | + | **[http://www.unlweb.net/school/macau2013/day1.pdf Day #1] |
− | * | + | *Additional material |
− | * | + | **[[Tagset]], because you are expected to use only the tags included in the tagset |
− | * | + | **[[UNL Dictionary Specs]], which is essential to understand the dictionary structure |
− | * | + | **[[UNL Grammar Specs]], which is essential to understand the grammar structure |
− | * | + | **[[X-bar]] |
− | * | + | **[[Default Dictionary]] |
− | * | + | **[[English dictionary]] |
− | * | + | **[[Default grammar]] |
− | * | + | **[[English grammar]] |
− | * | + | |
− | * | + | |
− | * | + | |
− | * | + | |
− | * | + | |
== Participants == | == Participants == | ||
− | *Angela Manukian | + | *Angela Manukian (Armenian) |
− | *Chittaphone Chanylilath | + | *Chittaphone Chanylilath (Laotian) |
− | *Chunlei Yang | + | *Chunlei Yang (Mandarin) |
− | *Jeanette Tan | + | *Jeanette Tan (Japanese) |
− | *Kim Sokphyrum | + | *Kim Sokphyrum (Khmer) |
− | *Lilit Paremuzyan | + | *Lilit Paremuzyan (Armenian) |
− | *Muhammad Zulhelmy bin Mohd Rosman | + | *Muhammad Zulhelmy bin Mohd Rosman (Malay) |
− | *Siyou Liu | + | *Siyou Liu (Cantonese) |
− | *Somdev Kar | + | *Somdev Kar (Bengali) |
− | *Suos Samak | + | *Suos Samak (Khmer) |
− | *Teng Wei Min | + | *Teng Wei Min |
− | *Vinh Dang | + | *Vinh Dang (Vietnamese) |
− | *Viraj Karunananda | + | *Viraj Karunananda (Sinhala) |
− | *Viraj Welgama | + | *Viraj Welgama (Sinhala) |
− | == Pre-workshop tasks | + | == Pre-workshop tasks == |
#Corpus | #Corpus | ||
##Translate (manually) the 100 sentences of the corpus [http://www.unlweb.net/resources/corpus/UCA1/UCA1_eng.txt UC-A1] from English into your native language. Be as close as possible to the original, and provide one single translation for each sentence. This will be your input document file, and your goal will be to provide (automatically, through [[IAN]]) the UNL graphs for each sentence | ##Translate (manually) the 100 sentences of the corpus [http://www.unlweb.net/resources/corpus/UCA1/UCA1_eng.txt UC-A1] from English into your native language. Be as close as possible to the original, and provide one single translation for each sentence. This will be your input document file, and your goal will be to provide (automatically, through [[IAN]]) the UNL graphs for each sentence | ||
Line 57: | Line 52: | ||
##Save the UNL-NL dictionary in a plain text (.txt) file with UTF-8 encoding and name it unl_???_dic.txt where ??? must be replaced by the corresponding [http://en.wikipedia.org/wiki/List_of_ISO_639-2_codes ISO 639-2 language code]. | ##Save the UNL-NL dictionary in a plain text (.txt) file with UTF-8 encoding and name it unl_???_dic.txt where ??? must be replaced by the corresponding [http://en.wikipedia.org/wiki/List_of_ISO_639-2_codes ISO 639-2 language code]. | ||
#Send the 3 files to r.martins@undlfoundation.org before March 4th | #Send the 3 files to r.martins@undlfoundation.org before March 4th | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
== Venue == | == Venue == |
Revision as of 09:28, 10 March 2013
The XI UNL School will be dedicated to the development and improvement of the grammatical resources for the languages below. It will take place at the University of Macau, in Macau, from March 11th to 15th, 2013.
Contents |
Call for Participation
The CFP is available at [1].
Important dates
03/Feb/2013: Deadline for the applications10/Feb/2013: Notification of accepted candidates- 11-15/Mar/2013: XI UNL School
Material
- Corpus
- Slides
- Additional material
- Tagset, because you are expected to use only the tags included in the tagset
- UNL Dictionary Specs, which is essential to understand the dictionary structure
- UNL Grammar Specs, which is essential to understand the grammar structure
- X-bar
- Default Dictionary
- English dictionary
- Default grammar
- English grammar
Participants
- Angela Manukian (Armenian)
- Chittaphone Chanylilath (Laotian)
- Chunlei Yang (Mandarin)
- Jeanette Tan (Japanese)
- Kim Sokphyrum (Khmer)
- Lilit Paremuzyan (Armenian)
- Muhammad Zulhelmy bin Mohd Rosman (Malay)
- Siyou Liu (Cantonese)
- Somdev Kar (Bengali)
- Suos Samak (Khmer)
- Teng Wei Min
- Vinh Dang (Vietnamese)
- Viraj Karunananda (Sinhala)
- Viraj Welgama (Sinhala)
Pre-workshop tasks
- Corpus
- Translate (manually) the 100 sentences of the corpus UC-A1 from English into your native language. Be as close as possible to the original, and provide one single translation for each sentence. This will be your input document file, and your goal will be to provide (automatically, through IAN) the UNL graphs for each sentence
- Save the translated text (without the English original) in a plain text (.txt) file with UTF-8 encoding and name it corpus_???.txt where ??? must be replaced by the corresponding ISO 639-2 language code.
- Analysis Dictionary
- Localize the dictionary available at eng_unl_dic.txt. Note that "localization" is not the same as "translation". You may need other features (in English, for instance, nouns do not have gender or case) or other entries. In any case, the resulting dictionary should fit your translated version of the corpus (i.e., all the entries appearing in your translated version of the corpus should appear in the dictionary). For further information on localization, see Localization. For information on the dictionary structure, see Dictionary Specs. For an explanation of the structure of the English dictionary, see English Dictionary. In case you need additional features, use only the tags available at the tagset.
- Save the NL-UNL dictionary in a plain text (.txt) file with UTF-8 encoding and name it ???_unl_dic.txt where ??? must be replaced by the corresponding ISO 639-2 language code.
- Generation Dictionary
- Localize the dictionary available at unl_eng_dic.txt. Follow the same procedures described above.
- Save the UNL-NL dictionary in a plain text (.txt) file with UTF-8 encoding and name it unl_???_dic.txt where ??? must be replaced by the corresponding ISO 639-2 language code.
- Send the 3 files to r.martins@undlfoundation.org before March 4th
Venue
University of Macau, Macau
Local Organization
Prof. Dr. Ana Luísa Varani Leal (University of Macau)