XI UNL School
From UNL Wiki
(Difference between revisions)
(→Important dates) |
|||
Line 29: | Line 29: | ||
*Thai | *Thai | ||
*Vietnamese | *Vietnamese | ||
+ | |||
+ | == Pre-workshop tasks (to be sent before March 4th) == | ||
+ | #Corpus | ||
+ | ##Translate (manually) the sentences of the [http://www.unlweb.net/resources/corpus/UCA1/UCA1_eng.txt UC-A1] from English into your native language. Be as close as possible to the original, and provide one single translation for each sentence. This will be your input document file, and your goal will be to provide (automatically, through [[IAN]]) the UNL graphs for each sentence | ||
+ | ##Save the translated text (without the English original) in a plain text (.txt) file with UTF-8 encoding | ||
+ | #Analysis Dictionary | ||
+ | ##Localize the dictionary available at [http://www.unlweb.net/resources/dic/UCA1/eng_unl_dic.txt eng_unl_dic.txt]. Note that "localization" is not the same as "translation". You may need other features (in English, for instance, nouns do not have gender or case) or other entries. In any case, the resulting dictionary should fit your translated version of the corpus (i.e., all the entries appearing in your translated version of the corpus should appear in the dictionary). For further information on localization, see [[Localization]]. For information on the dictionary structure, see [[Dictionary Specs]]. For an explanation of the structure of the English dictionary, see [[English_dictionary|English Dictionary]]. In case you need additional features, use only the tags available at the [[tagset]]. | ||
+ | #Generation Dictionary | ||
+ | ##Localize the dictionary available at [http://www.unlweb.net/resources/corpus/UCA1/unl_eng_dic.txt unl_eng_dic.txt] to your locale. Note, once again, that localization is not the same as translation. The localized version must reflect the word list of your translated corpus. In any case, use only the tags available at the [[tagset]]. For further information on the dictionary structure, see [[Dictionary Specs]]. | ||
+ | ##Save the UNL-NL dictionary in a plain text (.txt) file with UTF-8 encoding | ||
+ | Send the files to r.martins@undlfoundation.org before March 4th | ||
== Material == | == Material == | ||
− | *to be | + | *Recommended readings |
+ | Before starting the activity, and in order to fully understand what is expected to be done, it is important for you to be acquainted with the following documentation: | ||
+ | **[[Tagset]], because you are expected to use only the tags included in the tagset | ||
+ | **[[UNL Dictionary Specs]], which is essential to understand the dictionary structure | ||
+ | **[[UNL Grammar Specs]], which is essential to understand the grammar structure | ||
+ | **[[X-bar]] | ||
+ | **[[Default Dictionary]] | ||
+ | **[[English dictionary]] | ||
+ | **[[Default grammar]] | ||
+ | **[[English grammar]] | ||
+ | It is also interesting to make a test drive with [[IAN]] and [[EUGENE]]. | ||
== Venue == | == Venue == |
Revision as of 15:22, 5 February 2013
The XI UNL School will be dedicated to the development and improvement of the grammatical resources for the languages below. It will take place at the University of Macau, in Macau, from March 11th to 15th, 2013.
Contents |
Call for Participation
The CFP is available at [1].
Important dates
03/Feb/2013: Deadline for the applications- 10/Feb/2013: Notification of accepted candidates
- 11-15/Mar/2013: XI UNL School
Languages
The workshop is dedicated to the development of the grammatical resources for the processing of the languages below:
- Bahasa Indonesia
- Bengali
- Burmese
- Cantonese
- Chinese
- Japanese
- Khmer
- Korean
- Laotian
- Malay
- Mongolian
- Nepalese
- Portuguese (Macau)
- Sinhalese
- Tagalog
- Tetum
- Thai
- Vietnamese
Pre-workshop tasks (to be sent before March 4th)
- Corpus
- Translate (manually) the sentences of the UC-A1 from English into your native language. Be as close as possible to the original, and provide one single translation for each sentence. This will be your input document file, and your goal will be to provide (automatically, through IAN) the UNL graphs for each sentence
- Save the translated text (without the English original) in a plain text (.txt) file with UTF-8 encoding
- Analysis Dictionary
- Localize the dictionary available at eng_unl_dic.txt. Note that "localization" is not the same as "translation". You may need other features (in English, for instance, nouns do not have gender or case) or other entries. In any case, the resulting dictionary should fit your translated version of the corpus (i.e., all the entries appearing in your translated version of the corpus should appear in the dictionary). For further information on localization, see Localization. For information on the dictionary structure, see Dictionary Specs. For an explanation of the structure of the English dictionary, see English Dictionary. In case you need additional features, use only the tags available at the tagset.
- Generation Dictionary
- Localize the dictionary available at unl_eng_dic.txt to your locale. Note, once again, that localization is not the same as translation. The localized version must reflect the word list of your translated corpus. In any case, use only the tags available at the tagset. For further information on the dictionary structure, see Dictionary Specs.
- Save the UNL-NL dictionary in a plain text (.txt) file with UTF-8 encoding
Send the files to r.martins@undlfoundation.org before March 4th
Material
- Recommended readings
Before starting the activity, and in order to fully understand what is expected to be done, it is important for you to be acquainted with the following documentation:
- Tagset, because you are expected to use only the tags included in the tagset
- UNL Dictionary Specs, which is essential to understand the dictionary structure
- UNL Grammar Specs, which is essential to understand the grammar structure
- X-bar
- Default Dictionary
- English dictionary
- Default grammar
- English grammar
It is also interesting to make a test drive with IAN and EUGENE.
Venue
University of Macau, Macau
Local Organization
Prof. Dr. Ana Luísa Varani Leal (University of Macau)