UNL-NL Memory
(New page: The '''UNLization Memory''', or simply '''UM''', is a set of mappings between a given natural language and UNL. It is claimed to improve the results of UNLization process, as it provid...) |
|||
Line 1: | Line 1: | ||
− | The '''UNLization Memory''', or simply '''UM''', is a set of mappings between a given natural language and UNL. It | + | The '''UNLization Memory''', or simply '''UM''', is a set of mappings between a given natural language and UNL. It improves and to normalizes the results of the [[UNLization]] process, as it contains segments that have been previously UNLized. The UNL UM may be provided in two different formats: |
− | + | ||
− | The UNL UM may be provided in two different formats: | + | |
*Extended, in TMX; or | *Extended, in TMX; or | ||
*Simplified, as a set of [[Grammar_Specs#Disambiguation_Rules|network disambiguation rules]] | *Simplified, as a set of [[Grammar_Specs#Disambiguation_Rules|network disambiguation rules]] | ||
Line 35: | Line 33: | ||
SOURCE is the source segment;<br /> | SOURCE is the source segment;<br /> | ||
TARGET is the target segment; <br /> | TARGET is the target segment; <br /> | ||
− | DC is the degree of certainty (i.e., the likelihood of the relation between the SOURCE and the TARGET) | + | DC is the degree of certainty (i.e., the likelihood of the relation between the SOURCE and the TARGET)<br /> |
Revision as of 11:02, 8 December 2010
The UNLization Memory, or simply UM, is a set of mappings between a given natural language and UNL. It improves and to normalizes the results of the UNLization process, as it contains segments that have been previously UNLized. The UNL UM may be provided in two different formats:
- Extended, in TMX; or
- Simplified, as a set of network disambiguation rules
Extended format
UNL UM entries in extended format must comply with the [Translation Memory eXchange Specs], as follows:
<tu> <tuv xml:lang="en"><seg>a good deal</seg><tuv> <tuv xml:lang="unl"><seg>400059171</seg><tuv> </tu>
Where:
<tu> is the beginning of the translation unit
</tu> is the end of the translation unit
<tuv> is the beginning translation unit variant
</tuv> is the end of the translation unit variant
<seg> is the beginning of the translation segment
</seg> is the end of the translation segment
Simplified format
UNL UM entries in simplified format must have the following structure:
equ(SOURCE;TARGET)=DC;
Where:
equ is the UNL relation for "equivalent";
SOURCE is the source segment;
TARGET is the target segment;
DC is the degree of certainty (i.e., the likelihood of the relation between the SOURCE and the TARGET)