UNL-NL Memory

From UNL Wiki

Revision as of 19:30, 14 December 2010 by Martins (Talk | contribs)


The UNLization Memory, or UNL Memory Base, or simply UNL^MB, is a set of mappings between a given natural language and UNL. It improves and normalizes the results of the UNLization process, as it contains segments that have been previously UNLized.

The UNL^MB may be provided in two different formats:

Extended, in TMX; or
Simplified, as a set of network disambiguation rules

Extended format

UNL^MB entries in extended format must comply with the Translation Memory eXchange (TMX) specification, as follows:

   <tu>
       <tuv xml:lang="en"><seg>a good deal</seg></tuv>
       <tuv xml:lang="unl"><seg>400059171</seg></tuv>
   </tu>

Where:
<tu> is the beginning of the translation unit
</tu> is the end of the translation unit
<tuv> is the beginning of the translation unit variant
</tuv> is the end of the translation unit variant
<seg> is the beginning of the translation segment
</seg> is the end of the translation segment
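
As an illustration of how such an entry can be read, the following is a minimal sketch in Python using the standard library's xml.etree.ElementTree. The function name read_translation_unit and the constant names are hypothetical and not part of the UNL tools. The sketch handles a single <tu> element only, not a complete TMX file with its header and body.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the reserved XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# The entry from this page, with each <tuv> closed by </tuv>.
SAMPLE_TU = """
<tu>
    <tuv xml:lang="en"><seg>a good deal</seg></tuv>
    <tuv xml:lang="unl"><seg>400059171</seg></tuv>
</tu>
"""


def read_translation_unit(tu_xml):
    """Return a {language code: segment text} dict for one <tu> element.

    Hypothetical helper for illustration; it assumes one <seg> per <tuv>.
    """
    tu = ET.fromstring(tu_xml.strip())
    variants = {}
    for tuv in tu.findall("tuv"):
        seg = tuv.find("seg")
        variants[tuv.get(XML_LANG)] = seg.text if seg is not None else None
    return variants


unit = read_translation_unit(SAMPLE_TU)
print(unit)  # {'en': 'a good deal', 'unl': '400059171'}
```

Keying the result by language code makes it straightforward to look up the UNL side of a natural language segment, e.g. unit["unl"].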

Simplified format

UNL^MB entries in simplified format must be represented as a set of network disambiguation rules, as follows:

equ(SOURCE;TARGET)=DC;

Where:
equ is the UNL relation for "equivalent";
SOURCE is the source segment;
TARGET is the target segment;
DC is the degree of certainty (i.e., the likelihood of the relation between the SOURCE and the TARGET).
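
A rule of this shape can be split into its three parts with a regular expression. The following Python sketch is only an illustration under stated assumptions: it assumes that SOURCE and TARGET contain no semicolons or parentheses, it keeps DC as a raw string because this page does not state its range or type, and the sample rule (including the value 255) is a made-up example. The names RULE_RE and parse_rule are hypothetical.

```python
import re

# Assumption: SOURCE and TARGET contain no ';', '(' or ')'.
# DC is captured as raw text; its numeric range is not specified on this page.
RULE_RE = re.compile(
    r"^\s*equ\(\s*(?P<source>[^;()]+?)\s*;\s*(?P<target>[^;()]+?)\s*\)"
    r"\s*=\s*(?P<dc>[^;\s]+)\s*;\s*$"
)


def parse_rule(line):
    """Split 'equ(SOURCE;TARGET)=DC;' into a (source, target, dc) tuple."""
    match = RULE_RE.match(line)
    if match is None:
        raise ValueError("not a simplified UNL^MB rule: %r" % line)
    return match.group("source"), match.group("target"), match.group("dc")


# Hypothetical rule built from the segment used in the extended example.
rule = parse_rule("equ(a good deal;400059171)=255;")
print(rule)  # ('a good deal', '400059171', '255')
```

Lines that do not match the pattern raise ValueError rather than being silently skipped, so malformed entries are noticed when a memory file is loaded.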
