UNL System
From UNL Wiki
The UNL System is the set of basic modules of UNL and the rules to combine them.
Contents |
UNLization
Main article: UNLization
UNLization is the process of representing the content of a natural language structure into UNL. It may involve the following modules:
- The UNL Ontology
- The UNL Knowledge Base
- The UNL Memory
- The NL Memory
- The NL-UNL Dictionary
- The NL-UNL Memory
- The NL-UNL Transformation Grammar
- The NL-UNL Disambiguation Grammar
- A UNLizer
NLization
Main article: NLization
NLization is the process of generating a natural language document out of UNL. It may involve the following modules:
- The UNL Ontology
- The UNL Knowledge Base
- The UNL Memory
- The NL Memory
- The UNL-NL Dictionary
- The UNL-NL Memory
- The UNL-NL Transformation Grammar
- The UNL-NL Disambiguation Grammar
- A NLizer
Modules
- Lexical Databases
- The UNL Ontology, the UNL Knowledge Base and the UNL Memory contain semantic relations between UW's along with a degree of necessity, i.e., the possibility of occurrence. These repositories include ontological relations ("a kind of" and "an instance of"), in case of UNL Ontology; necessary (ontological and thematic) relations (such as "is the agent of", "is the place where", "is the moment when"), in case of UNL Knowledge Base; and any typical relations, in case of the UNL Memory.
- The UNL-NL Dictionary and the NL-UNL Dictionary contain mappings between UW's and natural language entries. They are bilingual lexica where UW's are translated into lexical realisation units of a given natural language.
- The UNL-NL Memory and the NL-UNL Memory contain mappings between UNL and natural language entries.
- The NL Memory contains syntactic relations between natural language entries
- Grammars
- The UNL-NL D-Grammar and the UNL-NL T-Grammar are the set of rules used for natural language generation (out of UNL). The NL-UNL D-Grammar and the NL-UNL T-Grammar are the set of rules used for natural language analysis (to UNL). The T-Grammar is the repository of the transformation rules, which are used to manipulate data structures; the D-Grammar is the repository of disambiguation rules, which are used to control the use of transformation rules.
Specs
- Tagset
- The Tagset is a harmonized system for representing linguistic constants.
- Dictionary Specs
- The Dictionary Specs state the syntax of dictionaries.
- Grammar Specs
- The Grammar Specs state the syntax of grammar rules.
- KB Specs
- The UNL Knowledge Base Specs state the syntax of knowledge bases.
- Memory Specs
- The Memory Specs state the syntax of example bases.