Lingware: Difference between revisions

From UNLwiki
imported>Martins
No edit summary
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:
The [[UNL System]] comprises three different types of language resources: '''lexica''', '''grammars''' and '''corpora'''. These resources are being provided mainly through the [[UNLarium]], a linguist-friendly database management system that supplies dictionary entries and grammar rules to the several language modules participating in the UNL Programme.
The [[UNL System]] comprises three different types of language resources: '''lexica''', '''grammars''' and '''corpora'''. These resources have been provided mainly through the [[UNLarium]].


== Lexica ==
== [[Lexica]] ==
''Main article: [[Lexica]]''
*[[UNL Dictionary]], or UNL<sup>dic</sup>: a flat list of UW's and their corresponding semantic features
*[[UNL Ontology]]: a hierarchy of UW's
*[[UNL Knowledge Base]], or UNL<sup>KB</sup>: a network of necessary interactions between UW's
*[[UNL Memory]]: a network of necessary and typical interactions between UW's
*[[NL Dictionary]], or NL<sup>dic</sup>: a list of natural language entries and their corresponding features
*[[NL Memory]]: a list of typical interactions between natural language entries
*[[UNL-NL Dictionary]], or UNL-NL<sup>dic</sup>: a list of lexical mappings between UNL and a given natural language
*[[UNL-NL Memory]]: a list of mappings between UNL and a given natural language


In the UNL framework, there are two major types of lexical databases: monolingual and bilingual.<br /><br />
== [[Grammar]]s ==
The monolingual resources are the following:
''Main article: [[Grammar]]''
*in UNL
**The [[UNL Dictionary]], or simply UNL<sup>dic</sup>, which is a flat list of UWs along with non-relational semantic features
**The [[UNL Ontology]], which is a hierarchy of UWs
**The [[UNL Knowledge Base]], or simply UNL<sup>kb</sup>, which is a network with systematic relations between UWs
**The [[UNL Example Base]], or simply UNL<sup>eb</sup>, which is a network with episodic relations between UWs
*in natural language
**The [[NL Dictionary]], or simply NL<sup>dic</sup>, which is a list of natural language headwords
**The [[NL Collocations Dictionary]], or simply NL<sup>cd</sup>, which is a list of collocations and
The bilingual resources are the following:
*The [[UNL-NL Dictionary]], or simply UNL-NL<sup>dic</sup>, which is a list of systematic lexical mappings between UNL and a given natural language
*The [[UNLization Memory]], or UNL Memory Base, or simply UNL<sup>MB</sup>, which is a list of mappings between UNL and a given natural language


== Grammars ==
*UNL-NL T-Grammar: used for natural language generation
Sets of rules to UNL-ize natural language documents and to NL-ize UNL documents, to be provided according to the [[Grammar Specs]].
*UNL-NL D-Grammar: used for improving the results of the UNL-NL T-Grammar
*NL-UNL T-Grammar: used for natural language analysis
*NL-UNL D-Grammar: used for tokenization and for improving the results of the NL-UNL T-Grammar


== Corpora ==
== [[Corpus|Corpora]] ==
Documents written in UNL, to be provided according to the [[UNL document|UNL document structure]]
''Main article: [[Corpus]]''


*[[Corpora|UNL Corpora]]
*UNL Corpus: documents written in UNL, to be provided according to the [[UNL document|UNL document structure]]
*NL Corpus: documents written in natural language

Latest revision as of 18:43, 21 September 2012

The UNL System comprises three different types of language resources: lexica, grammars and corpora. These resources have been provided mainly through the UNLarium.

Lexica

Main article: Lexica

  • UNL Dictionary, or UNLdic: a flat list of UW's and their corresponding semantic features
  • UNL Ontology: a hierarchy of UW's
  • UNL Knowledge Base, or UNLKB: a network of necessary interactions between UW's
  • UNL Memory: a network of necessary and typical interactions between UW's
  • NL Dictionary, or NLdic: a list of natural language entries and their corresponding features
  • NL Memory: a list of typical interactions between natural language entries
  • UNL-NL Dictionary, or UNL-NLdic: a list of lexical mappings between UNL and a given natural language
  • UNL-NL Memory: a list of mappings between UNL and a given natural language
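
The shapes described above (a flat list, a hierarchy, a list of mappings) can be pictured with a minimal Python sketch. Everything in it is an assumption made for illustration: the class and function names are hypothetical, the UW strings and the French entries are made-up placeholders rather than real UNL data, and the actual storage format used by the UNLarium is not shown here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UNLEntry:
    """One row of a UNL Dictionary: a UW plus its semantic features (hypothetical shape)."""
    uw: str
    features: tuple = ()


# UNL Dictionary: a flat list of UWs and their features (placeholder data).
unl_dictionary = [
    UNLEntry("animal", ("noun",)),
    UNLEntry("dog", ("noun",)),
]

# UNL Ontology: a hierarchy of UWs, stored here as child -> parent (placeholder data).
unl_ontology = {"dog": "animal"}

# UNL-NL Dictionary: a list of lexical mappings between a UW and an entry
# of one natural language (placeholder data).
unl_nl_dictionary = [("dog", "chien"), ("animal", "animal")]


def ancestors(uw, ontology):
    """Walk up the hierarchy and return the parents of `uw`, nearest first."""
    chain = []
    while uw in ontology:
        uw = ontology[uw]
        chain.append(uw)
    return chain


def nl_entries_for(uw, mappings):
    """Return every natural language entry mapped to `uw`."""
    return [nl for source, nl in mappings if source == uw]
```

With this toy data, `ancestors("dog", unl_ontology)` walks one step up the hierarchy and `nl_entries_for("dog", unl_nl_dictionary)` returns the mapped entries. The knowledge base and memory resources would need a graph of interactions rather than a list and are left out of the sketch.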

Grammars

Main article: Grammar

  • UNL-NL T-Grammar: used for natural language generation
  • UNL-NL D-Grammar: used for improving the results of the UNL-NL T-Grammar
  • NL-UNL T-Grammar: used for natural language analysis
  • NL-UNL D-Grammar: used for tokenization and for improving the results of the NL-UNL T-Grammar
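
The four grammars listed above vary along two axes, the direction (UNL-NL or NL-UNL) and the kind (T or D). The following Python sketch only tabulates that classification; the enum and function names are hypothetical and it does not model how the grammars themselves are written or applied.

```python
from enum import Enum


class Direction(Enum):
    UNL_TO_NL = "UNL-NL"
    NL_TO_UNL = "NL-UNL"


class Kind(Enum):
    T = "T-Grammar"
    D = "D-Grammar"


# Purpose of each grammar, copied from the list above.
PURPOSE = {
    (Direction.UNL_TO_NL, Kind.T): "natural language generation",
    (Direction.UNL_TO_NL, Kind.D): "improving the results of the UNL-NL T-Grammar",
    (Direction.NL_TO_UNL, Kind.T): "natural language analysis",
    (Direction.NL_TO_UNL, Kind.D): "tokenization and improving the results of the NL-UNL T-Grammar",
}


def grammars_for(direction):
    """Return the names of the grammars that work in the given direction."""
    return [f"{d.value} {kind.value}" for (d, kind) in PURPOSE if d is direction]
```

For example, `grammars_for(Direction.NL_TO_UNL)` lists the two grammars involved when going from natural language to UNL.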

Corpora

Main article: Corpus

  • UNL Corpus: documents written in UNL, to be provided according to the UNL document structure
  • NL Corpus: documents written in natural language