Lingware: Difference between revisions

From UNLwiki
Jump to navigationJump to search
imported>Admin
No edit summary
imported>Martins
No edit summary
 
(20 intermediate revisions by the same user not shown)
Line 1: Line 1:
The [[UNL System]] comprises three different types of language resources: '''lexica''', '''grammars''' and '''corpora'''. These resources are being provided mainly through the [http://www.undlfoundation.org/unlarium UNLarium], a linguist-friendly dabatase management system to provide dictionary entries and grammar rules to the several language modules participating in the UNL Programme.
The [[UNL System]] comprises three different types of language resources: '''lexica''', '''grammars''' and '''corpora'''. These resources have been provided mainly through the [[UNLarium]].


== Lexica ==
== [[Lexica]] ==
''Main article: [[Lexica]]''
*[[UNL Dictionary]], or UNL<sup>dic</sup>: a flat list of UW's and their corresponding semantic features
*[[UNL Ontology]]: a hierarchy of UW's
*[[UNL Knowledge Base]], or UNL<sup>KB</sup>: a network of necessary interactions between UW's
*[[UNL Memory]]: a network of necessary and typical interactions between UW's
*[[NL Dictionary]], or NL<sup>dic</sup>: a list of natural language entries and their corresponding features
*[[NL Memory]]: a list of typical interactions between natural language entries
*[[UNL-NL Dictionary]], or UNL-NL<sup>dic</sup>: a list of lexical mappings between UNL and a given natural language
*[[UNL-NL Memory]]: a list of mappings between UNL and a given natural language


In the UNL framework, there are two major types of lexical databases: monolingual (in UNL only) and bilingual (in UNL and other natural language). The monolingual resources are:
== [[Grammar]]s ==
* The UNL Dictionary, which is a flat list of UWs
''Main article: [[Grammar]]''
* The UNL Ontology, which is a hierarchy of UWs
* The UNL Knowledge Base, or simply UNLKB, which is a network of UWs
The bilingual resources are the UNL-NL dictionaries, which are used to map UWs and natural language words or expressions.


=== Specs ===
*UNL-NL T-Grammar: used for natural language generation
*UNL-NL D-Grammar: used for improving the results of the UNL-NL T-Grammar
*NL-UNL T-Grammar: used for natural language analysis
*NL-UNL D-Grammar: used for tokenization and for improving the results of the NL-UNL T-Grammar


*The [[UNL Dictionary Specs]]
== [[Corpus|Corpora]] ==
*The [[UNL Ontology Specs]]
''Main article: [[Corpus]]''
*The [[UNL KB Specs]]


=== Databases ===
*UNL Corpus: documents written in UNL, to be provided according to the [[UNL document|UNL document structure]]
 
*NL Corpus: documents written in natural language
*The [[UNL Dictionary]]
*The [[UNL Ontology]]
*The [[UNL Knowledge Base]]
*The [[UNL-NL Dictionaries]]
 
== Grammars ==
Enconversion and deconversion grammars for UNL
 
=== Specs ===
 
*The [[UNL Grammar Specs]]
 
== Corpora ==
Documents written in UNL
 
=== Specs ===
 
*The [[UNL document|UNL Document Structure]]
 
=== Databases ===
 
*[[Corpora|UNL Corpora]]

Latest revision as of 18:43, 21 September 2012

The UNL System comprises three different types of language resources: lexica, grammars and corpora. These resources have been provided mainly through the UNLarium.

Main article: Lexica

  • UNL Dictionary, or UNLdic: a flat list of UW's and their corresponding semantic features
  • UNL Ontology: a hierarchy of UW's
  • UNL Knowledge Base, or UNLKB: a network of necessary interactions between UW's
  • UNL Memory: a network of necessary and typical interactions between UW's
  • NL Dictionary, or NLdic: a list of natural language entries and their corresponding features
  • NL Memory: a list of typical interactions between natural language entries
  • UNL-NL Dictionary, or UNL-NLdic: a list of lexical mappings between UNL and a given natural language
  • UNL-NL Memory: a list of mappings between UNL and a given natural language

Main article: Grammar

  • UNL-NL T-Grammar: used for natural language generation
  • UNL-NL D-Grammar: used for improving the results of the UNL-NL T-Grammar
  • NL-UNL T-Grammar: used for natural language analysis
  • NL-UNL D-Grammar: used for tokenization and for improving the results of the NL-UNL T-Grammar

Main article: Corpus

  • UNL Corpus: documents written in UNL, to be provided according to the UNL document structure
  • NL Corpus: documents written in natural language