Corpus

From UNL Wiki

Revision as of 21:51, 17 September 2012 by Martins (Talk | contribs)


A UNL corpus is a collection of documents written in UNL according to the UNL document structure.

Types

UNL corpora are normally classified according to the UNLization strategy used to produce them:

Fully automatic UNLization (the whole process is carried out by the machine, without any intervention from the human user)
Human-aided machine UNLization (the process is carried out mainly by the machine, with some intervention from the human user, either as a pre-editor or a post-editor, or during the UNLization itself, as in dialogue-based UNLization)
Machine-aided human UNLization (the process is carried out mainly by the human user, with some help from the machine, as in dictionary or memory lookup)
Fully human UNLization (the whole process is carried out by the human user, without any intervention from the machine)
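
The four-way classification above can be read as two questions: who carries out most of the process, and whether the other party intervenes at all. The following is a minimal sketch of that reading in Python. The names `UNLizationStrategy` and `classify` are hypothetical and used only for illustration; they are not part of any UNL tool.

```python
from enum import Enum


class UNLizationStrategy(Enum):
    """The four strategies listed above (hypothetical names, for illustration only)."""
    FULLY_AUTOMATIC = "fully automatic UNLization"
    HUMAN_AIDED_MACHINE = "human-aided machine UNLization"
    MACHINE_AIDED_HUMAN = "machine-aided human UNLization"
    FULLY_HUMAN = "fully human UNLization"


def classify(main_agent: str, other_intervenes: bool) -> UNLizationStrategy:
    """Map (who does most of the work, does the other party intervene) to a strategy.

    main_agent is "machine" or "human"; this two-question encoding is an
    assumption made for the sketch, not a definition taken from the UNL Wiki.
    """
    if main_agent not in ("machine", "human"):
        raise ValueError("main_agent must be 'machine' or 'human'")
    if main_agent == "machine":
        if other_intervenes:
            return UNLizationStrategy.HUMAN_AIDED_MACHINE
        return UNLizationStrategy.FULLY_AUTOMATIC
    if other_intervenes:
        return UNLizationStrategy.MACHINE_AIDED_HUMAN
    return UNLizationStrategy.FULLY_HUMAN
```

For example, a corpus produced by a machine whose output was then post-edited by a person would correspond to `classify("machine", True)`, that is, human-aided machine UNLization.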

UNL Reference Corpus (UC)

The UNL Reference Corpus (UC) is the corpus used to prepare and to assess grammars for sentence-based NLization.

NL Reference Corpus (NC)

The Natural Language Reference Corpus (NC) is the corpus used to prepare and to assess grammars for sentence-based UNLization. It is divided into six levels according to the Framework of Reference for UNL (FRAU).

List of UNL Corpora
