UNL Reference Corpus

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
 
Line 1: Line 1:
 
The UNL Reference Corpus (UC) is the corpus used to prepare and to assess grammars for sentence-based [[NLization]]. It is divided in 6 different levels according to the [[FoR-UNL|Framework of Reference for UNL (FoR-UNL)]]:
 
The UNL Reference Corpus (UC) is the corpus used to prepare and to assess grammars for sentence-based [[NLization]]. It is divided in 6 different levels according to the [[FoR-UNL|Framework of Reference for UNL (FoR-UNL)]]:
*[[UC-A1]]: UNL Reference Corpus A1 (500 isolated sentences with very simple semantic structures)
+
*[[UC-A1]]: UNL Reference Corpus A1 (100 isolated sentences with very simple semantic structures)
*[[UC-A2]]: UNL Reference Corpus A2 (5 very short stories)
+
*[[UC-A2]]: UNL Reference Corpus A2 (300 isolated sentences with very simple semantic structures)
*UC-B1: UNL Reference Corpus B1
+
*[[UC-B1]]: UNL Reference Corpus B1 (5 short stories)
 
*UC-B2: UNL Reference Corpus B2
 
*UC-B2: UNL Reference Corpus B2
 
*UC-C1: UNL Reference Corpus C1
 
*UC-C1: UNL Reference Corpus C1
 
*UC-C2: UNL Reference Corpus C2
 
*UC-C2: UNL Reference Corpus C2

Latest revision as of 18:40, 19 October 2012

The UNL Reference Corpus (UC) is the corpus used to prepare and to assess grammars for sentence-based NLization. It is divided in 6 different levels according to the Framework of Reference for UNL (FoR-UNL):

  • UC-A1: UNL Reference Corpus A1 (100 isolated sentences with very simple semantic structures)
  • UC-A2: UNL Reference Corpus A2 (300 isolated sentences with very simple semantic structures)
  • UC-B1: UNL Reference Corpus B1 (5 short stories)
  • UC-B2: UNL Reference Corpus B2
  • UC-C1: UNL Reference Corpus C1
  • UC-C2: UNL Reference Corpus C2
Software