FoR-UNL


Revision as of 19:26, 7 August 2013

FoR-UNL (Framework of Reference for UNL) is a guideline used to describe achievements of natural languages in relation to UNL. It was inspired by the Common European Framework of Reference for Languages (CEFR), and its main goal is to provide a method for assessing the availability and quality of natural language resources inside the UNL System.

Reference Levels

The FoR-UNL classifies languages into three broad divisions, which are subdivided into six levels according to the recall and precision of the corresponding resources:

  • A - Basic Level
    • A1 - Breakthrough or beginner
    • A2 - Waystage or elementary
  • B - Intermediate Level
    • B1 - Threshold or intermediate
    • B2 - Vantage or upper intermediate
  • C - Advanced Level
    • C1 - Effective Operational
    • C2 - Mastery

Descriptors

The descriptors below specify what is required for a language to be classified at each level:

Level   GD (entries)   AD (entries)   ND (entries)   GG (sentences)   AG (sentences)
A1      MIR-A1         BRUNO-A1       NADIA-A1       UC-A1            NC-A1
A2      MIR-A2         BRUNO-A2       NADIA-A2       UC-A2            NC-A2
B1      MIR-B1         BRUNO-B1       NADIA-B1       UC-B1            NC-B1
B2      MIR-B2         BRUNO-B2       NADIA-B2       UC-B2            NC-B2
C1      MIR-C1         BRUNO-C1       NADIA-C1       UC-C1            NC-C1
C2      MIR-C2         BRUNO-C2       NADIA-C2       UC-C2            NC-C2

Where:

  • MIR is a GD (generation dictionary) project
  • BRUNO is an AD (analysis dictionary) project
  • NADIA is an ND (natural language dictionary) project
  • UC is a UNL corpus (to be used in NLization)
  • NC is a natural language corpus (to be used in UNLization)

Assessment

  • A dictionary (GD, AD, ND) is considered completed when all its entries are either valid or reported as underspecified or overspecified.
  • A grammar (GG, AG) is considered completed if its F-measure is equal to or higher than 0.8.
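
The two assessment criteria above can be sketched in code. This is only an illustration under stated assumptions: the page does not say which F-measure variant is meant, so the balanced F-measure (F1, the harmonic mean of precision and recall) is assumed here, and the function names and status labels are hypothetical rather than part of the UNL System.

```python
# Threshold taken from the assessment rule above.
F_MEASURE_THRESHOLD = 0.8

# Hypothetical status labels for dictionary entries; the actual labels
# used inside the UNL System may differ.
ACCEPTED_STATUSES = {"valid", "underspecified", "overspecified"}


def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure (F1), assumed here: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def grammar_completed(precision: float, recall: float) -> bool:
    """A grammar (GG, AG) counts as completed if its F-measure is at least 0.8."""
    return f_measure(precision, recall) >= F_MEASURE_THRESHOLD


def dictionary_completed(entry_statuses) -> bool:
    """A dictionary (GD, AD, ND) counts as completed when every entry is
    valid or has been reported as underspecified or overspecified.
    Note: an empty list is treated as completed by this sketch."""
    return all(status in ACCEPTED_STATUSES for status in entry_statuses)


if __name__ == "__main__":
    print(grammar_completed(precision=0.9, recall=0.75))
    print(dictionary_completed(["valid", "overspecified", "pending"]))
```

For instance, a grammar with precision 0.9 and recall 0.75 has an F1 of roughly 0.82 and passes, while one with precision 0.9 and recall 0.7 (F1 of roughly 0.79) does not.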