Projects

From UNL Wiki

Revision as of 13:59, 10 February 2014 by Martins (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

The UNL Program is organized in many different projects leading to the development of the language and computational resources required by the UNL System. The projects can be open or closed, and funded or non-funded, depending on the language and on the scope. Most projects involving the development of language resources follow the flow defined by the FoR-UNL, and range from A1 (most basic level) to C2 (most advanced level). Software development projects follow the itinerary defined in the UNDL Foundation Road Map.

Language Resources (Lingware)

There are three different types of projects dealing

Dictionary projects aims at proving entries to the UNL-driven dictionaries
Corpus projects aims at providing corpora for assessing UNL grammars
Memory projects aims at providing further lexical resources for UNL-based systems

List of Active Projects

Title	Type	Open	Funded	Active	Description
BRUNO	AD	YES	YES^[1]	YES	The project BRUNO (Basic Resources for UNlizatiOn) aims at providing NL->UNL (analysis) dictionaries based in the frequency of occurrence of lemmas in the source language.
CORNELIA	AC	YES	YES^[2]	NO	The project CORNELIA aims at UNLizing simple and frequent natural language structures in order to induce NL>UNL grammars.
FRIDA	AD	NO	YES	SOON	The project FRIDA (Français, Rumantsch, Italiano and Deutsch for Analysis) aims at mapping the most frequent lemmas of the official languages of Switzerland (German, French, Italian and Romansh) into UNL.
IGLU	AC	NO	NO	YES	The project IGLU (from GLosses to Unl) intends to map WordNet glosses from English into UNL.
LE PETIT PRINCE	AC and GD	YES	NO	YES	The project Le Petit Prince (or LPP) aims at UNLizing the integral text of Le Petit Prince, a French novel published by Antoine de Saint-Exupéry in 1943. The main goal is to set standards and guidelines for human UNLization, and to test several tools that have been developed at the UNDL Foundation.
LEWIS & SHORT	AD	NO	YES	YES	The project Lewis & Short aims at mapping lemmas extracted from the Lewis & Short Latin Dictionary (1879) into UNL. The project is coordinated by the UNL Center at the University of Patras, in Greece.
MIR	GD	YES	YES^[3]	YES	The project MIR (Multilingual InfrastRucture) aims at creating UNL->NL (generation) dictionaries based in the WordNet3.0.
NADIA	ND	YES	YES^[4]	YES	The project NADIA (NAtural language Dictionary for UNL-NL mAppings) aims at creating NL dictionaries based in generation dictionaries.
UGO	GC	YES	YES^[5]	YES	The project UGO aims at NLizing simple and frequent UNL structures in order to induce UNL>NL grammars.

List of Past Projects

Title	Type	Open	Funded	Active	Description
CRATYLUS	AC	NO	NO	NO	The project Cratylus aims at UNLizing the integral text of Cratylus (360 BC), written by the Greek philosopher Plato (427? BC-347? BC). Cratylus is one of the most well-known Platonic dialogues, and an outstanding cornerstone in the history of language studies. The text was used mainly to provide some standards for UNLization.
EOLSS	AC and GD	NO	NO	NO	The project EOLSS aims at multilingualizing, via UNL, the content of 30 articles of the Encyclopedia of Water, one of the many encyclopedias of the Encyclopedia of Life Support Systems (EOLSS), an integrated compendium of several encyclopaedias, which attempts to forge pathways between disciplines and to foster the transdisciplinary relations between subjects especially related to the life supporting systems.
LIS	AC and GD	NO	NO	NO	The Library Information System (LIS) is an information retrieval system that aims at performing multilingual search over bibliographical metadata. The main goal of the project is to UNLize a small set of MARC21 records and to provide the resources necessary to generate it into at least five different languages other than Arabic. The project has been developed by the UNL Center at the Library of Alexandria.

How to participate?

Active projects

All the projects dealing with language resources are developed within the UNLarium, the UNDL Foundation linguist-friendly crowd-sourcing environment.

Types of UNLarium Projects

The UNLarium hosts three different types of projects:

Dictionary Projects

There are four types of dictionary projects:

GD: Generation (UNL->NL) Dictionary projects aims at mapping UWs into natural language lexical items
AD: Analysis (NL->UNL) Dictionary projects aims at mapping natural language lexical items into UWs
ND: Natural Language Dictionary projects aims at treating entries resulting from GD Dictionary projects
UD: UNL Dictionary projects aims at analyzing, defining and exemplifying UWs

Corpus Projects

There are two types of corpus projects:

GC: Generation (UNL->NL) Corpus projects aims at NL-izing a UNL document
AC: Analysis (NL->UNL) Corpus projects aims at UNL-izing a natural language document

Memory Projects

There are five types of memory projects:

KB: Knowledge Base projects aims at providing entries for the UNL Knowledge Base
UM: UNL Memory projects aims at providing entries for the UNL Memory
NM: NL Memory projects aims at providing entries for the NL Memory
AM: Analysis (NL->UNL) Memory projects aims at mapping translation units into UNL
GM: Generation (UNL->NL) Memory projects aims at UNL segments into natural language expressions

Requisites for participating in UNLarium projects

Project Type	Requisite^[6]
GD	CLEA250
ND	CLEA750
AD	CLEA750
UD	CUP500
GC	CUP1000
AC	CLEA1000 CUP1000
KB	CUP1000
UM	CUP1000
GM	CUP1000
AM	CUP1000

Other projects

LACE

The main goal of the project LACE (Lexical Acquisition from Comparable tExts) is to build language modules out of data automatically extracted from comparable corpora. The results are expected to be incorporated in the architecture of UNL-based systems as supplementary resources for natural language disambiguation, both in analysis and generation, and will be used for improving the performance of applications in machine translation, summarization, information retrieval and semantic reasoning. The project has been developed under the CADMOS consortium (University of Geneva, University of Lausanne and École Politechnique Fédérale de Lausanne), and is supported by the Wilsdorf Foundation.

Notes

↑ Only BRUNO-A1 is funded for all languages.
↑ Only CORNELIA-A1 is funded for all languages.
↑ Only MIR-A1 is funded for all languages.
↑ Only NADIA-A1 is funded for all languages.
↑ Only UGO-A1 is funded for all languages.
↑ As of September 1st, 2013

[0] Only BRUNO-A1 is funded for all languages.

[1] Only CORNELIA-A1 is funded for all languages.

[2] Only MIR-A1 is funded for all languages.

[3] Only NADIA-A1 is funded for all languages.

[4] Only UGO-A1 is funded for all languages.

[5] As of September 1st, 2013

[1]

[2]

[3]

[4]

[5]

[6]

Projects

Contents

Language Resources (Lingware)

List of Active Projects

List of Past Projects

How to participate?

Types of UNLarium Projects

Dictionary Projects

Corpus Projects

Memory Projects

Requisites for participating in UNLarium projects

Other projects

LACE

Notes

Views

Personal tools

Search

UNL

Lingware

Software

UNL Program

Navigation

Toolbox

Print/export