Projects

From UNL Wiki

(Difference between revisions)

Revision as of 19:43, 13 December 2010

MIR

The project MIR (Multilingual Infra-stRucture

IGLU

The project IGLU intends to map WordNet glosses from English into UNL. The project is divided into two main phases: the first one (iGLU#1) addresses a subset of 27,255 synsets and is supposed to be carried out in a predominantly human basis; the second one (iGLU#2) focuses on the remaining 90,404 synsets and it is expected to be mainly automatic. In iGLU#1, linguists are supposed to UNL-ize WordNet definitions through the UNL Editor, a graph-based UNL authoring tool available at the UNLdev. Decisions are stored in a UNL-ization memory, which comprises mappings between lexical items of English and Universal Words. Information on attributes and relations are also encoded. These data will be used in the second phase, when the UNL-ization process is expected to be performed by IAN - the UNDL Foundation Interactive ANalyzer -, under development. IAN requires much less human intervention than the UNL Editor, and it is a first step towards a fully-automatic natural language analysis system. Results of the project iGLU are expected to be used not only in compiling the UNL-ization memory, but also in populating the UNL Knowledge Base, which is an essential part of the architecture of the UNL system. It will improve the quality of word sense disambiguation and enhance the capability of information retrieval and extraction through UNL.

LE PETIT PRINCE

The project Le Petit Prince (or LPP) aims at UNLizing the integral text of Le Petit Prince, a French novel published by Antoine de Saint-Exupéry in 1943. The main goal is to set standards and guidelines for human UNLization, and to test several tools that have been developed at the UNDL Foundation. The resulting UNL document is also planned to be used in the evaluation of UNL-based translations, and as a training material for VALERIE, the Virtual Learning Environment for UNL.

LACE

The main goal of the project LACE is to build language modules out of data automatically extracted from comparable corpora. The results are expected to be incorporated in the architecture of UNL-based systems as supplementary resources for natural language disambiguation, both in analysis and generation, and will be used for improving the performance of applications in machine translation, summarization, information retrieval and semantic reasoning.

CRATYLUS

The project Cratylus aims at UNLizing the integral text of Cratylus (360 BC), written by the Greek philosopher Plato (427? BC-347? BC). Cratylus is one of the most well-known Platonic dialogues, and an outstanding cornerstone in the history of language studies. The text was used mainly to provide some standards for UNLization.

EOLSS

The project EOLSS aims at multilingualizing, via UNL, the content of 30 articles of the Encyclopedia of Water, one of the many encyclopedias of the Encyclopedia of Life Support Systems (EOLSS), an integrated compendium of several encyclopaedias, which attempts to forge pathways between disciplines and to foster the transdisciplinary relations between subjects especially related to the life supporting systems.

LIS

The Library Information System (LIS) is an information retrieval system that aims at performing multilingual search over bibliographical metadata. The main goal of the project is to UNLize a small set of MARC21 records and to provide the resources necessary to generate it into at least five different languages other than Arabic.

@@ Line 1: / Line 1: @@
 == MIR ==
 The project [[MIR]] (Multilingual Infra-stRucture
+== IGLU ==
+The project [[IGLU]] intends to map WordNet glosses from English into UNL. The project is divided into two main phases: the first one (iGLU#1) addresses a subset of 27,255 synsets and is supposed to be carried out in a predominantly human basis; the second one (iGLU#2) focuses on the remaining 90,404 synsets and it is expected to be mainly automatic. In iGLU#1, linguists are supposed to UNL-ize WordNet definitions through the [[UNL Editor]], a graph-based UNL authoring tool available at the [[UNLdev]]. Decisions are stored in a UNL-ization memory, which comprises mappings between lexical items of English and Universal Words. Information on attributes and relations are also encoded. These data will be used in the second phase, when the UNL-ization process is expected to be performed by [[IAN]] - the UNDL Foundation Interactive ANalyzer -, under development. IAN requires much less human intervention than the UNL Editor, and it is a first step towards a fully-automatic natural language analysis system. Results of the project iGLU are expected to be used not only in compiling the UNL-ization memory, but also in populating the UNL Knowledge Base, which is an essential part of the architecture of the UNL system. It will improve the quality of word sense disambiguation and enhance the capability of information retrieval and extraction through UNL.
 == LE PETIT PRINCE ==
+The project [[Le Petit Prince]] (or LPP) aims at UNLizing the integral text of Le Petit Prince, a French novel published by Antoine de Saint-Exupéry in 1943. The main goal is to set standards and guidelines for human UNLization, and to test several tools that have been developed at the UNDL Foundation. The resulting UNL document is also planned to be used in the evaluation of UNL-based translations, and as a training material for [[VALERIE]], the Virtual Learning Environment for UNL.
-The project [[Le Petit Prince]] (or LPP) aims at translating, to the Universal Networking Language (UNL), the integral text of Le Petit Prince, a French novel published by Antoine de Saint-Exupéry in 1943. The main goal is 1) to set standards and guidelines for human UNLization; and 2) to test several tools that have been developed at the UNDL Foundation. The resulting UNL document is also planned to be used in the evaluation of UNL-based translations, and as a training material for [[VALERIE]], the Virtual Learning Environment for UNL.
+== LACE ==
+The main goal of the project [[LACE ]] is to build language modules out of data automatically extracted from comparable corpora. The results are expected to be incorporated in the architecture of UNL-based systems as supplementary resources for natural language disambiguation, both in analysis and generation, and will be used for improving the performance of applications in machine translation, summarization, information retrieval and semantic reasoning.
 == CRATYLUS ==
+The project [[Cratylus]] aims at UNLizing the integral text of  Cratylus (360 BC), written by the Greek philosopher Plato (427? BC-347? BC). Cratylus is one of the most well-known Platonic dialogues, and an outstanding cornerstone in the history of language studies. The text was used mainly to provide some standards for UNLization.
-[http://www.ronaldomartins.pro.br/cratylus The Cratylus Project], supported by the Brazilian agency FAPESP (05/04410-2), aimed at translating, manually, from English into UNL, the integral text of  Cratylus (360 BC), written by the Greek philosopher Plato (427? BC-347? BC). Cratylus is one of the most well-known Platonic dialogues, and an outstanding cornerstone in the history of language studies. The text was used mainly to provide some standards for UNL enconverting ([http://www.ronaldomartins.pro.br/enconverting the English-TO-UNL Enconversion Guidelines]).
 == EOLSS ==
+The project [[EOLSS]] aims at multilingualizing, via UNL, the content of 30 articles of the Encyclopedia of Water, one of the many encyclopedias of the Encyclopedia of Life Support Systems (EOLSS), an integrated compendium of several encyclopaedias, which attempts to forge pathways between disciplines and to foster the transdisciplinary relations between subjects especially related to the life supporting systems.
-[http://www.undlfoundation.org/eolss The UNL-EOLSS Project] aims at multilingualizing, via UNL, the content of 30 articles of the Encyclopedia of Water, one of the many encyclopedias of EOLSS.
-The Encyclopedia of Life Support Systems (EOLSS) is one of the world’s largest on-line publications dedicated to the health, maintenance and future of the web of life on planet Earth. Available at http://www.eolss.net, it is an integrated compendium of several encyclopaedias, which attempts to forge pathways between disciplines and to foster the transdisciplinary relations between subjects especially related to the life supporting systems.
-Being produced by thousands of experts from over 100 countries, EOLSS has been facing some shortcomings related to its knowledge management structure: 1) it is monolingual: all articles have been produced in English; 2) it is unidimensional: articles are not hypertexts (i.e., they do not contain hyperlinks to other texts, except for the section “related chapters”); 3) it is poorly standardized: author’s names, institution’s names, biographical sketches and other metadata are not uniform and are not inter-related.
-In order to improve the access to EOLSS, the UNDL Foundation has proposed to represent it in UNL (i.e., to UNLize the EOLSS data) and to generate it into five different languages other than English: Arabic, French, Japanese, Russian and Spanish.
 == LIS ==
 The Library Information System (LIS) is an information retrieval system that aims at performing multilingual search over bibliographical metadata. The main goal of the project is to UNLize a small set of MARC21 records and to provide the resources necessary to generate it into at least five different languages other than Arabic.

Projects

Revision as of 19:43, 13 December 2010

Contents

MIR

IGLU

LE PETIT PRINCE

LACE

CRATYLUS

EOLSS

LIS

Views

Personal tools

Search

UNL

Lingware

Software

UNL Program

Navigation

Toolbox

Print/export